Senior Site Reliability Engineer, Reliability Team - USDS

TikTok
San Jose, CA
Job Description
The Site Reliability Engineering (SRE) team at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. As a Site Reliability Engineer, you will be responsible for the end-to-end reliability of our production ecosystem, balancing traditional SRE functions with a focus on disaster recovery and rapid incident response.

Requirements

  • Bachelor’s degree in Computer Science, related technical field, or equivalent practical experience.
  • Proficiency in one or more programming languages (e.g., Go, Python, Java, or C++).
  • Strong understanding of Linux system internals, networking (TCP/IP, DNS, Load Balancing), and distributed systems.
  • Experience managing containerized environments (e.g., Kubernetes, Docker).

Benefits

  • Generous Paid Time Off
  • 401k Matching
  • Retirement Plan
  • Four Day Work Week
  • Generous Parental Leave
  • Tuition Reimbursement
  • Relocation Assistance
]]>