Senior Site Reliability Engineer, Reliability Team - USDS

TikTok
San Jose, CA
Job Description
The Site Reliability Engineering team at TikTok is seeking a Senior Site Reliability Engineer to ensure the end-to-end reliability of their production ecosystem. Responsibilities include system design & optimization, automation & efficiency, observability & monitoring, disaster recovery & resilience, incident management & response, continuous improvement, and capacity planning.

Requirements

  • Bachelor’s degree in Computer Science, related technical field, or equivalent practical experience.
  • Proficiency in one or more programming languages (e.g., Go, Python, Java, or C++).
  • Strong understanding of Linux system internals, networking (TCP/IP, DNS, Load Balancing), and distributed systems.
  • Experience managing containerized environments (e.g., Kubernetes, Docker).

Benefits

  • Generous Paid Time Off
  • 401k Matching
  • Retirement Plan
]]>