Site Reliability Engineer, Recommendation Infrastructure - USDS

TikTok
San Jose, CA
Category Data Analyst
Job Description
The USDS TikTok Recommendations Infra SRE team works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems. SREs on this team will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures.

Requirements

  • Bachelor's degree or higher in Computer Science or a related field, with at least 2 years of relevant work experience
  • SRE experience deploying large-scale systems with high reliability and scalability
  • Familiarity with Linux system administration and networking
  • Experience programming in at least one of the following languages: Python, Perl, Go, or C/C++
  • Experience in designing, analyzing and troubleshooting large-scale distributed systems
  • Familiarity with common CI/CD practices and environments
  • Effective communication skills and a sense of ownership and drive

Benefits

  • Medical, dental, and vision insurance
  • 401(k) savings plan with company match
  • Paid parental leave
  • Short-term and long-term disability coverage
  • Life insurance
  • Wellbeing benefits
  • 10 paid holidays per year
  • 10 paid sick days per year
  • 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)