Site Reliability Engineer (SRE)

Bright Vision Technologies
Any Location, IL
Remote
Job Description
Bright Vision Technologies is a software development company looking for a Site Reliability Engineer (SRE) to ensure the availability, performance, and operational excellence of large-scale distributed systems in production.

Requirements

  • Define, instrument, and continually refine service-level objectives (SLOs), service-level indicators (SLIs), and error budgets for critical services
  • Lead incident response and resolution for production issues
  • Design and implement comprehensive monitoring, logging, and tracing strategies
  • Build and maintain robust on-call processes, runbooks, and escalation paths
  • Automate operational toil aggressively by writing production-grade tooling
  • Architect and operate large-scale Kubernetes clusters and container-based workloads
  • Design CI/CD pipelines that promote safe, frequent, and observable releases
  • Lead capacity planning and performance engineering activities
  • Partner closely with application development teams to embed reliability practices early in design

Benefits

  • Competitive base salary commensurate with experience
  • Benefits
  • Paid Time Off
  • 401k Matching
]]>