SRE Metrics Analyst Intern

Leidos
San Diego, VA
Job Description
We are seeking a detail-oriented and analytical SRE Metrics Analyst Intern to join our Site Reliability Engineering (SRE) team. In this role, you will be responsible for establishing and managing the collection of metrics related to system performance, reliability, and incidents. You will develop and maintain reporting frameworks to provide actionable insights to stakeholders, driving improvements in our systems and processes. Your work will support the organization’s commitment to delivering high-quality, reliable services.

Requirements

  • Design and implement a comprehensive metrics collection framework that captures key performance indicators (KPIs) related to system reliability and operational efficiency.
  • Analyze collected metrics to identify trends, patterns, and anomalies that impact system reliability and performance.
  • Create regular reports on system performance, reliability, incident response times, and other critical metrics for various stakeholders, including technical teams and management.
  • Work closely with SRE teams to identify their metric needs and ensure alignment with operational goals.
  • Continuously evaluate and enhance the metrics collection and reporting processes to improve data accuracy, relevance, and accessibility.

Benefits

  • Paid Time Off
  • 401k Matching
  • Relocation Assistance
]]>