1

Site Reliability Engineer Manager Jobs in Reston, VA

Site Reliability Engineer

Reston, VA

$59.25 - $78.75/hr

Site Reliability Engineer - The Site Reliability Engineer (i.e., SRE ) role is responsible for the ... Red Hat OpenShift, inclusive of operators, routing/ingress, and cluster management * Azure cloud ...

Site Reliability Engineer - Hybrid

Reston, VA · On-site

$59.25 - $78.75/hr

Second round would be an in-person interview Manager's call notes * This is an SRE role. SRE is under a shared services team within Fannie Mae who works with different application teams. So, multi ...

Site Reliability Engineer

Chantilly, VA · On-site

$62K - $141K/yr

Site Reliability Engineer The Opportunity: Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network ...

Site Reliability Engineer

Herndon, VA · On-site

$58.50 - $77.75/hr

Booz Allen Hamilton is seeking a Site Reliability Engineer to enhance system resilience and ... Management activities Company : Booz Allen Hamilton is a consulting firm that specializes in ...

Site Reliability Engineer

Herndon, VA · On-site

$86K - $198K/yr

Site Reliability Engineer The Opportunity: Engineering to make a system more resilient and ... Experience with performing Release Management activities Clearance: Applicants selected will be ...

Site Reliability Engineer

Herndon, VA · On-site

$58.50 - $77.75/hr

Booz Allen Hamilton is seeking a Site Reliability Engineer to enhance system resilience and ... Management activities Company : Booz Allen Hamilton is a consulting firm that specializes in ...

Senior Site Reliability Engineer

Mclean, VA · On-site

$57.50 - $76.50/hr

As an SRE, your primary responsibility is to combine aspects of software engineering with ... Log Management and Analysis tools such as Splunk * Automation and Configuration Management tools ...

next page

Showing results 1-20

Site Reliability Engineer Manager information

See Reston, VA salary details

$11

$66

$95

How much do site reliability engineer manager jobs pay per hour?

As of Jun 9, 2026, the average hourly pay for site reliability engineer manager in Reston, VA is $66.31, according to ZipRecruiter salary data. Most workers in this role earn between $57.02 and $75.77 per hour, depending on experience, location, and employer.

What is a Site Reliability Engineer Manager?

A Site Reliability Engineer (SRE) Manager oversees a team of site reliability engineers tasked with maintaining the reliability, scalability, and performance of software systems. Their role combines leadership and technical expertise, focusing on automating operations, managing incidents, and ensuring high availability of services. They work closely with engineering and operations teams to implement best practices in monitoring, incident response, and system design. SRE Managers also mentor their teams, set reliability goals, and help drive a culture of continuous improvement within the organization.

What is the difference between Site Reliability Engineer Manager vs Site Reliability Engineer?

AspectSite Reliability Engineer (SRE)Site Reliability Engineer Manager
ResponsibilitiesFocuses on designing, implementing, and maintaining reliable systems and automationOversees SRE teams, manages projects, and aligns reliability goals with business objectives
Required SkillsStrong coding, system design, and troubleshooting skillsLeadership, team management, strategic planning
CertificationsGoogle Cloud, AWS certifications, Linux, scriptingSame as SRE, plus management certifications (e.g., PMP) often preferred
Work EnvironmentTechnical, hands-on with systems and automationManagerial, coordinating teams and projects

The main difference is that a Site Reliability Engineer focuses on technical system reliability, while a Site Reliability Engineer Manager oversees teams and strategic initiatives to ensure reliability goals are met across projects.

How does a Site Reliability Engineer Manager typically balance technical leadership with team management responsibilities?

A Site Reliability Engineer Manager often splits their time between overseeing technical projects, such as system reliability improvements and incident response strategies, and managing the growth and well-being of their engineering team. This includes mentoring SREs, facilitating communication between teams, setting priorities, and ensuring that operational goals align with business objectives. Balancing these responsibilities requires strong organizational skills and a proactive approach to both technical challenges and people management. Successful managers regularly engage in hands-on problem-solving while also fostering a collaborative team environment.

What are the key skills and qualifications needed to thrive as a Site Reliability Engineer Manager, and why are they important?

To thrive as a Site Reliability Engineer Manager, you need expertise in systems engineering, incident management, and a strong background in software development or computer science, often supported by a bachelor’s degree or equivalent experience. Familiarity with cloud platforms (like AWS, GCP, or Azure), infrastructure as code tools (such as Terraform), monitoring systems (like Prometheus), and certifications in cloud or DevOps practices are highly valued. Strong leadership, effective communication, and problem-solving abilities help you guide teams and foster collaboration across departments. These skills and qualities ensure the stability, scalability, and reliability of critical systems while enabling teams to respond effectively to complex technical challenges.
What are the most commonly searched types of Site Reliability Engineer jobs in Reston, VA? The most popular types of Site Reliability Engineer jobs in Reston, VA are:
What cities near Reston, VA are hiring for Site Reliability Engineer Manager jobs? Cities near Reston, VA with the most Site Reliability Engineer Manager job openings:
Site Reliability Engineer

$59.25 - $78.75/hr

Other

Posted 19 days ago


Job description

Site Reliability Engineer -

The Site Reliability Engineer (i.e., SRE ) role is responsible for the optimization and reliability of core technical platforms and platform services, and exerting significant technical leadership in the continuous improvement of service reliability to platform stakeholders.

The core technical platform is Red Hat OpenShift, with a variety of platform services to include, but not limited to, Red Hat AMQ, HashiCorp Vault, and Keycloak, that are consumed by various platform stakeholders. This role will span from the OpenShift platform to services provided by Azure.

We re proud of the way our teammates have a positive impact on everything we do. Our employees are committed to and exemplify our Core Values:

  • Integrity through accountability, consistency, transparency and trust
  • Agility through adaptability, continuous improvement, expertise, and flexibility
  • Partnership through collaboration, communication, leadership, and teamwork
  • Inclusivity through diversity, relationships, respect, and suppor

PRINCIPAL RESPONSIBILITIES

  • Maintain overall health and reliability of core technical platforms and platform services to ensure business continuity and high availability.
  • Maintain and improve the end-to-end observability of the platform, to ensure that platform state is at all times understood in context with supporting information and data that can be quickly marshalled into action.
  • Lead incident response, root-cause analysis, and postmortems that advance the overall health of the system and prevent or diminish reoccurrence of platform issues.
  • Partner with development teams to troubleshoot platform issues, to include deployment, routing, and configuration challenges.
  • Build and maintain automated deployment pipelines that support engineering, development and data teams.
  • Write, test, and deploy solutions that reduce unneeded human intervention and improve quality.
  • Lead the delivery of new platform features, services, and capabilities.
  • Prioritize, deliver, and operate new platform capabilities products and services.
  • Develop and maintain accurate and up-to-date documentation, including but not limited to operational procedures, deployment plans, incident response plans.
  • Participate in on-call rotation.
  • Assist with other job duties as assigned.

PRINCIPAL JOB REQUIREMENTS

  • Bachelor's degree in computer science or related field, or equivalent experience.
  • Minimum of 5-7 years of experience in a Site Reliability Engineering and/or Platform Engineering role, with progressively increasing scope of responsibility.
  • Extensive hands-on experience and knowledge of the following technologies:
  • Red Hat OpenShift, inclusive of operators, routing/ingress, and cluster management
  • Azure cloud services and solutions
  • Messaging platforms like AMQ, Kafka, Reddis
  • HashiCorp Vault
  • Scripting languages like Bash, Python, Go, PowerShell
  • Observability tools like Datadog, Grafana, Prometheus
  • Strong scripting and automation skills in Bash, Python.
  • Strong prior experience with observability tools and connecting trends, incidents and alerts with actions.
  • Prior experience troubleshooting complex production issues using logs, metrics, traces, packet captures, and Kubernetes debugging tools.
  • Prior experience working in a heavily audited environment is preferred, with focus on mitigating risks and ensuring compliance with policies and procedures.
  • Knowledge of enterprise-level technologies and concepts.
  • Ability to multi-task in a dynamic environment while continuing to progress on longer term projects.
  • Ability to communicate well, both orally and in writing, including producing thorough documentation of all work.
  • Ability to conduct independent technical research and share results with management and/or peers.
  • Ability to listen and integrate ideas from different views, build and maintain respectful relationships, collaborate with others, and resolve conflicts constructively.
  • Proof of eligibility to work in the United States.