Reliability Engineer Manager Jobs in Washington (NOW HIRING)

Prepare, maintain, and execute a System Engineering Plan (SEP) for managing all systems ... Perform site reliability engineering to build and maintain a reliable, scalable, and efficient ...

Aerospace Reliability Engineer

Sterling, VA · On-site

$101K - $127K/yr

The Aerospace Reliability Engineer is responsible for leading RM&T activities for Dowty Propellers ... Support system safety management (SMS), providing Engineering inputs to system safety assessments ...

Site Reliability Engineer

McLean, VA · On-site

$125K - $200K/yr

As a Site Reliability Engineer (SRE), you will help design, build, and operate reliable, secure ... Design and implement infrastructure-as-code (IaC) to provision and manage cloud resources (e.g ...

Site Reliability Engineer

McLean, VA · On-site

$125K - $200K/yr

Overview: As a Site Reliability Engineer (SRE), you will help design, build, and operate reliable ... Design and implement infrastructure-as-code (IaC) to provision and manage cloud resources (e.g ...

Design, deploy, and manage AWS infrastructure, including EC2, VPCs, networking, security controls ... Perform Site Reliability Engineering (SRE) functions, including automation of operational tasks ...

Reliability Engineer Manager information

How much do SRE managers make in the US?

Reliability Engineer Managers, often called SRE Managers, typically earn between $120,000 and $180,000 annually in the US, depending on experience, location, and company size. They oversee teams responsible for system reliability, incident response, and automation, often requiring skills in cloud platforms, monitoring tools, and leadership. Compensation may also include bonuses and stock options.

What does a Reliability Engineer Manager do?

A Reliability Engineer Manager oversees teams responsible for improving the reliability and performance of systems, machinery, or processes within an organization. They develop maintenance strategies, lead root cause analyses of failures, and implement best practices to minimize downtime and costs. Additionally, they collaborate with other departments to ensure that reliability goals align with business objectives and compliance standards. Their role is crucial in industries such as manufacturing, energy, and technology, where system uptime and safety are critical.

What engineering jobs pay $500,000?

Senior engineering roles such as Reliability Engineer Managers, Petroleum Engineers, and Software Engineering Directors can reach or exceed $500,000 annually, especially with experience, bonuses, and stock options. These positions often require advanced skills, leadership, and industry expertise, typically found in high-demand sectors like energy, technology, and aerospace.

What is the highest salary of SRE?

The highest salary for a Site Reliability Engineer (SRE) can exceed $200,000 annually in high-demand markets, especially for those with extensive experience, advanced skills in automation and cloud platforms, and leadership responsibilities. Senior SREs and SRE Managers often earn more, with bonuses and stock options on top of base pay, reflecting their expertise and strategic impact on system reliability.

What are some common challenges Reliability Engineer Managers face when balancing long-term reliability improvements with immediate operational demands?

Reliability Engineer Managers often need to prioritize urgent maintenance issues while also driving long-term reliability initiatives. Balancing these competing demands can be challenging, as immediate equipment failures may require quick fixes that temporarily interrupt ongoing improvement projects. Effective managers work closely with operations, maintenance, and engineering teams to communicate priorities, allocate resources, and implement sustainable solutions that address root causes rather than just symptoms. This role typically involves using data-driven decision-making and fostering a culture of proactive maintenance and continuous improvement.

What are the key skills and qualifications needed to thrive as a Reliability Engineer Manager, and why are they important?

To thrive as a Reliability Engineer Manager, you need a strong background in engineering principles, reliability analysis, and maintenance strategies, typically supported by a degree in engineering and experience in reliability roles. Familiarity with reliability-centered maintenance (RCM), failure mode and effects analysis (FMEA), and asset management software such as SAP or Maximo is common, along with certifications like Certified Reliability Engineer (CRE). Leadership, problem-solving, and effective communication are vital soft skills for managing teams and driving cross-functional initiatives. These competencies are crucial for minimizing downtime, optimizing equipment performance, and ensuring long-term operational efficiency.

What is the difference between a Reliability Engineer Manager and a Reliability Engineer?

| Aspect | Reliability Engineer | Reliability Engineer Manager |
|---|---|---|
| Required Credentials | Bachelor's in Engineering or related field; certifications like CRC, CRE | Same as Reliability Engineer, plus leadership experience |
| Work Environment | Design, analyze, and improve system reliability; often in teams | Oversees Reliability Engineers; manages projects and teams |
| Employer & Industry Usage | Manufacturing, aerospace, energy, automotive | Same industries, with added managerial responsibilities |
| Common Search & Comparison | Focuses on technical skills and hands-on reliability tasks | Focuses on leadership, team management, and strategic planning |

The main difference between a Reliability Engineer and a Reliability Engineer Manager lies in their responsibilities. The Reliability Engineer focuses on technical analysis and system improvements, while the Reliability Engineer Manager oversees teams, manages projects, and develops strategies to enhance reliability across the organization.

What is a reliability engineering manager?

A reliability engineering manager oversees teams responsible for ensuring the dependability and performance of equipment, systems, or products. They develop maintenance strategies, analyze failure data, and implement improvements to enhance system uptime, often using tools like FMEA and reliability modeling. Strong leadership, technical expertise, and knowledge of industry standards are essential for this role.

DevSecOps and Site Reliability Engineering (SRE) Technical Director

i4DM

Millersville, MD • On-site, Remote

$55.50 - $73.50/hr

Full-time

Posted 21 days ago


Job description

Description
About Our Team
Our employees thrive in a culture that is fast-paced, collaborative, and ego-free, where innovation and teamwork are encouraged at every level. We provide Federal agencies with immediate access to highly skilled professionals who understand complex mission challenges and deliver efficient, scalable solutions. By continuously investing in talent, technology, and specialized capabilities, we maintain expert teams prepared to support evolving Federal missions through tailored technical solutions and modern service delivery approaches.
We value diverse perspectives and strive to attract talent from all backgrounds. We are seeking professionals who are passionate about technology, mission success, and solving complex operational challenges with creativity and purpose. If you enjoy expanding your technical expertise while supporting impactful Federal initiatives, you will thrive within our organization. Veterans and military spouses are strongly encouraged to apply and bring their valuable experience to our team.
About the Role
We are seeking an experienced and highly motivated DevSecOps and Site Reliability Engineering (SRE) Technical Director to serve as the Contractor's senior technical authority for DevSecOps practices, platform reliability engineering, and automated delivery pipelines supporting VA enterprise healthcare platforms and applications.
In this role, you will provide technical leadership across cloud engineering, CI/CD automation, infrastructure reliability, and secure software delivery practices within a mission-critical, 24x7 enterprise environment. You will work closely with the Program Manager, Maintenance Technical Director, Monitoring & Incident Management teams, and VA stakeholders to ensure platform reliability, scalability, and secure delivery of healthcare application services.
The DevSecOps and SRE Technical Director will drive adoption of modern engineering practices, ensuring alignment with Federal security requirements, architectural standards, and VA governance while improving system performance, resiliency, and delivery efficiency.
RESPONSIBILITIES
DevSecOps & SRE Leadership
  • Serve as the senior technical authority for DevSecOps and Site Reliability Engineering (SRE) practices across platform services and hosted applications.
  • Establish and enforce engineering standards, DevSecOps practices, and reliability frameworks aligned with enterprise architecture and VA requirements.
  • Provide technical leadership and mentorship to DevSecOps and SRE engineering teams.

CI/CD & Automation
  • Oversee design, implementation, and maintenance of CI/CD pipelines supporting secure, automated, and repeatable application delivery.
  • Drive automation of build, test, deployment, and infrastructure provisioning processes using Infrastructure as Code (IaC).
  • Ensure pipelines include automated security testing, quality validation, and compliance controls throughout the software delivery lifecycle.

Platform Reliability & Performance
  • Lead efforts to improve system reliability, scalability, performance, and operational efficiency across mission-critical environments.
  • Define and monitor reliability metrics (e.g., availability, latency, deployment success rates) and drive improvements in service stability.
  • Reduce operational toil through automation and proactive system improvements.

Cloud Engineering & Modernization
  • Support and guide adoption of cloud-native architectures, containerized environments (e.g., Kubernetes), and platform modernization initiatives.
  • Ensure infrastructure is scalable, resilient, and aligned with Federal cloud standards and best practices.
  • Drive continuous improvement of platform capabilities to support evolving healthcare application needs.

Security & Compliance Integration
  • Ensure DevSecOps practices align with Federal security requirements, including NIST, Zero Trust principles, and VA cybersecurity policies.
  • Collaborate with cybersecurity teams to implement secure-by-design practices across infrastructure and application pipelines.
  • Support vulnerability management, secure configuration enforcement, and compliance validation within DevSecOps workflows.

Cross-Functional Collaboration
  • Coordinate with Program Management, Engineering, Architecture, Monitoring, and Incident Management teams to ensure seamless integration of development and operations.
  • Partner with SRE, operations, and monitoring teams to improve observability, incident response, and system resilience.
  • Align DevSecOps and SRE practices with Agile and SAFe delivery methodologies to support continuous delivery.

Incident Support & Continuous Improvement
  • Support incident response activities, including root cause analysis, remediation planning, and implementation of corrective actions.
  • Identify recurring issues and system weaknesses, driving improvements in reliability and deployment practices.
  • Continuously evaluate and enhance engineering processes, tools, and automation frameworks to improve efficiency and system performance.

Requirements
QUALIFICATIONS
  • Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field.
  • 8+ years of experience in DevSecOps, Site Reliability Engineering (SRE), or platform engineering roles supporting enterprise or mission-critical environments.
  • Strong hands-on experience with cloud platforms (AWS preferred), Infrastructure as Code tools (e.g., Terraform), and configuration management tools (e.g., Ansible).
  • Experience with container orchestration platforms (e.g., Kubernetes, EKS, ECS) and cloud-native application architectures.
  • Proven leadership experience guiding engineering teams within Agile or SAFe environments.
  • Experience supporting CI/CD pipelines, automation frameworks, and modern software delivery practices.
  • Strong understanding of system reliability, monitoring, and performance optimization principles.
  • Ability to collaborate across cross-functional teams in high-availability, 24x7 operational environments.
  • Experience scaling SRE practices across large, complex, multi-region cloud environments.
  • Candidates must be eligible to obtain and maintain a Public Trust clearance.

PREFERRED QUALIFICATIONS
  • Experience supporting VA or Federal Government environments, including compliance with Federal cloud and security policies.
  • Experience implementing Zero Trust security principles within DevSecOps pipelines and cloud-native architectures.
  • Familiarity with observability tools, AIOps, and automation-driven reliability engineering practices.
  • Experience supporting large-scale enterprise modernization initiatives or healthcare application platforms.
  • SAFe, DevSecOps, or cloud-related certifications.

About i4DM

Sourced by ZipRecruiter

Industry: Software development
Company size: 11 - 50 employees
Headquarters location: Millersville, MD, US
Year founded: 2002