1

Reliability Engineer Manager Jobs in Washington (NOW HIRING)

SRE Engineer

Arlington, VA · On-site

$65.75 - $87.25/hr

... manage Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets for ... Site reliability engineering, monitoring, automation, incident response, performance optimization ...

Apply derating and thermal management analysis to high-power EW RF components; review junction ... Must have 8+ years of reliability engineering experience, including significant work on RF ...

SRE Engineer

Arlington, VA · On-site

$65.75 - $87.25/hr

... manage Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets for ... Site reliability engineering, monitoring, automation, incident response, performance optimization ...

Site Reliability Engineer

Washington, DC · On-site

$114K - $190K/yr

Manage incident response, root cause analysis, and post-mortem processes for the AI platform ... , DevOps, or production operations. * Extensive experience with cloud-native infrastructure ...

Site Reliability Engineer - Hybrid

Reston, VA · On-site

$59.25 - $78.75/hr

Second round would be an in-person interview Manager's call notes * This is an SRE role. SRE is under a shared services team within Fannie Mae who works with different application teams. So, multi ...

Reliability Engineer

Mclean, VA · On-site

$103K - $130K/yr

The Reliability Engineer will act generally as a member of a design, analysis or review team on ... Coordinate and work closely with other engineering, logistics, financial, and program management ...

Site Reliability Engineer

Washington, DC · On-site

$112K - $179K/yr

The SRE will drive automation initiatives, observability improvements, and incident response ... Define and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs). * Support ...

Site Reliability Engineer

Washington, DC · On-site

$112K - $179K/yr

The SRE will drive automation initiatives, observability improvements, and incident response ... Define and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs). * Support ...

Reliability Engineer

Mclean, VA · On-site

$103K - $130K/yr

The Reliability Engineer will act generally as a member of a design, analysis or review team on ... Coordinate and work closely with other engineering, logistics, financial, and program management ...

The SRE will drive automation initiatives, observability improvements, and incident response ... Define and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs). * Support ...

next page

Showing results 1-20

Reliability Engineer Manager information

How much do SRE managers make in the US?

Reliability Engineer Managers, often called SRE Managers, typically earn between $120,000 and $180,000 annually in the US, depending on experience, location, and company size. They oversee teams responsible for system reliability, incident response, and automation, often requiring skills in cloud platforms, monitoring tools, and leadership. Compensation may also include bonuses and stock options.

What does a Reliability Engineer Manager do?

A Reliability Engineer Manager oversees teams responsible for improving the reliability and performance of systems, machinery, or processes within an organization. They develop maintenance strategies, lead root cause analyses of failures, and implement best practices to minimize downtime and costs. Additionally, they collaborate with other departments to ensure that reliability goals align with business objectives and compliance standards. Their role is crucial in industries such as manufacturing, energy, and technology, where system uptime and safety are critical.

What engineering jobs pay $500,000?

Senior engineering roles such as Reliability Engineer Managers, Petroleum Engineers, and Software Engineering Directors can reach or exceed $500,000 annually, especially with experience, bonuses, and stock options. These positions often require advanced skills, leadership, and industry expertise, typically found in high-demand sectors like energy, technology, and aerospace.

What is the highest salary of SRE?

The highest salary for a Reliability Engineer (SRE) can exceed $200,000 annually in high-demand markets, especially for those with extensive experience, advanced skills in automation and cloud platforms, and leadership responsibilities. Senior SREs or SRE Managers often earn higher compensation, including bonuses and stock options, reflecting their expertise and strategic impact on system reliability.

What are some common challenges Reliability Engineer Managers face when balancing long-term reliability improvements with immediate operational demands?

Reliability Engineer Managers often need to prioritize urgent maintenance issues while also driving long-term reliability initiatives. Balancing these competing demands can be challenging, as immediate equipment failures may require quick fixes that temporarily interrupt ongoing improvement projects. Effective managers work closely with operations, maintenance, and engineering teams to communicate priorities, allocate resources, and implement sustainable solutions that address root causes rather than just symptoms. This role typically involves using data-driven decision-making and fostering a culture of proactive maintenance and continuous improvement.

What are the key skills and qualifications needed to thrive as a Reliability Engineer Manager, and why are they important?

To thrive as a Reliability Engineer Manager, you need a strong background in engineering principles, reliability analysis, and maintenance strategies, typically supported by a degree in engineering and experience in reliability roles. Familiarity with reliability-centered maintenance (RCM), failure mode and effects analysis (FMEA), and asset management software such as SAP or Maximo is common, along with certifications like Certified Reliability Engineer (CRE). Leadership, problem-solving, and effective communication are vital soft skills for managing teams and driving cross-functional initiatives. These competencies are crucial for minimizing downtime, optimizing equipment performance, and ensuring long-term operational efficiency.

What is the difference between Reliability Engineer Manager vs Reliability Engineer?

AspectReliability EngineerReliability Engineer Manager
Required CredentialsBachelor's in Engineering or related field; certifications like CRC, CRESame as Reliability Engineer, plus leadership experience
Work EnvironmentDesign, analyze, and improve system reliability; often in teamsOversees Reliability Engineers; manages projects and teams
Employer & Industry UsageManufacturing, aerospace, energy, automotiveSame industries, with added managerial responsibilities
Common Search & ComparisonFocuses on technical skills and hands-on reliability tasksFocuses on leadership, team management, and strategic planning

The main difference between a Reliability Engineer and a Reliability Engineer Manager lies in their responsibilities. The Reliability Engineer focuses on technical analysis and system improvements, while the Reliability Engineer Manager oversees teams, manages projects, and develops strategies to enhance reliability across the organization.

What is a reliability engineering manager?

A reliability engineering manager oversees teams responsible for ensuring the dependability and performance of equipment, systems, or products. They develop maintenance strategies, analyze failure data, and implement improvements to enhance system uptime, often using tools like FMEA and reliability modeling. Strong leadership, technical expertise, and knowledge of industry standards are essential for this role.
What are the most commonly searched types of Reliability Engineer jobs in Washington? The most popular types of Reliability Engineer jobs in Washington are:
What cities in Washington are hiring for Reliability Engineer Manager jobs? Cities in Washington with the most Reliability Engineer Manager job openings:
SRE Engineer

$65.75 - $87.25/hr

Full-time

Posted yesterday


Job description

Job Summary:
Spatial Front, Inc. is seeking a SRE Engineer to support their growing team. The SRE Engineer will improve the reliability, availability, performance, and operational resilience of mission-critical systems for a federal enterprise program.
Responsibilities:
• Define, implement, and maintain site reliability engineering practices for mission-critical applications and shared services, with emphasis on uptime, resiliency, recoverability, and operational excellence.
• Establish and manage Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets for critical services and environments.
• Implement and maintain monitoring, alerting, and observability solutions for production systems.
• Support production and pre-production operations across development, test, training, staging, and production environments.
• Lead incident response activities, conducting root cause analysis and implementing permanent fixes.
• Support capacity planning, performance analysis, trend monitoring, and scalability planning for enterprise platforms and services.
• Create and maintain runbooks, standard operating procedures, incident playbooks, operational dashboards, and knowledge articles.
• Support high availability, disaster recovery, backup/restore validation, and business continuity activities.
• Develop and implement automation to reduce manual operational toil and improve system reliability.
• Contribute to post-deployment validation, smoke testing, rollback readiness, and environment health checks during releases and maintenance windows.
• Collaborate with teams supporting Oracle/PeopleSoft platforms, integration services, reporting services, and shared enterprise tooling to improve reliability end to end.
• Collaborate with development teams to improve system reliability through design reviews and reliability engineering practices.
• Perform capacity planning and performance optimization for production systems.
• Other duties as assigned.
Qualifications:
Required:
• Bachelor's in Computer Science, Engineering, or related field.
• 5 years software engineering, 3 years site reliability engineering, production support engineering, or platform reliability for enterprise systems, 1 year unix/solaris experience.
• Experience supporting enterprise applications in a high-availability, security-conscious, and compliance-driven environment.
• Experience creating operational documentation, runbooks, and incident response procedures.
• Strong troubleshooting skills across application, middleware, integration, and infrastructure layers.
• Strong verbal and written communication skills, including the ability to work across engineering, security, testing, and program stakeholders.
• Demonstrated expertise in: Site reliability engineering, monitoring, automation, incident response, performance optimization; experienced with UNIX/Solaris.
• Must be a U.S. Citizen.
• Must possess an active Secret security clearance or be able to obtain one.
Preferred:
• DevOps Engineer or equivalent SRE certification.
• Experience supporting environments subject to RMF, STIG, audit, ATO, or similar compliance requirements.
• Experience with Splunk, enterprise monitoring/observability tooling, or similar operational analytics platforms.
• Experience supporting Oracle-based enterprise environments, including Oracle middleware, Oracle Database, or related platform services.
• Experience supporting PeopleSoft or similarly complex ERP / HCM / payroll platforms.
• Exposure to F5, Oracle Data Guard, Oracle GoldenGate, Kafka, or other enterprise integration / traffic / replication technologies.
• Familiarity with scripting and automation using tools such as Shell, Python, or PowerShell.
• Knowledge of DevOps, testing and scanning tools esp. within PeopleSoft environment such as PHIRE, PFT, Tricentis, Palo Alto, CAST etc.
• Experience as an SRE supporting DoD or federal agency programs.
• Familiarity with UNIX/Solaris administration and systems programming.
• Experience with observability platforms such as Prometheus, Grafana, Datadog, or Splunk.
Company:
SFI effectively delivers the right Information Technology solutions and Business Support services using thoughtful analysis, strategic planning and precise execution. Founded in 2008, the company is headquartered in Mc Lean, USA, with a team of 501-1000 employees. The company is currently Late Stage.