1

Process Reliability Manager Jobs in Washington, DC

Site Reliability Engineer

Sterling, VA ยท On-site

$56.50 - $75/hr

Leverage operational data to automate systems administration, operations and incident response processes to improve enterprise reliability to manage IT environment complexity. * Works with LSA, Lab ...

The SRE will help build resilient systems that scale, automate manual processes, manage fleetwide configurations, and ensure robust system monitoring. The selected candidate will support operations ...

The SRE will help build resilient systems that scale, automate manual processes, manage fleetwide configurations, and ensure robust system monitoring. The selected candidate will support operations ...

Site Reliability Engineer

Herndon, VA ยท Remote

$70 - $75/hr

Support and maintain cloud-native analytics platforms by troubleshooting production issues, improving operational reliability, and managing infrastructure, automation, and CI/CD processes. Day-to-Day ...

Automate and improve development, testing, deployment, and release processes. * Testing and ... Design and implement infrastructure-as-code (IaC) to provision and manage cloud resources (e.g ...

Ensure processes meet aerospace and defense quality and reliability expectations. * Build and standardize engineering management systems, workflows, and escalation processes to improve organizational ...

Site Reliability Engineer - Hybrid

Reston, VA ยท On-site

$59.25 - $78.75/hr

... process: 2 rounds. First round would be a video interview. Second round would be an in-person interview Manager's call notes * This is an SRE role. SRE is under a shared services team within Fannie ...

Ensure processes meet aerospace and defense quality and reliability expectations. * Build and standardize engineering management systems, workflows, and escalation processes to improve organizational ...

Site Reliability Engineer

Mclean, VA ยท On-site

$125K - $200K/yr

Automate and improve development, testing, deployment, and release processes. * Testing and ... Design and implement infrastructure-as-code (IaC) to provision and manage cloud resources (e.g ...

Site Reliability Engineer

Mclean, VA ยท On-site

$125K - $200K/yr

Automate and improve development, testing, deployment, and release processes. * Testing and ... Design and implement infrastructure-as-code (IaC) to provision and manage cloud resources (e.g ...

Site Reliability Engineer

Washington, DC ยท On-site

$114K - $190K/yr

Manage incident response, root cause analysis, and post-mortem processes for the AI platform ... , DevOps, or production operations. * Extensive experience with cloud-native infrastructure ...

next page

Showing results 1-20

Process Reliability Manager information

See Washington, DC salary details

$70.2K

$133.1K

$190.8K

How much do process reliability manager jobs pay per year?

As of Jun 9, 2026, the average yearly pay for process reliability manager in Washington, DC is $133,066.00, according to ZipRecruiter salary data. Most workers in this role earn between $107,000.00 and $158,600.00 per year, depending on experience, location, and employer.

What is a Process Reliability Manager?

A Process Reliability Manager is a professional responsible for ensuring that manufacturing or production processes operate efficiently, consistently, and with minimal downtime. They analyze process data, identify areas for improvement, and implement strategies to enhance equipment reliability and overall process performance. By collaborating with maintenance, engineering, and operations teams, they help reduce failures, optimize productivity, and maintain quality standards. Their work is crucial for minimizing costs and ensuring that production targets are met safely and reliably.

What is the difference between Process Reliability Manager vs Maintenance Engineer?

AspectProcess Reliability ManagerMaintenance Engineer
CertificationsReliability certifications, Six Sigma, PMPMechanical/Electrical certifications, HVAC, PLC certifications
Work EnvironmentManufacturing plants, industrial facilitiesFactories, equipment maintenance sites
Industry UsageFocus on reliability, uptime, and process optimizationFocus on equipment repair, preventive maintenance

The Process Reliability Manager primarily focuses on improving equipment reliability and process efficiency through data analysis and strategic planning. In contrast, Maintenance Engineers handle the hands-on repair and maintenance of machinery. Both roles are essential in manufacturing, but the Process Reliability Manager emphasizes proactive reliability strategies, while Maintenance Engineers focus on reactive and preventive maintenance tasks.

How does a Process Reliability Manager typically collaborate with maintenance and production teams to achieve operational goals?

A Process Reliability Manager works closely with both maintenance and production teams to identify areas of improvement in equipment reliability and process efficiency. This often involves facilitating cross-functional meetings, analyzing downtime data, and implementing preventive maintenance strategies. Clear communication and teamwork are key, as the role requires aligning the objectives of different departments to minimize unplanned outages and optimize production output. By fostering a proactive culture and sharing best practices, the Process Reliability Manager helps ensure the plant operates smoothly and efficiently.

What are the key skills and qualifications needed to thrive as a Process Reliability Manager, and why are they important?

To thrive as a Process Reliability Manager, you need a strong background in engineering, process optimization, and reliability analysis, often supported by a degree in engineering and experience in manufacturing or industrial settings. Familiarity with reliability-centered maintenance (RCM), root cause analysis tools, and data analysis software such as SAP or Maximo is typically required. Exceptional problem-solving, leadership, and communication skills help drive cross-functional teams and foster a culture of continuous improvement. These skills are crucial to ensure equipment reliability, minimize downtime, and optimize operational efficiency within complex production environments.
What job categories do people searching Process Reliability Manager jobs in Washington, DC look for? The top searched job categories for Process Reliability Manager jobs in Washington, DC are:

Site Reliability Engineer

Nightwing

Sterling, VA โ€ข On-site

$56.50 - $75/hr

Full-time

Posted 23 days ago


Job description

Nightwing provides technically advanced full-spectrum cyber, data operations, systems integration and intelligence mission support services to meet our customers' most demanding challenges. Our capabilities include cyber space operations, cyber defense and resiliency, vulnerability research, ubiquitous technical surveillance, data intelligence, lifecycle mission enablement, and software modernization. Nightwing brings disruptive technologies, agility, and competitive offerings to customers in the intelligence community, defense, civil, and commercial markets.
Job Title: Site Reliability Engineer
Location: Sterling, VA
Clearance: TS/SCI Poly
**This position is CONTINGENT upon contract award**
The Site Reliability Engineer (SRE) collaboratively works closely with the contract leadership, Platform teams, and Sponsor to refine the operational and technical strategy to automate key portions of IT operations and enable the Product team (Platform) to bring new software or new features to production as quickly as possible. The SRE executes and analyzes manual IT operations/admin tasks (log analysis, performance tuning, patch management, testing, and incident response) and converts them to automated tasks. The SRE works with the Platform, Network and Data Operations teams to assist in deployment planning and onboard systems. They assist with monitoring, system analysis, and IT operations support. Daily tasks include, but are not limited to:
  • Work with Sponsor, Mission partners, and technical personnel to deliver robust scalable operations architecture that meets the customer goals for the enterprise.
  • Analyze, define, and document requirements for data, workflow, logical processes, hardware and operating system environment, and network connectivity, other system interfaces, internal and external checks and controls, and outputs.
  • Monitor and track metrics, logs and traces across all services in the system/network and provide context for identifying root causes in the event of an incident, performance degradation, or availability issue.
  • Perform Network/Cloud optimization and resilience planning
  • Develop capabilities to automate hardware/software provisioning, monitoring, patching, and troubleshooting.
  • Collaborate with and assist Platform team and leadership in network and security health, intrusions or inappropriate activities.
  • Optimize business processes, workflows, and service operations by building efficient on-call processes and streamlining alerting workflows.
  • Leverage operational data to automate systems administration, operations and incident response processes to improve enterprise reliability to manage IT environment complexity.
  • Works with LSA, Lab Manager, and CM to compose technical documents including Design, Deployment, System specifications and Host Nation baselines, updates, user's manuals, training materials, installation guides, proposals, and reports.
  • Work with the OM to implement ITSM best practices for ICA/Service discrepancy and reporting, issue resolution and operations support to include Tier 2/3 escalation.

Required Skills:
  • Programming: Proficiency in at least one programming language (e.g., Python, Go, Java, or JavaScript) is essential for automating tasks and developing tools.
  • Linux/Unix Systems Administration: Strong knowledge of Linux/Unix operating systems, including command-line tools and system administration tasks.
  • Networking: Understanding of network protocols, infrastructure, and troubleshooting techniques.
  • Database Management: Familiarity with database technologies and principles.
  • Automation: Experience with automation tools and techniques, such as configuration management (e.g., Ansible, Puppet, Chef) and orchestration (e.g., Kubernetes).
  • Monitoring and Logging: Experience with monitoring tools and logging systems.
  • Problem-Solving: Strong analytical and problem-solving skills to diagnose and resolve system issues.
  • Communication: Ability to communicate technical information clearly and concisely to both technical and non-technical audiences.
  • Collaboration: Ability to work effectively with cross-functional teams, including software developers and operations personnel.

Desired Skills:
  • Cloud Technologies: Experience with cloud platforms (e.g., AWS, Google Cloud, Azure).
  • Containerization: Knowledge of containerization technologies (e.g., Docker, Kubernetes).
  • DevOps Principles: Understanding DevOps principles and practices.
  • Service Level Objectives (SLOs) and Service Level Agreements (SLAs): Experience with defining, tracking, and managing SLOs and SLAs.
  • Data Analysis: Experience with data analysis and visualization tools.

Desired Certs:
  • Global Skill Development Council (GSDC) Site Reliability Engineering (SRE) Foundation Certification (CSREF).
  • AWS Certified SysOps Administrator - Associate.
  • Google Cloud Certified Professional Cloud Architect.
  • Azure Certified Solutions Architect Expert.

At Nightwing, we value collaboration and teamwork. You'll have the opportunity to work alongside talented individuals who are passionate about what they do. Together, we'll leverage our collective expertise to drive innovation, solve complex problems, and deliver exceptional results for our clients.
Thank you for considering joining us as we embark on this new journey and shape the future of cybersecurity and intelligence together as part of the Nightwing team.
Nightwing is An Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status, age or any other federally protected class.