1

Reliability Engineering Jobs (NOW HIRING)

Manager, Site Reliability Engineering

Seattle, WA · Remote

$58.25 - $77.50/hr

You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity. This role combines ...

Site Reliability Engineering

Philadelphia, PA · On-site

$57.50 - $76.50/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

Manager, Site Reliability Engineering

Denver, CO · Remote

$58.25 - $77.50/hr

You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity. This role combines ...

We are seeking a hands-on Reliability Engineering leader to establish, own, and execute reliability engineering for an InP-based III-V photonic semiconductor manufacturing operation producing optical ...

The SRE Manager owns two things: keeping RapidSOS's cloud infrastructure running reliably, and helping product teams get to a place where they can run their own services without routing every ...

Site Reliability Engineering Manager

Boston, MA · On-site

$62 - $82.25/hr

The SRE Manager owns two things: keeping RapidSOS\'s cloud infrastructure running reliably, and helping product teams get to a place where they can run their own services without routing every ...

Manager, Site Reliability Engineering

Irving, TX · Remote

$58.25 - $77.50/hr

You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity. This role combines ...

Manager, Site Reliability Engineering

Middleton, WI · Remote

$58.25 - $77.50/hr

You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity. This role combines ...

Site Reliability Engineering

Phoenix, AZ · On-site

$56.50 - $75.25/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

$53.75 - $71.50/hr

Lead and grow the SRE team , setting direction, mentoring and managing engineers, and fostering excellence. * Design and manage cloud and containerized infrastructure with IaC (Terraform)

Site Reliability Engineering

San Francisco, CA

$67.25 - $89.25/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

We are seeking a handson Reliability Engineering leader to establish, own, and execute reliability engineering for an InPbased IIIV photonic semiconductor manufacturing operation producing optical ...

Site Reliability Engineering

Dallas, TX · On-site

$56.50 - $75/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

Site Reliability Engineering

New York, NY

$62.25 - $82.75/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

Site Reliability Engineering

Plano, TX · On-site

$54.50 - $72.50/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

Site Reliability Engineering

Sunnyvale, CA · On-site

$67 - $89/hr

Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles * Scale systems sustainably through automation to improve reliability and velocity * Assist with all ...

next page

Showing results 1-20

Reliability Engineering information

See salary details

$61K

$118K

$141K

How much do reliability engineering jobs pay per year?

As of Jun 14, 2026, the average yearly pay for reliability engineering in the United States is $117,973.00, according to ZipRecruiter salary data. Most workers in this role earn between $102,500.00 and $129,000.00 per year, depending on experience, location, and employer.

What engineers make $500,000?

Senior reliability engineers with extensive experience, advanced certifications, and expertise in systems optimization can earn $500,000 or more annually, especially in high-demand industries like aerospace, technology, or energy. Achieving this level often requires leadership roles, specialized skills, and working in competitive markets.

What are the key skills and qualifications needed to thrive as a Reliability Engineer, and why are they important?

To thrive as a Reliability Engineer, you need a solid background in engineering principles, data analysis, and problem-solving, often supported by a degree in engineering or a related field. Familiarity with reliability analysis tools (such as Weibull Analysis, FMEA), statistical software, and certifications like Certified Reliability Engineer (CRE) are typically required. Strong communication, attention to detail, and collaborative skills help Reliability Engineers effectively work with cross-functional teams and convey technical findings. These skills and qualifications are crucial for minimizing system failures, optimizing maintenance strategies, and ensuring long-term operational efficiency.

What is reliability engineering?

Reliability engineering is a field of engineering that focuses on ensuring systems, components, or products consistently perform their intended function without failure over a specified period. Reliability engineers analyze data, design tests, and implement processes to improve product durability and reduce the likelihood of failures. Their work often spans across product design, manufacturing, and maintenance to enhance safety, performance, and customer satisfaction.

What is the difference between Reliability Engineering vs Maintenance Engineering?

AspectReliability EngineeringMaintenance Engineering
FocusDesigning systems for reliability and minimizing failuresMaintaining and repairing existing equipment to ensure operational uptime
CertificationsReliability Engineering certifications, Six Sigma, engineering degreesMaintenance certifications, HVAC, electrical, or mechanical licenses
Work EnvironmentDesign offices, project planning, analysis labsOn-site equipment, repair shops, plant floors
Industry UsageManufacturing, aerospace, energy, oil & gasManufacturing, facilities management, utilities

Reliability Engineering focuses on designing and improving systems to prevent failures, while Maintenance Engineering emphasizes repairing and maintaining equipment to ensure continuous operation. Both roles are essential in industries like manufacturing and energy, but they differ in their approach—one is proactive, the other reactive.

How does a Reliability Engineer typically collaborate with cross-functional teams to improve system reliability?

Reliability Engineers work closely with design, maintenance, operations, and quality teams to identify potential failure points and implement preventive measures. They often facilitate root-cause analyses after incidents, share findings with relevant departments, and help develop long-term solutions to recurring problems. Collaboration involves regular meetings, data sharing, and coordinating reliability improvement projects to ensure that all teams are aligned in enhancing system performance and reducing downtime.

What does a reliability engineer do?

A reliability engineer is responsible for ensuring that systems, equipment, or products perform consistently and without failure over time. They analyze failure data, develop maintenance strategies, and implement improvements using tools like FMEA and root cause analysis to enhance reliability and reduce downtime.

What engineers make $300,000 a year?

Senior reliability engineers, especially those with extensive experience, specialized skills, and certifications, can earn $300,000 or more annually. High compensation is often associated with roles in industries like aerospace, oil and gas, or technology, where advanced knowledge of systems, data analysis, and tools like MATLAB or Python are valued. Salary levels depend on location, company size, and individual expertise.

What is a SRE engineer's salary?

The salary for a Site Reliability Engineer (SRE) typically ranges from $90,000 to $150,000 annually, depending on experience, location, and company size. SREs often have skills in cloud platforms, automation, and monitoring tools, which can influence compensation levels.
More about Reliability Engineering jobs
What cities are hiring for Reliability Engineering jobs? Cities with the most Reliability Engineering job openings:
What states have the most Reliability Engineering jobs? States with the most job openings for Reliability Engineering jobs include:

Manager, Site Reliability Engineering

Paradigm

Seattle, WA • Remote

$58.25 - $77.50/hr

Full-time

Posted 29 days ago


Job description

Paradigm is a software company transforming the way that the residential, construction & building product industries operate across the globe. We are looking for a Manager, Site Reliability Engineering to be part of revolutionizing these industries.

We're looking for a hands-on SRE leader to build and develop a high-performing team that oversees reliability across our Azure-based platform. You'll promote modern SRE practices, drive down incident response times, and shape a culture where automation replaces toil and every incident becomes a learning opportunity.

This role combines technical depth with people leadership. You'll design reliability frameworks, lead incident response, coach engineers, and partner with product teams to embed reliability into everything we build. Working closely with the Senior Director of SRE & Cloud Operations, you'll transform reactive operations into proactive, data-driven service management with increasing use of AI and automation to get there faster.

What You Will Do:

  • Lead and grow a team of site reliability engineers. Provide guidance, mentorship, and career development.

  • Contribute to and mature SRE practices across production services: SLOs, SLIs, error budgets, toil reduction, and blameless post-mortems that turn incidents into lasting improvements.

  • Oversee the incident management lifecycle end-to-end including detection, response, resolution, post-incident review, and systemic improvement.

  • Design on-call rotations, runbooks, and escalation procedures that balance service reliability with engineer well-being and sustainable work practices.

  • Drive measurable reductions in MTTR and MTTD through improved observability, intelligent automation, and predictive monitoring.

  • Build automation to eliminate manual operational work including provisioning, deployment, scaling, self-healing, and reporting.

  • Implement chaos engineering practices to validate system resilience and surface weaknesses before they cause outages.

  • Partner with engineering and product teams to embed reliability requirements into the development lifecycle, from design through deployment.

  • Collaborate with the observability team to ensure comprehensive instrumentation, smart alerting, and actionable dashboards across all critical services.

  • Measure, report, and advocate for reliability improvements with both technical and executive stakeholders using data to drive investment decisions.

What You Need to Succeed:

  • Bachelor’s degree in Engineering, or a related field or equivalent experience.

  • 7+ years in site reliability engineering, DevOps, or infrastructure engineering, with at least 1 year in people management (or demonstrated tech lead experience with direct influence over team processes and career growth).

  • Hands-on experience running production systems on Azure (including proficiency with key services such as AKS, App Services, Service Bus, Event Grid, and Azure Monitor) or comparable cloud platforms.

  • Proven track record implementing SRE practices with measurable reliability improvements and familiarity with modern observability platforms (Datadog, Prometheus/Grafana, or equivalent). AI-enhanced observability experience is preferred.

  • Experience leading incident response for high-severity production issues and running effective post-mortems.

  • Strong background in automation, infrastructure as code (Terraform, Bicep, or similar), and systematically eliminating manual operational work.

  • Experience with Kubernetes container orchestration with production-grade operational experience.

  • Ability to automate workflows and build scripts using Python, Bash, PowerShell, or Go.

  • Experience with AI coding assistants and CI/CD systems (GitHub Actions, Azure DevOps, ArgoCD) with automation capabilities is preferred.

  • Knowledge of distributed systems patterns is preferred.

  • Exposure to AIOps platforms or using LLMs for operational automation is preferred.

  • Strong communication with the ability to make complex technical issues clear for both engineers and executives.

  • Data-driven approach. You use metrics and telemetry to guide decisions, not gut feel.

  • You are collaborative cross-functionally and build trust and alignment naturally.

Ready to Join? Apply now at myparadigm.com/careers/
#Paradigm