1

Software Reliability Engineer Jobs (NOW HIRING)

Reliability Engineer

Costa Mesa, CA

$110K - $138K/yr

Anduril's Reliability Engineering organization is seeking an experienced Reliability Engineer to ... You will support the Systems, Hardware, & Software Engineering teams across the product development ...

Site Reliability Engineer

Wilmington, DE · On-site

$55.25 - $73.50/hr

As part of the Site Reliability Engineering team within the Reference Data Engineering group, you'll help build a meaningful engineering discipline, combining software and systems to develop creative ...

Reliability Engineer

Irvine, CA · On-site

$110K - $138K/yr

Reliability Engineer Full Time 40 hours/Week Duration: 12 months and flexible to extend further ... Proficiency in relevant software/tools (reliability modelling software, statistical tools, MS Excel ...

SOFTWARE ENGINEER (FLIGHT RELIAIBLITY) The Flight Reliability software team creates mission critical applications that are used throughout SpaceX to accelerate launch vehicle production and flight as ...

Reliability Engineer

Costa Mesa, CA · On-site

$110K - $138K/yr

Anduril's Reliability Engineering organization is seeking an experienced Reliability Engineer to ... You will support the Systems, Hardware, & Software Engineering teams across the product development ...

Reliability Engineer

Wooster, OH

$89K - $112K/yr

Reliability Engineer General Description The Reliability Engineer supports equipment and process ... Equipment Used Personal computer with appropriate engineering and office software, PLC programming ...

next page

Showing results 1-20

Software Reliability Engineer information

See salary details

$39

$67

$88

How much do software reliability engineer jobs pay per hour?

As of Jun 11, 2026, the average hourly pay for software reliability engineer in the United States is $67.07, according to ZipRecruiter salary data. Most workers in this role earn between $59.13 and $74.52 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Software Reliability Engineer, and why are they important?

To thrive as a Software Reliability Engineer, you need a strong background in software development, system architecture, and incident response, often supported by a degree in computer science or related field. Familiarity with monitoring tools (like Prometheus), cloud platforms (AWS, GCP), automation frameworks, and certifications such as AWS Certified DevOps Engineer are highly valuable. Excellent problem-solving, collaboration, and communication skills help you coordinate effectively during high-pressure situations and with cross-functional teams. These abilities are crucial for maintaining system uptime, quickly resolving outages, and ensuring the overall reliability of critical software services.

What are Software Reliability Engineers?

Software Reliability Engineers (SREs) are IT professionals who focus on ensuring that software systems are reliable, scalable, and maintain high availability. They work at the intersection of software development and IT operations, often automating processes, monitoring system performance, and responding to incidents. SREs use engineering principles to solve operational problems, aiming to reduce downtime and improve user experience. Their responsibilities can include building tools, managing infrastructure, and collaborating with development teams to implement best practices for reliability.

How does a Software Reliability Engineer typically interact with development and operations teams to improve system stability?

Software Reliability Engineers (SREs) work closely with both development and operations teams to ensure that systems are reliable, scalable, and maintainable. They often participate in design reviews, provide input on architectural decisions, and help define service-level objectives. SREs also collaborate with developers to automate deployment processes and create monitoring solutions, and they partner with operations staff to manage incident response and root cause analysis. This collaborative environment enables them to proactively identify potential issues and drive cross-functional improvements.

Will AI replace SRE jobs?

AI can automate certain tasks within a Software Reliability Engineer's role, such as monitoring, alerting, and incident response, but it is unlikely to fully replace SREs. SREs focus on designing systems, managing complex infrastructure, and applying human judgment, which are difficult for AI to replicate entirely. Instead, AI tools are expected to augment SREs' capabilities, allowing them to focus on more strategic and complex issues.

What is the difference between Software Reliability Engineer vs Software Test Engineer?

AspectSoftware Reliability EngineerSoftware Test Engineer
Primary FocusEnsuring software reliability, stability, and performance over timeDesigning and executing tests to identify bugs and verify functionality
Skills & CertificationsKnowledge of reliability engineering, scripting, monitoring toolsTesting methodologies, automation tools, scripting
Work EnvironmentCollaborates with development and operations teams, often in DevOpsWorks primarily in QA/testing teams, often in dedicated testing phases
Industry UsageCommon in software companies focusing on product stabilityWidely used in software development and QA departments

The main difference is that Software Reliability Engineers focus on maintaining long-term software stability and performance, while Software Test Engineers concentrate on identifying bugs through testing. Both roles require technical skills and often collaborate, but their core objectives differ: reliability versus defect detection.

What does a software reliability engineer do?

A software reliability engineer focuses on ensuring software systems are dependable and perform consistently by analyzing failure data, developing testing strategies, and implementing automation tools. They often work with monitoring tools, scripting, and quality assurance processes to identify and mitigate potential issues before deployment.

What engineers make $500,000?

Software Reliability Engineers with extensive experience, specialized skills in automation and testing, and working at large tech companies or in senior leadership roles can earn $500,000 or more annually. High compensation often includes base salary, bonuses, and stock options, especially in competitive markets or executive positions.

Who gets paid more, SRE or DevOps?

Software Reliability Engineers (SREs) and DevOps engineers often have similar salary ranges, but SREs tend to earn slightly higher due to their focus on system reliability, incident management, and advanced monitoring skills. Compensation varies based on experience, certifications, and industry, but both roles are well-compensated in the tech field.
More about Software Reliability Engineer jobs
What cities are hiring for Software Reliability Engineer jobs? Cities with the most Software Reliability Engineer job openings:
Who are the top companies hiring for Software Reliability Engineer jobs? The top employers for Software Reliability Engineer jobs are:
What job categories do people searching Software Reliability Engineer jobs look for? The top searched job categories for Software Reliability Engineer jobs are:
Reliability Engineer

$94K - $118K/yr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 26 days ago


Atlas Roofing Corporation rating

9.1

Company rating: 9.1 out of 10

Based on 6 frontline employees who took The Breakroom Quiz


Job description

Atlas Roofing Corporation, a privately owned company, established in 1982, is an international and multi-divisional manufacturer of roofing, sheathing, facer, and insulation product solutions.
From a single asphalt shingle manufacturing facility in 1982, Atlas has grown to 36 facilities in North America providing worldwide product distribution. Today, products from the company's four major divisions, Polyiso Roof & Wall Insulation, Shingle & Underlayment, Molded Products, and Web Technologies, are manufactured in state-of-the-art facilities and shipped from our network of manufacturing plants and distribution facilities in the United States, Canada, and Mexico. It's amazing how our customer-first philosophy has grown on our clients.
Atlas protects because WE Care! Live, Work, Play!
Atlas Roofing Shingles, Underlayments & Ventilation division is seeking a Reliability Engineer for the Franklin, OH & Clinton, IA plants/facilities. Based in Franklin, OH with travel up to 50% to Clinton, IA.
The Reliability Engineer is responsible for improving the reliability, availability, and maintainability of equipment and systems within the Franklin, OH and Clinton, IA plants via the ARMOR program. ARMOR (Atlas Reliability and Maintenance for Operational Rewards) includes the following: FMEAs/RCAs facilitation/implementation, the CMMS system for PMs and work orders, Predictive maintenance tools, and the overall maintenance workflow management system. This role involves analyzing performance data, identifying trends and failure modes, and implementing solutions to increase efficiency and reduce downtime. The Reliability Engineer works closely with maintenance, operations, and engineering teams to ensure asset reliability and performance.
Reliability Engineer Primary Responsibilities
  • Sustain and drive forward the reliability strategies and programs to improve asset performance.
  • Analyze equipment failures, teach, lead, and perform root cause analysis (RCA), and recommend corrective actions.
  • Teach, lead and perform failure mode and effects analysis (FMEA) and risk assessments.
  • Assist and lead Plants in establishing preventive and predictive maintenance plans.
  • Track key performance indicators (KPIs) such as PM completion rate and compliance, planned maintenance percentage, MTBF (Mean Time Between Failures) and MTTR (Mean Time to Repair).
  • Collaborate with maintenance and operations to optimize maintenance best practices.
  • Support the development of asset life cycle management and spare parts strategies.
  • Use data analysis tools and reliability software to model performance and predict failures.
  • Ensure compliance with safety, environmental, and quality standards.
  • Develop, implement and improve training programs for Maintenance, Reliability, and Operations in the areas of asset care and reliability best practices.
  • Coach and lead the Plants in the development of Operations TPM processes and procedures.
  • Support development and proper use of (CMMS) software.
  • Train personnel on reliability engineering tools and best practices.

Reliability Engineer Qualifications & Experience
  • Knowledge of reliability tools such as FMEA, RCA, RCM, Weibull Analysis.
  • Experience with CMMS (Computerized Maintenance Management Systems) such as eMaint, SAP or Maximo.
  • Strong analytical and problem-solving skills.
  • Excellent communication and teamwork abilities.
  • Experience with change management an dnew process deployment
  • Certification in reliability (CRE, CMRP) is a plus.

Reliability Engineer Knowledge, Skills and Abilities
  • Experience working in a continuous line production facility.
  • Knowledge of Six Sigma or Lean methodologies.
  • Familiarity with vibration analysis, thermography, or other condition monitoring tools.
  • Experience with power drive equipment such as, (belts, chains, rollers, fans, motors, pumps, sprockets, bearings, lubrication) with the ability to assist the plants in reliability related issues, processes, and procedures.

Reliability Engineer Education, Licenses & Certifications
  • Bachelor's degree in mechanical, or related field.
  • 3+ years of experience in a reliability, maintenance, or engineering role within a manufacturing facility.

Total Compensation
  • Atlas Roofing Corporation offers a competitive compensation and vacation/holiday package as well as a comprehensive benefits program including Medical, Dental, Vision, Life/AD&D/LTD insurance, 401k and Medical & Dependent Care Spending Accounts.

Atlas Roofing Corporation is an Equal Employment Opportunity Employer.
No calls or agencies please.
SUW3