Reliability Engineering Jobs in Texas (NOW HIRING)

Reliability Engineer Job

Houston, TX

$97K - $123K/yr

Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...

Reliability Engineer Job

Houston, TX · On-site

$97K - $123K/yr

Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...

Reliability Engineering information

Texas salary details: $56.8K, $109.9K (average), $131.4K

How much do reliability engineering jobs pay per year?

As of Jun 14, 2026, the average yearly pay for reliability engineering in Texas is $109,910.00, according to ZipRecruiter salary data. Most workers in this role earn between $95,500.00 and $120,200.00 per year, depending on experience, location, and employer.

What engineers make $500,000?

Senior reliability engineers with extensive experience, advanced certifications, and expertise in systems optimization can earn $500,000 or more annually, especially in high-demand industries like aerospace, technology, or energy. Achieving this level often requires leadership roles, specialized skills, and working in competitive markets.

What are the key skills and qualifications needed to thrive as a Reliability Engineer, and why are they important?

To thrive as a Reliability Engineer, you need a solid background in engineering principles, data analysis, and problem-solving, often supported by a degree in engineering or a related field. Familiarity with reliability analysis tools (such as Weibull Analysis, FMEA), statistical software, and certifications like Certified Reliability Engineer (CRE) are typically required. Strong communication, attention to detail, and collaborative skills help Reliability Engineers effectively work with cross-functional teams and convey technical findings. These skills and qualifications are crucial for minimizing system failures, optimizing maintenance strategies, and ensuring long-term operational efficiency.

What is reliability engineering?

Reliability engineering is a field of engineering that focuses on ensuring systems, components, or products consistently perform their intended function without failure over a specified period. Reliability engineers analyze data, design tests, and implement processes to improve product durability and reduce the likelihood of failures. Their work often spans across product design, manufacturing, and maintenance to enhance safety, performance, and customer satisfaction.
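The idea of performing "without failure over a specified period" is usually expressed as a reliability function R(t), the probability that an item survives to time t. The sketch below assumes a two-parameter Weibull model, one common choice in reliability work. The function name and the shape and scale values are illustrative assumptions, not figures from any data set.

```python
import math


def weibull_reliability(t: float, shape: float, scale: float) -> float:
    """Probability of surviving to time t under a two-parameter Weibull model.

    R(t) = exp(-(t / scale) ** shape)
    shape < 1: failure rate falls over time (early-life failures)
    shape = 1: constant failure rate (exponential model)
    shape > 1: failure rate rises over time (wear-out)
    """
    if t < 0 or shape <= 0 or scale <= 0:
        raise ValueError("t must be >= 0; shape and scale must be > 0")
    return math.exp(-((t / scale) ** shape))


# Illustrative values only: wear-out behaviour (shape 1.5) and a
# characteristic life of 10,000 operating hours.
for hours in (1_000, 5_000, 10_000):
    r = weibull_reliability(hours, shape=1.5, scale=10_000)
    print(f"R({hours} h) = {r:.3f}")
```

At t equal to the scale parameter, R(t) is exp(-1), about 0.368, whatever the shape. That is why the scale parameter is often called the characteristic life.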

What is the difference between Reliability Engineering vs Maintenance Engineering?

| Aspect | Reliability Engineering | Maintenance Engineering |
| --- | --- | --- |
| Focus | Designing systems for reliability and minimizing failures | Maintaining and repairing existing equipment to ensure operational uptime |
| Certifications | Reliability Engineering certifications, Six Sigma, engineering degrees | Maintenance certifications, HVAC, electrical, or mechanical licenses |
| Work Environment | Design offices, project planning, analysis labs | On-site equipment, repair shops, plant floors |
| Industry Usage | Manufacturing, aerospace, energy, oil & gas | Manufacturing, facilities management, utilities |

Reliability Engineering focuses on designing and improving systems to prevent failures, while Maintenance Engineering emphasizes repairing and maintaining equipment to ensure continuous operation. Both roles are essential in industries like manufacturing and energy, but they differ in their approach—one is proactive, the other reactive.

How does a Reliability Engineer typically collaborate with cross-functional teams to improve system reliability?

Reliability Engineers work closely with design, maintenance, operations, and quality teams to identify potential failure points and implement preventive measures. They often facilitate root-cause analyses after incidents, share findings with relevant departments, and help develop long-term solutions to recurring problems. Collaboration involves regular meetings, data sharing, and coordinating reliability improvement projects to ensure that all teams are aligned in enhancing system performance and reducing downtime.

What does a reliability engineer do?

A reliability engineer is responsible for ensuring that systems, equipment, or products perform consistently and without failure over time. They analyze failure data, develop maintenance strategies, and implement improvements using tools like FMEA and root cause analysis to enhance reliability and reduce downtime.

What engineers make $300,000 a year?

Senior reliability engineers, especially those with extensive experience, specialized skills, and certifications, can earn $300,000 or more annually. High compensation is often associated with roles in industries like aerospace, oil and gas, or technology, where advanced knowledge of systems, data analysis, and tools like MATLAB or Python are valued. Salary levels depend on location, company size, and individual expertise.

What is an SRE's salary?

The salary for a Site Reliability Engineer (SRE) typically ranges from $90,000 to $150,000 annually, depending on experience, location, and company size. SREs often have skills in cloud platforms, automation, and monitoring tools, which can influence compensation levels.
Workplace Platforms - Site Reliability Engineer (SRE) Lead - Dallas

Goldman Sachs, Inc.

Dallas, TX

$56.50 - $75/hr

Other

Posted 12 hours ago


Goldman Sachs rating

Company rating: 8.3 out of 10

Based on 25 frontline employees who took The Breakroom Quiz

29th of 141 rated banks


Job description

Team Overview

The Workplace Engineering organization is responsible for the reliability, resilience, and operational integrity of the firm's endpoint compute platforms and services, including:

  • Corporate-owned physical devices
  • Virtual and cloud-hosted desktops
  • Core endpoint services such as device lifecycle management, access and identity integration, profile and session services, and application delivery frameworks

The Endpoint Compute SRE function applies Site Reliability Engineering (SRE) principles to ensure these platforms and services are highly available, observable, scalable, and recoverable, while meeting operational and regulatory expectations.

Role Summary

We are seeking an Endpoint Compute SRE Lead to own reliability engineering and operational excellence across endpoint compute platforms and their foundational services.

This role is focused on systems and services, not applications, and covers the reliability of:

  • Endpoint compute platforms (physical, virtual, cloud desktops)
  • Device and desktop lifecycle services
  • Access and sign-in dependency platforms
  • Profile, policy, and session services
  • Application delivery and execution frameworks (packaging, deployment, availability, not app functionality)

The successful candidate will define service-level objectives, observability strategies, failure models, and operational practices that ensure a predictable and resilient end-user compute experience at enterprise scale.

Job Responsibilities 

Reliability Engineering Across Endpoint Services

  • Own end-to-end reliability of endpoint compute platforms and supporting services
  • Define service boundaries, dependencies, and critical paths from user sign-in through productive desktop use
  • Model failure modes and blast radius across lifecycle, access, and delivery services
  • Drive designs that support graceful degradation and fast recovery

Observability & Telemetry

  • Establish observability standards across endpoint compute services, including:
    • Enrollment and provisioning success rates
    • Access and session establishment health
    • Policy and profile delivery latency/failures
    • Application delivery availability
  • Ensure telemetry enables:
    • Fast incident detection
    • Root cause analysis
    • Proactive trend identification

SLOs, SLIs & Error Budgets

  • Define SLOs and SLIs for key endpoint services (e.g., sign-in success, provisioning time, policy convergence)
  • Implement error budget frameworks to guide change, security control rollout, and platform evolution
  • Use reliability signals to influence platform design and operational priorities
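
As a rough illustration of how an SLO, an SLI, and an error budget relate, here is a minimal sketch for a success-ratio SLI such as sign-in success. The function name, the 99.5% target, and the event counts are hypothetical and are not taken from the posting or from any Goldman Sachs system.

```python
def error_budget(slo_target: float, total_events: int, failed_events: int) -> dict:
    """Summarize error-budget consumption for a success-ratio SLI.

    slo_target    -- required success ratio, e.g. 0.995 for 99.5%
    total_events  -- events observed in the SLO window (e.g. sign-in attempts)
    failed_events -- how many of those events failed
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_events <= 0 or not 0 <= failed_events <= total_events:
        raise ValueError("invalid event counts")

    sli = (total_events - failed_events) / total_events
    # The error budget is the number of failures the SLO tolerates.
    allowed_failures = (1.0 - slo_target) * total_events
    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "budget_consumed": failed_events / allowed_failures,
        "slo_met": sli >= slo_target,
    }


# Hypothetical window: 200,000 sign-in attempts, 600 failures, 99.5% target.
report = error_budget(slo_target=0.995, total_events=200_000, failed_events=600)
print(f"SLI: {report['sli']:.4f}")
print(f"Budget consumed: {report['budget_consumed']:.0%}")
```

A team using this kind of report might slow risky changes as `budget_consumed` approaches 1.0 and resume them once the budget recovers, which is one way an error budget can "guide change".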

Incident, Problem & Resilience Management

  • Lead reliability aspects of incident response involving endpoint compute or services
  • Drive post-incident reviews focused on systemic corrections
  • Identify recurring failure patterns in:
    • Lifecycle flows
    • Access paths
    • Policy or profile delivery
  • Sponsor and track permanent fixes, not workarounds

Operational Excellence & Automation

  • Define and maintain runbooks, playbooks, and escalation models for endpoint services
  • Drive automation to reduce:
    • Manual remediation
    • Repeat incidents
    • Operational toil
  • Influence engineering designs to improve operability and debuggability

Risk & Governance Alignment

  • Partner with Technology Risk and Security teams to:
    • Demonstrate reliability and recoverability controls
    • Support operational risk and resilience assessments
    • Provide audit-ready evidence for availability and incident management
  • Ensure reliability metrics support control effectiveness narratives

Leadership & Collaboration

  • Act as the reliability authority for endpoint compute and services
  • Partner closely with:
    • Endpoint platform engineers
    • Device management teams
    • Security engineering and identity teams
  • Mentor engineers in applying SRE principles to workplace platforms
  • Communicate reliability posture clearly to leadership

Basic Qualifications

  • 8+ years in SRE, platform operations, reliability engineering, or workplace infrastructure roles
  • Strong experience operating endpoint compute platforms and core supporting services at enterprise scale
  • Proven ability to define and implement:
    • Observability frameworks
    • SLOs / SLIs
    • Incident and problem management models
  • Strong systems thinking across lifecycle, access, and service dependencies
  • Excellent documentation and communication skills

Preferred Qualifications

  • Experience applying SRE concepts to end-user computing or digital workplace platforms
  • Deep understanding of:
    • Device lifecycle and provisioning services
    • Identity and access dependencies (availability-focused)
    • Profile, policy, and session orchestration
  • Experience in regulated or high-assurance environments
  • Strong ability to influence architecture using data-driven reliability insights

What Success Looks Like

  • Endpoint compute and services have clear reliability targets
  • Lifecycle, access, and delivery failures are predictable, observable, and fast to remediate
  • Incidents are less frequent, shorter, and less impactful
  • Platforms are designed with operability and resilience built in
  • Leadership has confidence in desktop stability as a service

About Goldman Sachs

Sourced by ZipRecruiter

At Goldman Sachs, we commit our people, capital and ideas to help our clients, shareholders and the communities we serve to grow. Founded in 1869, we are a leading global investment banking, securities and investment management firm. Headquartered in New York, we maintain offices around the world. We believe who you are makes you better at what you do. We're committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally, from our training and development opportunities and firmwide networks to benefits, wellness and personal finance offerings and mindfulness programs.

Industry

Finance and insurance

Company size

10,000+ Employees

Headquarters location

New York, NY, US

Year founded

1869