Team Overview The Workplace Engineering organization is responsible for the reliability, resilience, and operational integrity of the firm's endpoint compute platforms and services, including:
Team Overview The Workplace Engineering organization is responsible for the reliability, resilience, and operational integrity of the firm's endpoint compute platforms and services, including:
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Reliability Engineer III - Project Based
$92K - $116K/yr
Collaborate with engineering teams to incorporate Design for Reliability (DFR) principles into system development * Establish and monitor reliability metrics to evaluate system performance and ...
Reliability Engineer III - Project Based
$92K - $116K/yr
Collaborate with engineering teams to incorporate Design for Reliability (DFR) principles into system development * Establish and monitor reliability metrics to evaluate system performance and ...
Reliability Engineer III - Project Based
Fort Worth, TX · On-site
$112K - $131K/yr
Collaborate with engineering teams to incorporate Design for Reliability (DFR) principles into system development * Establish and monitor reliability metrics to evaluate system performance and ...
Reliability Engineer III - Project Based
Fort Worth, TX · On-site
$112K - $131K/yr
Collaborate with engineering teams to incorporate Design for Reliability (DFR) principles into system development * Establish and monitor reliability metrics to evaluate system performance and ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Lifecycle Engineering (LCE) is responsible for ensuring our products are Safe, Reliable ... Reliability, System Safety and Supportability. The Senior Principal Reliability Engineer will ...
Reliability Engineer Job
$97K - $123K/yr
Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...
Reliability Engineer Job
$97K - $123K/yr
Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...
... Reliability Engineering, AiDP The Applied Machine Learning team in AI and Data Platform org has been at the forefront of accelerating digital transformation through machine learning across Apple ...
... Reliability Engineering, AiDP The Applied Machine Learning team in AI and Data Platform org has been at the forefront of accelerating digital transformation through machine learning across Apple ...
Reliability Engineer III - Project Based
Fort Worth, TX · On-site
$112K - $131K/yr
Collaborate with engineering teams to incorporate Design for Reliability (DFR) principles into system development * Establish and monitor reliability metrics to evaluate system performance and ...
Quick apply
Reliability Engineer III - Project Based
Fort Worth, TX · On-site
$112K - $131K/yr
Collaborate with engineering teams to incorporate Design for Reliability (DFR) principles into system development * Establish and monitor reliability metrics to evaluate system performance and ...
Reliability Engineer Job
$97K - $123K/yr
Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...
Reliability Engineer Job
$97K - $123K/yr
Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...
Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering ... You will combine deep software and systems engineering expertise to architect, build, and run large ...
Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering ... You will combine deep software and systems engineering expertise to architect, build, and run large ...
As a SRE Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, and designed to scale. You will collaborate with engineering teams to ...
As a SRE Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, and designed to scale. You will collaborate with engineering teams to ...
Engineering - SRE Platforms - Site Reliability Engineer - Vice President - Dallas
Dallas, TX · On-site
$56.50 - $75/hr
Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering ... You will combine deep software and systems engineering expertise to architect, build, and run large ...
Engineering - SRE Platforms - Site Reliability Engineer - Vice President - Dallas
Dallas, TX · On-site
$56.50 - $75/hr
Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering ... You will combine deep software and systems engineering expertise to architect, build, and run large ...
Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering ... You will combine deep software and systems engineering expertise to architect, build, and run large ...
Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering ... You will combine deep software and systems engineering expertise to architect, build, and run large ...
This Reliability Engineering position is seeking a candidate with 3 or more years of working experience that is able to complete Reliability work assignments independently using technical reliability ...
This Reliability Engineering position is seeking a candidate with 3 or more years of working experience that is able to complete Reliability work assignments independently using technical reliability ...
This Reliability Engineer/Specialist will be primarily responsible for executing Vistra's reliability improvement projects, applying sound engineering judgment and ensuring alignment with established ...
This Reliability Engineer/Specialist will be primarily responsible for executing Vistra's reliability improvement projects, applying sound engineering judgment and ensuring alignment with established ...
Reliability Engineer Job
Houston, TX · On-site
$97K - $123K/yr
Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...
Reliability Engineer Job
Houston, TX · On-site
$97K - $123K/yr
Arkema Inc. is actively recruiting for a Reliability Engineering II who directly reports to Reliability Engineering and Project Manager and will interface with the appropriate Arkema and contractor ...
Compliance Engineering, Site Reliability Engineering, Vice President, Dallas
Dallas, TX · On-site
$56.50 - $75/hr
As a SRE Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, and designed to scale. You will collaborate with engineering teams to ...
Compliance Engineering, Site Reliability Engineering, Vice President, Dallas
Dallas, TX · On-site
$56.50 - $75/hr
As a SRE Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, and designed to scale. You will collaborate with engineering teams to ...
Service Reliability Engineer, G&A Solutions Engineering
Austin, TX · On-site
$56.50 - $75/hr
Join Apple's General and Administrative (G&A) Solutions Engineering team as a Service Reliability Engineer and play a vital role in supporting our global, mission-critical production systems.
Service Reliability Engineer, G&A Solutions Engineering
Austin, TX · On-site
$56.50 - $75/hr
Join Apple's General and Administrative (G&A) Solutions Engineering team as a Service Reliability Engineer and play a vital role in supporting our global, mission-critical production systems.
Reliability Engineering information
See Texas salary details
$56.8K - $63.6K
0% of jobs
$63.6K - $70.4K
2% of jobs
$70.4K - $77.2K
3% of jobs
$77.2K - $83.9K
8% of jobs
$83.9K - $90.7K
7% of jobs
$97.5K is the 25th percentile. Wages below this are outliers.
$90.7K - $97.5K
5% of jobs
$97.5K - $104.3K
4% of jobs
$104.3K - $111K
3% of jobs
$111K - $117.8K
2% of jobs
The median wage is $119.4K / yr.
$117.8K - $124.6K
63% of jobs
$124.6K - $131.4K
2% of jobs
$56.8K
$109.9K
$131.4K
How much do reliability engineering jobs pay per year?
What engineers make $500,000?
What are the key skills and qualifications needed to thrive as a Reliability Engineer, and why are they important?
What is reliability engineering?
What is the difference between Reliability Engineering vs Maintenance Engineering?
| Aspect | Reliability Engineering | Maintenance Engineering |
|---|---|---|
| Focus | Designing systems for reliability and minimizing failures | Maintaining and repairing existing equipment to ensure operational uptime |
| Certifications | Reliability Engineering certifications, Six Sigma, engineering degrees | Maintenance certifications, HVAC, electrical, or mechanical licenses |
| Work Environment | Design offices, project planning, analysis labs | On-site equipment, repair shops, plant floors |
| Industry Usage | Manufacturing, aerospace, energy, oil & gas | Manufacturing, facilities management, utilities |
Reliability Engineering focuses on designing and improving systems to prevent failures, while Maintenance Engineering emphasizes repairing and maintaining equipment to ensure continuous operation. Both roles are essential in industries like manufacturing and energy, but they differ in their approach—one is proactive, the other reactive.
How does a Reliability Engineer typically collaborate with cross-functional teams to improve system reliability?
What does a reliability engineer do?
What engineers make $300,000 a year?
What is a SRE engineer's salary?
$56.50 - $75/hr
Other
Posted 12 hours ago
Goldman Sachs rating
8.3
Based on 25 frontline employees who took The Breakroom Quiz
29th of 141 rated banks
Job description
Team Overview
The Workplace Engineering organization is responsible for the reliability, resilience, and operational integrity of the firm's endpoint compute platforms and services, including:
- Corporateowned physical devices
- Virtual and cloudhosted desktops
- Core endpoint services such as device lifecycle management, access and identity integration, profile and session services, and application delivery frameworks
The Endpoint Compute SRE function applies Site Reliability Engineering (SRE) principles to ensure these platforms and services are highly available, observable, scalable, and recoverable, while meeting operational and regulatory expectations.
Role Summary
We are seeking an Endpoint Compute SRE Lead to own reliability engineering and operational excellence across endpoint compute platforms and their foundational services.
This role is focused on systems and services, not applications, and covers the reliability of:
- Endpoint compute platforms (physical, virtual, cloud desktops)
- Device and desktop lifecycle services
- Access and signin dependency platforms
- Profile, policy, and session services
- Application delivery and execution frameworks (packaging, deployment, availability-not app functionality)
The successful candidate will define service-level objectives, observability strategies, failure models, and operational practices that ensure a predictable and resilient enduser compute experience at enterprise scale.
Job Responsibilities
Reliability Engineering Across Endpoint Services
- Own end-to-end reliability of endpoint compute platforms and supporting services
- Define service boundaries, dependencies, and critical paths from user signin through productive desktop use
- Model failure modes and blast radius across lifecycle, access, and delivery services
- Drive designs that support graceful degradation and fast recovery
Observability & Telemetry
- Establish observability standards across endpoint compute services, including:
- Enrollment and provisioning success rates
- Access and session establishment health
- Policy and profile delivery latency/failures
- Application delivery availability
- Ensure telemetry enables:
- Fast incident detection
- Root cause analysis
- Proactive trend identification
SLOs, SLIs & Error Budgets
- Define SLOs and SLIs for key endpoint services (e.g., signin success, provisioning time, policy convergence)
- Implement error budget frameworks to guide change, security control rollout, and platform evolution
- Use reliability signals to influence platform design and operational priorities
Incident, Problem & Resilience Management
- Lead reliability aspects of incident response involving endpoint compute or services
- Drive postincident reviews focused on systemic corrections
- Identify recurring failure patterns in:
- Lifecycle flows
- Access paths
- Policy or profile delivery
- Sponsor and track permanent fixes, not workarounds
Operational Excellence & Automation
- Define and maintain runbooks, playbooks, and escalation models for endpoint services
- Drive automation to reduce:
- Manual remediation
- Repeat incidents
- Operational toil
- Influence engineering designs to improve operability and debuggability
Risk & Governance Alignment
- Partner with Technology Risk and Security teams to:
- Demonstrate reliability and recoverability controls
- Support operational risk and resilience assessments
- Provide auditready evidence for availability and incident management
- Ensure reliability metrics support control effectiveness narratives
Leadership & Collaboration
- Act as the reliability authority for endpoint compute and services
- Partner closely with:
- Endpoint platform engineers
- Device management teams
- Security engineering and identity teams
- Mentor engineers in applying SRE principles to workplace platforms
- Communicate reliability posture clearly to leadership
Basic Qualifications
- 8+ years in SRE, platform operations, reliability engineering, or workplace infrastructure roles
- Strong experience operating endpoint compute platforms and core supporting services at enterprise scale
- Proven ability to define and implement:
- Observability frameworks
- SLOs / SLIs
- Incident and problem management models
- Strong systems thinking across lifecycle, access, and service dependencies
- Excellent documentation and communication skills
Preferred Qualifications
- Experience applying SRE concepts to enduser computing or digital workplace platforms
- Deep understanding of:
- Device lifecycle and provisioning services
- Identity and access dependencies (availability-focused)
- Profile, policy, and session orchestration
- Experience in regulated or highassurance environments
- Strong ability to influence architecture using datadriven reliability insights
What Success Looks Like
- Endpoint compute and services have clear reliability targets
- Lifecycle, access, and delivery failures are predictable, observable, and fast to remediate
- Incidents are less frequent, shorter, and less impactful
- Platforms are designed with operability and resilience built in
- Leadership has confidence in desktop stability as a service
What Goldman Sachs employees say
Pay
Benefits
Hours and flexibility
Workplace
Get the full story on Breakroom
About Goldman Sachs
Sourced by ZipRecruiter
At Goldman Sachs, we commit our people, capital and ideas to help our clients, shareholders and the communities we serve to grow. Founded in 1869, we are a leading global investment banking, securities and investment management firm. Headquartered in New York, we maintain offices around the world. We believe who you are makes you better at what you do. We're committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally, from our training and development opportunities and firmwide networks to benefits, wellness and personal finance offerings and mindfulness programs.
Industry
Finance and insurance
Company size
10,000+ Employees
Headquarters location
New York, NY, US
Year founded
1869