1

Cloud Site Reliability Engineer Jobs (NOW HIRING)

Cloud/DevOps Engineer - III

Dallas, TX · On-site

$56.50 - $75/hr

JOB SUMMARY As a Senior Cloud Engineer in the Cloud SRE team, you will be responsible for designing and developing cloud solutions and engineering reliability tools for the Cloud Foundation Services ...

Cloud / SRE Engineer

Atlantic City, NJ · On-site

$57.25 - $76/hr

... implement SRE practices (SLIs/SLOs/SLAs), lead incident response playbooks, and architect cloud-native solutions to meet FAA Efficiency Critical requirements during the migration. Primary ...

The Site Reliability Engineer (SRE) is a critical part of our Mapfre USA On-Prem and Cloud platform strategy. In this role, you will be focused on ensuring MUSA's development platform and processes ...

SRE Manager / SRE Architect

Fort Mill, SC · On-site

$50 - $66.50/hr

This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise. The ideal candidate will be deeply involved in designing and ...

SRE Manager / SRE Architect

Manhattan, NY · On-site

$62.50 - $83/hr

This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise. The ideal candidate will be deeply involved in designing and ...

$53 - $70.50/hr

This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise. The ideal candidate will be deeply involved in designing and ...

next page

Showing results 1-20

Cloud Site Reliability Engineer information

See salary details

$10

$63

$91

How much do cloud site reliability engineer jobs pay per hour?

As of Jun 15, 2026, the average hourly pay for cloud site reliability engineer in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Cloud Site Reliability Engineer position, and why are they important?

To thrive as a Cloud Site Reliability Engineer, you need a solid background in cloud infrastructure, automation, monitoring, and reliability engineering, supported by a degree in computer science or related field. Proficiency with cloud platforms like AWS, Azure, or Google Cloud, along with expertise in scripting languages, container orchestration tools (such as Kubernetes), and relevant certifications (e.g., AWS Certified DevOps Engineer) is highly valued. Strong analytical thinking, clear communication, and effective problem-solving skills help SREs excel in dynamic, cross-functional teams. These capabilities ensure the design, operation, and continuous improvement of resilient and scalable cloud systems, minimizing downtime and enhancing user experience.

What is a Cloud Site Reliability Engineer job?

A Cloud Site Reliability Engineer (Cloud SRE) is responsible for ensuring the reliability, availability, and performance of cloud-based systems and services. They apply software engineering principles to infrastructure and operations tasks, automating processes like monitoring, incident response, and scaling. Cloud SREs work closely with development and operations teams to build resilient cloud environments, implement observability tools, and manage system reliability. Their goal is to minimize downtime, optimize performance, and enhance the overall efficiency of cloud services.

What are the typical day-to-day responsibilities of a Cloud Site Reliability Engineer?

Cloud Site Reliability Engineers (SREs) are responsible for maintaining, monitoring, and improving the reliability and performance of cloud-based services. This involves automating repetitive tasks, designing robust monitoring and alerting systems, managing incidents, and working proactively to prevent outages. SREs often collaborate closely with software development, infrastructure, and security teams to deploy new features and ensure scalability. Additionally, they participate in on-call rotations and root cause analyses to address and learn from incidents. These varied tasks help ensure the stability and efficiency of cloud environments while allowing SREs to develop a broad skill set and contribute to continuous service improvement.

More about Cloud Site Reliability Engineer jobs
Infographic showing various Cloud Site Reliability Engineer job openings in the United States as of June 2026, with employment types broken down into 98% Full Time, 1% Part Time, and 1% Contract. Highlights an 87% Physical, 5% Hybrid, and 8% Remote job distribution, with an average salary of $132,583 per year, or $63.7 per hour.
Asset & Wealth Management-Cloud SRE Engineer-Associate-Dallas

Asset & Wealth Management-Cloud SRE Engineer-Associate-Dallas

Goldman Sachs, Inc.

Dallas, TX • On-site

$56.50 - $75/hr

Full-time

Posted 26 days ago


Goldman Sachs rating

8.3

Company rating: 8.3 out of 10

Based on 25 frontline employees who took The Breakroom Quiz

29th of 141 rated banks


Job description

Job Description
Cloud SRE Engineer - Associate
Who We Look For:
Goldman Sachs Engineers are innovators and problem-solvers who thrive in fast-paced global environments. We are seeking a motivated Cloud Site Reliability Engineer (SRE) to support the WM Data Engineering ecosystem. In this role, you will apply software engineering principles to operational challenges, ensuring that our cloud-native services - primarily running on AWS are resilient, scalable, and cost-optimized. As we transition from on-premises legacy systems to AWS, you will be the guardian of system health, moving beyond traditional dashboards to implement predictive remediation and SLOs-as-Code.
Key Responsibilities:
  1. Reliability & Performance Engineering:
    • SLO Management: Define and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs) using OpenSLO or similar declarative frameworks. Manage "Error Budgets" to balance the pace of innovation with system stability.
    • Predictive Observability: Implement AI-driven observability stacks (e.g., Datadog, Amazon CloudWatch Container Insights, or OpenTelemetry) to detect "p99" latency spikes and subtle configuration drifts before they impact users.
    • Incident Response: Lead high-severity incident restoration and conduct blameless post-mortems to identify root causes and automate future prevention.
  2. Cloud Migration & Orchestration:
    • Microservices Migration: Support the migration of on-premises microservices to Amazon ECS (Fargate/EC2). Design and maintain task definitions, service discovery via AWS Cloud Map, and inter-service communication using Amazon ECS Service Connect.
    • Infrastructure as Code (IaC): Develop and maintain modular, version-controlled infrastructure using Terraform or AWS CDK, ensuring that reliability guardrails are baked into every deployment.
    • Automation of Toil: Identify and eliminate repetitive manual tasks ("toil") by developing custom automation tools in Python or Go.
  3. Modernization:
    • Migration Support: Contribute to the migration of on-premises data workloads to AWS.

Qualifications:
Technical Requirements
  • Experience: 4+ years in SRE, DevOps, or Cloud Engineering roles, with a strong focus on production operations for distributed systems.
  • Container Orchestration: Deep proficiency in Amazon ECS (Fargate and EC2 launch types). Experience with Docker containerization and managing service-to-service connectivity.
  • Programming: Strong proficiency in Python or Java for automation and tool development. Expert-level SQL for data-driven reliability analysis.
  • Cloud Platforms: Advanced knowledge of AWS core services (VPC, IAM, S3, Lambda) and networking (Transit Gateway, PrivateLink).
  • Observability Tools: Hands-on experience with modern monitoring and tracing tools such as Prometheus, Grafana, AWS X-Ray, or Splunk.
  • CI/CD for Containers: Proven ability to build automated deployment pipelines for ECS using AWS CodePipeline, GitHub Actions, or Terraform, incorporating blue/green or canary deployment strategies.
  • Soft Skills: Strong problem-solving "builder" mindset and the ability to communicate technical concepts within a team environment.

Education
  • Bachelor's or Master's degree in computer science, Engineering, Mathematics, or a related field.

ABOUT GOLDMAN SACHS
At Goldman Sachs, we commit our people, capital and ideas to help our clients, shareholders and the communities we serve to grow. Founded in 1869, we are a leading global investment banking, securities and investment management firm. Headquartered in New York, we maintain offices around the world.
We believe who you are makes you better at what you do. We're committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally, from our training and development opportunities and firmwide networks to benefits, wellness and personal finance offerings and mindfulness programs. Learn more about our culture, benefits, and people at GS.com/careers.
We're committed to finding reasonable accommodations for candidates with special needs or disabilities during our recruiting process. Learn more: https://www.goldmansachs.com/careers/footer/disability-statement.html
© The Goldman Sachs Group, Inc., 2023. All rights reserved.
Goldman Sachs is an equal opportunity employer and does not discriminate on the basis of race, color, religion, sex, national origin, age, veterans status, disability, or any other characteristic protected by applicable law.

What Goldman Sachs employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Goldman Sachs logo

About Goldman Sachs

Sourced by ZipRecruiter

At Goldman Sachs, we commit our people, capital and ideas to help our clients, shareholders and the communities we serve to grow. Founded in 1869, we are a leading global investment banking, securities and investment management firm. Headquartered in New York, we maintain offices around the world. We believe who you are makes you better at what you do. We're committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally, from our training and development opportunities and firmwide networks to benefits, wellness and personal finance offerings and mindfulness programs.

Industry

Finance and insurance

Company size

10,000+ Employees

Headquarters location

New York, NY, US

Year founded

1869