1

Cloud Site Reliability Engineer Jobs (NOW HIRING)

$53 - $70.50/hr

This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise. The ideal candidate will be deeply involved in designing and ...

Site Reliability Engineer (SRE)

Austin, TX · On-site

$56.50 - $75/hr

Austin, TX Job Type: Full Time Technical Skills: * 6+ years of professional engineering experience developing, managing, or supporting distributed systems * 4+ SRE experience managing multi-cloud ...

Site Reliability Engineer (SRE)

Parsippany, NJ · On-site

$57.25 - $76.25/hr

We are looking for a talented Site Reliability Engineer (SRE) with a strong background in Google ... Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud ...

Site Reliability Engineer

Irondale, AL · On-site

$48.25 - $64/hr

Site Reliability Engineer Site Reliability Engineer (SRE) Hybrid Opportunity | Enterprise Cloud Environment Growing enterprise technology organization is seeking an experienced Site Reliability ...

SRE Manager / SRE Architect

Manhattan, NY · On-site

$62.50 - $83/hr

This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise. The ideal candidate will be deeply involved in designing and ...

Site Reliability Engineer

Northville, MI · On-site

$53.75 - $71.25/hr

Site Reliability Engineer (SRE) About Liveline Liveline enables dramatic improvements in ... This spans from the factory-floor edge systems to AWS cloud components. You will help build and run ...

Site Reliability Engineer

Frederick, MD · On-site

$56.75 - $75.25/hr

Transportation Reimbursement Account (TRN) The Site Reliability Engineer role centers on modernizing and consolidating a complex multi-cloud environment across AWS, Azure, and GCP, building a ...

SITE RELIABILITY ENGINEER

Camden, NJ · On-site

$130K - $150K/yr

Our current focus is on modular, event-driven, API-first and cloud architectures. We continue to enhance reliability and accelerate engineering productivity by strengthening our SRE and AI practices.

Site Reliability Engineer

Northville, MI · On-site

$53.75 - $71.25/hr

Site Reliability Engineer (SRE) About Liveline Liveline enables dramatic improvements in ... This spans from the factory-floor edge systems to AWS cloud components. You will help build and run ...

Site Reliability Engineer

Frederick, MD · Hybrid

$56.75 - $75.25/hr

Transportation Reimbursement Account (TRN) The Site Reliability Engineer role centers on modernizing and consolidating a complex multi-cloud environment across AWS, Azure, and GCP, building a ...

Site Reliability Engineer

Frederick, MD · Hybrid

$56.75 - $75.25/hr

Transportation Reimbursement Account (TRN) The Site Reliability Engineer role centers on modernizing and consolidating a complex multi-cloud environment across AWS, Azure, and GCP, building a ...

next page

Showing results 1-20

Cloud Site Reliability Engineer information

See salary details

$10

$63

$91

How much do cloud site reliability engineer jobs pay per hour?

As of Jun 15, 2026, the average hourly pay for cloud site reliability engineer in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Cloud Site Reliability Engineer position, and why are they important?

To thrive as a Cloud Site Reliability Engineer, you need a solid background in cloud infrastructure, automation, monitoring, and reliability engineering, supported by a degree in computer science or related field. Proficiency with cloud platforms like AWS, Azure, or Google Cloud, along with expertise in scripting languages, container orchestration tools (such as Kubernetes), and relevant certifications (e.g., AWS Certified DevOps Engineer) is highly valued. Strong analytical thinking, clear communication, and effective problem-solving skills help SREs excel in dynamic, cross-functional teams. These capabilities ensure the design, operation, and continuous improvement of resilient and scalable cloud systems, minimizing downtime and enhancing user experience.

What is a Cloud Site Reliability Engineer job?

A Cloud Site Reliability Engineer (Cloud SRE) is responsible for ensuring the reliability, availability, and performance of cloud-based systems and services. They apply software engineering principles to infrastructure and operations tasks, automating processes like monitoring, incident response, and scaling. Cloud SREs work closely with development and operations teams to build resilient cloud environments, implement observability tools, and manage system reliability. Their goal is to minimize downtime, optimize performance, and enhance the overall efficiency of cloud services.

What are the typical day-to-day responsibilities of a Cloud Site Reliability Engineer?

Cloud Site Reliability Engineers (SREs) are responsible for maintaining, monitoring, and improving the reliability and performance of cloud-based services. This involves automating repetitive tasks, designing robust monitoring and alerting systems, managing incidents, and working proactively to prevent outages. SREs often collaborate closely with software development, infrastructure, and security teams to deploy new features and ensure scalability. Additionally, they participate in on-call rotations and root cause analyses to address and learn from incidents. These varied tasks help ensure the stability and efficiency of cloud environments while allowing SREs to develop a broad skill set and contribute to continuous service improvement.

More about Cloud Site Reliability Engineer jobs
Infographic showing various Cloud Site Reliability Engineer job openings in the United States as of June 2026, with employment types broken down into 98% Full Time, 1% Part Time, and 1% Contract. Highlights an 87% Physical, 5% Hybrid, and 8% Remote job distribution, with an average salary of $132,583 per year, or $63.7 per hour.

SRE Manager / SRE Architect

Qode

On-site

$53 - $70.50/hr

Full-time

Posted 13 days ago


Job description

Job Description - SRE Manager / SRE Architect (Hands-on)

Location: New York City, NY / Fort Mill, SC (Hybrid)

Employment Type: Full-Time / Contract

Industry: Financial Services

Position Overview

We are seeking a highly experienced and hands-on Site Reliability Engineering (SRE) Manager / SRE Architect to lead reliability, availability, performance, and release management initiatives across enterprise-scale applications and platforms. This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise.

The ideal candidate will be deeply involved in designing and implementing reliability strategies, driving release governance, improving deployment processes, and ensuring operational excellence across cloud-native environments.

LaunchDarkly experience is highly preferred but not mandatory.

Key Responsibilities

Site Reliability Engineering (SRE)

  • Design and implement SRE best practices focused on reliability, scalability, performance, and availability.
  • Define and monitor SLIs, SLOs, and error budgets across critical applications and services.
  • Drive proactive monitoring, alerting, observability, and incident management processes.
  • Lead root cause analysis (RCA) efforts and implement preventive measures.
  • Improve system resiliency through automation, self-healing capabilities, and operational excellence.
  • Establish reliability standards across distributed systems and cloud platforms.

Release Management

  • Own and drive end-to-end release management processes across multiple environments.
  • Coordinate application releases across development, QA, UAT, staging, and production environments.
  • Develop release governance, release calendars, deployment strategies, rollback procedures, and change management processes.
  • Partner with development, QA, infrastructure, and business teams to ensure smooth production deployments.
  • Identify and mitigate release risks while minimizing downtime and business impact.
  • Implement deployment automation and continuous delivery best practices.

DevOps & Automation

  • Design and maintain CI/CD pipelines using modern DevOps tools.
  • Automate infrastructure provisioning, deployment, monitoring, and operational workflows.
  • Drive Infrastructure as Code (IaC) adoption using Terraform or similar technologies.
  • Support cloud-native architectures and containerized application deployments.
  • Partner with engineering teams to improve developer productivity and deployment velocity.

Cloud & Platform Engineering

  • Manage and optimize cloud infrastructure on AWS and/or Azure.
  • Support Kubernetes, container orchestration, and cloud-native application platforms.
  • Ensure platform scalability, security, compliance, and operational readiness.
  • Drive platform modernization initiatives and operational transformation efforts.

Required Skills & Experience

Core SRE Skills

  • 15+ years of IT experience with strong focus on SRE, DevOps, Platform Engineering, or Production Support.
  • Extensive hands-on experience implementing SRE practices in enterprise environments.
  • Strong understanding of:
  • SLI/SLO/Error Budgets
  • Incident Management
  • Problem Management
  • Capacity Planning
  • Reliability Engineering
  • Observability & Monitoring

Release Management

  • Proven experience managing large-scale production releases.
  • Strong expertise in:
  • Release Planning
  • Release Governance
  • Change Management
  • Deployment Automation
  • Rollback Strategies
  • Production Readiness Reviews

DevOps & Cloud

  • Hands-on experience with:
  • AWS and/or Azure
  • Kubernetes (EKS, AKS, OpenShift preferred)
  • Docker
  • Terraform
  • GitHub Actions, Jenkins, Azure DevOps, GitLab CI/CD
  • Experience building and maintaining CI/CD pipelines.

Monitoring & Observability

  • Strong experience with:
  • Dynatrace
  • Datadog
  • Splunk
  • Prometheus
  • Grafana
  • ELK Stack
  • CloudWatch

Scripting & Automation

  • Experience with Python, Bash, PowerShell, or similar scripting languages.
  • Strong automation mindset with focus on operational efficiency.

Nice to Have

  • LaunchDarkly end-to-end implementation experience
  • Feature flag management and progressive delivery strategies.
  • Financial Services, Banking, or Wealth Management domain experience.
  • Experience leading SRE or DevOps transformation initiatives.
  • Cloud certifications (AWS, Azure, Kubernetes).

Preferred Candidate Profile

  • Strong hands-on SRE leader, not just a people manager.
  • Deep expertise in Release Management and Production Support.
  • Proven background in DevOps, Cloud Engineering, and Platform Reliability.
  • Ability to work with development, infrastructure, security, and business teams.

Keywords

SRE, Site Reliability Engineering, Release Management, DevOps, Terraform, AWS, Azure, Kubernetes, Dynatrace, CI/CD, LaunchDarkly, Production Support, Incident Management, Reliability Engineering, Observability, Platform Engineering, Infrastructure Automation.