1

Amazon Sre Jobs (NOW HIRING)

Site Reliability Engineer II

Exton, PA · On-site

$55 - $73/hr

Cloud platforms Azure and Amazon Web Services (AWS), including infrastructure provisioning ... Site Reliability Engineering and DevOps automation including designing, implementing and ...

Site Reliability Engineer (SRE)

Englewood, CO

$56.25 - $74.75/hr

The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and ...

About the Role As a Site Reliability Engineer (SRE) at Mercor, you'll own production reliability across our most critical systems, partnering directly with infrastructure leadership. You'll play a ...

Site Reliability Engineer (SRE)

Reston, VA

$59.25 - $78.75/hr

The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and ...

SRE

Austin, TX · On-site

$56.50 - $75/hr

ABOUT THIS FEATURED OPPORTUNITY This organization is seeking a Site Reliability Engineer (SRE) to support Clarity, a business planning tool, and Halo, an internal IT inventory platform used to track ...

The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and ...

Site Reliability Engineer II

Exton, PA · On-site

$55 - $73/hr

Cloud platforms Azure and Amazon Web Services (AWS), including infrastructure provisioning ... Site Reliability Engineering and DevOps automation including designing, implementing and ...

SRE

Charlotte, NC · On-site

$55.75 - $74/hr

Role: SRE Location: Charlotte, NC Skills: Grafana, Python, Splunk, Linux, Scripting. Microsoft 360 or Power BI Job Summary The Senior Support Lead in Site Reliability engineering (SRE) will be ...

Site Reliability Engineer (SRE)

Austin, TX · On-site

$56.50 - $75/hr

Site Reliability Engineer (SRE) Location: Austin, TX Job Type: Full Time Job Summary - Seasoned Site Reliability Engineer (SRE) with 7+ years of experience in supporting complex, large-scale ...

next page

Showing results 1-20

Amazon Sre information

See salary details

$10

$63

$91

How much do amazon sre jobs pay per hour?

As of Jun 12, 2026, the average hourly pay for amazon sre in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Amazon Sre position, and why are they important?

To thrive as an Amazon SRE (Site Reliability Engineer), you need a strong background in computer science, systems engineering, and automation, typically demonstrated through a relevant degree or equivalent experience. Familiarity with cloud services (especially AWS), containerization tools (like Docker or Kubernetes), monitoring solutions, and infrastructure-as-code platforms, along with certifications (such as AWS Certified DevOps Engineer), is highly valued. Problem-solving mindset, effective communication, and the ability to collaborate across multidisciplinary teams are essential soft skills. These competencies help maintain high system reliability, enable efficient incident response, and support the rapid innovation environment at Amazon.

What are the typical daily responsibilities of an Amazon SRE, and how does the role interact with other teams?

As an Amazon SRE, your daily responsibilities include monitoring system performance, proactively identifying and resolving reliability issues, automating repetitive tasks, and participating in on-call rotations to address production incidents. You will also work on infrastructure improvements, conduct root cause analyses after outages, and help implement robust deployment pipelines. Collaboration is frequent, as SREs partner closely with software engineers, product managers, and operations staff to ensure applications run smoothly and reliably. This teamwork fosters a culture of continuous improvement, making Amazon's large-scale services resilient and highly available.

What is an Amazon SRE job?

An Amazon Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of Amazon's infrastructure and services. They bridge the gap between development and operations, applying software engineering principles to system administration tasks. SREs focus on automation, monitoring, and incident response to minimize downtime and improve efficiency. Their goal is to create highly available and resilient systems while balancing innovation and system stability.

More about Amazon Sre jobs
What cities are hiring for Amazon Sre jobs? Cities with the most Amazon Sre job openings:
What states have the most Amazon Sre jobs? States with the most job openings for Amazon Sre jobs include:
What job categories do people searching Amazon Sre jobs look for? The top searched job categories for Amazon Sre jobs are:

Principal Site Reliability Engineer

Fidelity Investments

Durham, NC • On-site

$55 - $73.25/hr

Full-time

Posted 3 days ago


Fidelity Investments rating

8.7

Company rating: 8.7 out of 10

Based on 264 frontline employees who took The Breakroom Quiz

14th of 138 rated financial services


Job description

Job Description:
Position Description:
Combines Operational excellence with Development experience to deliver services at high scale, high availability with resilience. Builds reliability into the ecosystem by applying best practices in Resiliency Engineering, Automation, Observability and Chaos Testing. Streamlines and accelerates software delivery cycle by using DevOps practices and toolchain. Integrates Site Reliability Engineering (SRE) practices (Observability and Chaos) with DevOps processes and delivery pipelines to stop bad code from reaching production. Ensures business-critical enterprise systems are continuously available to internal and external customers. Implements technical standardization and process refinements within the engineering organization and for Site Reliability Engineers. Collaborates with production support teams to define and implement processes for the identification, collection, and analysis of incident data. Brings together technical, procedural, and financial data to reduce toil and increase efficiency.
Primary Responsibilities:
  • Develops Chaos Testing capabilities using multiple Chaos Tools (AWS Fault Injection Service (FIS), Chaos Mesh, and Chaosd) and Chaos Toolkit.
  • Develops and enhances organization's internal Chaos Framework to streamline Chaos Executions and reporting.
  • Provides specialized technical expertise in the adoption of Chaos Engineering by application teams.
  • Chaos tests and observes business-critical applications to understand the weaknesses and increase application resiliency.
  • Activates Observability for the critical applications with recommended Service Level Indicators and Service Level Objectives for Latency, Availability, Error Rate etc.
  • Utilizes modern monitoring tools (Datadog, Splunk, Catchpoint etc.) to reduce mean time to detect an issue and improve the response times.
  • Creates CI/CD pipelines with security and quality checks with Application Lifecycle management toolchain. Helps in integrating Chaos and Observability with CI/CD pipelines.
  • Automates repetitive activities using scripting languages (Python, Groovy etc.).
  • Implements and supports solutions based on cloud platforms AWS/Azure and container orchestration Kubernetes.
  • Onboards /Evaluates New Cloud services that help to enhance the Resiliency of cloud ecosystem. Serves as a liaison for vendor engagement.
  • Participates in incident management, problem management and incident postmortems.
  • Takes part in peer code reviews providing qualitative feedback.
  • Builds processes and capabilities to adapt and respond to risks, and disruptions, while maintaining business operations and data recovery with minimal disruptions.
  • Coaches peer SREs and application teams on SRE and DevOps.
  • Implements Agile methodologies in the team's project completion using incremental and iterative steps.

Education and Experience:
Bachelor's degree in Computer Science, Engineering, Information Technology, Information Systems, or a closely related field (or foreign education equivalent) and five (5) years of experience as a Principal Site Reliability Engineer (or closely related occupation) implementing resilient container and cloud-based applications and infrastructure solutions, using DevOps or SRE practices, in a financial services environment.
Or, alternatively, Master's degree (or foreign education equivalent) in Computer Science, Engineering, Information Technology, Information Systems, or a closely related field (or foreign education equivalent) and three (3) years of experience as a Principal Site Reliability Engineer (or closely related occupation) implementing resilient container and cloud-based applications and infrastructure solutions, using DevOps or SRE practices, in a financial services environment.
Skills and Knowledge:
Candidate must also possess:
  • Demonstrated Expertise ("DE") improving application resiliency by implementing chaos engineering to build system's capability to withstand turbulent conditions in production, using Chaos Mesh, Chaosd, Azure Chaos Studio, AWS FIS, or Gremlin; and driving automation to implement scalable approaches for the planning, design, execution, and reporting of chaos testing using Jenkins pipelines, standard frameworks, data visualization, and dashboards.
  • DE implementing advanced observability practices and techniques in production and pre-production environments, at scale using Datadog, Splunk, or Catchpoint; tracking the error budget, proactively identifying issues, minimizing Mean Time to Repair (MTTR); and balancing customer expectations by implementing Service-Level Indicators (SLIs) and Service-Level Objectives (SLOs) using logs, traces, monitors and synthetic tests.
  • DE migrating and maintaining cloud applications and creating cloud solutions using Amazon Web Services (AWS) or Azure cloud services; Implementing infrastructure as code for cloud; Onboarding new AWS or Azure services with required reviews and security controls in non-production and production environments; and researching evolving cloud ecosystem to adopt machine learning based tools (AWS DevOps guru) to boost AIOps abilities.
  • DE implementing CI/CD pipelines in both production and non-production environments using Application Lifecycle Management (ALM) tools (JIRA, GitHub, Jenkins, SonarQube, Artifactory, or uDeploy) to enable faster code delivery, enhanced software quality, reliability, and security; and developing products, and core and common capabilities for the organization to reduce toil and drive standardization, using containerization and orchestration technologies (Docker or Kubernetes), Infrastructure as Code (IaC) tools, scripting languages (Python or Groovy), and engineering best practices.

#PE1M2
#LI-DNI
Certifications:
Category:
Information Technology
Please be advised that Fidelity's business is governed by the provisions of the Securities Exchange Act of 1934, the Investment Advisers Act of 1940, the Investment Company Act of 1940, ERISA, numerous state laws governing securities, investment and retirement-related financial activities and the rules and regulations of numerous self-regulatory organizations, including FINRA, among others. Those laws and regulations may restrict Fidelity from hiring and/or associating with individuals with certain Criminal Histories.

What Fidelity Investments employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom