1

Site Reliability Engineer Iii Jobs (NOW HIRING)

Principal III, SRE

Torrance, CA · On-site

$59.75 - $79.50/hr

Overview THE ROLE The SRE Principal Engineer III will work a hybrid schedule, with a requirement to be onsite at our Torrance, CA facility at least two days per week or more if needed, while also ...

Principal III, SRE

Torrance, CA · Hybrid

$59.75 - $79.50/hr

THE ROLE The SRE Principal Engineer III will work a hybrid schedule, with a requirement to be onsite at our Torrance, CA facility at least two days per week or more if needed, while also having the ...

Principal III, SRE

Torrance, CA · Hybrid

$59.75 - $79.50/hr

Overview THE ROLE The SRE Principal Engineer III will work a hybrid schedule, with a requirement to be onsite at our Torrance, CA facility at least two days per week or more if needed, while also ...

Site Reliability Engineer III

Plano, TX · On-site

$53.25 - $70.75/hr

Formal training or certification on software engineering concepts and 3+ years applied experience * Proficient in site reliability engineering principles and their application within cloud ...

Site Reliability Engineer III

Plano, TX · On-site

$53.25 - $70.75/hr

Formal training or certification on software engineering concepts and 3+ years applied experience * Proficient in site reliability engineering principles and their application within cloud ...

Site Reliability Engineer III

Plano, TX · On-site

$53.25 - $70.75/hr

Formal training or certification on software engineering concepts and 3+ years applied experience * Proficient in site reliability engineering principles and their application within cloud ...

Site Reliability Engineer III

Jersey City, NJ · On-site

$62.50 - $83/hr

They are seeking a Site Reliability Engineer III who will design and maintain CI/CD pipelines, manage AWS infrastructure, and enhance system reliability and performance. Responsibilities : • Design ...

Job#: 3030115 Site Reliability Engineer III Location: Plano, Texas (Hybrid) Employment Type: Contract Duration: 12 months Role Overview We are seeking a senior Site Reliability Engineer to provide ...

Site Reliability Engineer III- Eng

Alpharetta, GA · On-site

$55.75 - $74/hr

Site Reliability Engineer III Site Reliability Engineers (SREs) at UKG are experienced individual contributors who apply software engineering principles to operational challenges across the full ...

Site Reliability Engineer III

Jersey City, NJ · On-site

$62.25 - $82.75/hr

Required qualifications, capabilities, and skills * 8+ years of software engineering experience with 3+ years of applied Site Reliability Engineering experience in Data Warehousing, Oracle/Snowflake ...

Site Reliability Engineer III

Bedford, NH · On-site

$56.25 - $74.75/hr

SilverTech is seeking a Site Reliability Engineer III (SRE) to help lead and evolve our managed hosting practice. This role works in our Bedford, NH office and is responsible for the performance ...

Site Reliability Engineer III

Bedford, NH · Hybrid

$56.25 - $74.75/hr

SilverTech is seeking a Site Reliability Engineer III (SRE) to help lead and evolve our managed hosting practice. This role works in our Bedford, NH office and is responsible for the performance ...

next page

Showing results 1-20

Site Reliability Engineer Iii information

See salary details

$10

$63

$91

How much do site reliability engineer iii jobs pay per hour?

As of Jun 9, 2026, the average hourly pay for site reliability engineer iii in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Site Reliability Engineer III, and why are they important?

To thrive as a Site Reliability Engineer III, you need deep expertise in systems administration, cloud infrastructure, scripting languages, and a strong grasp of networking, typically supported by a degree in computer science or related experience. Familiarity with tools like Kubernetes, Docker, Terraform, monitoring platforms (e.g., Prometheus, Datadog), and cloud providers (AWS, GCP, Azure), as well as relevant certifications, is highly valuable. Strong problem-solving abilities, collaboration, and effective communication set standout engineers apart in this role. These skills and qualities are crucial for ensuring high system reliability, rapid incident response, and effective cross-team collaboration in complex, high-availability environments.

What is a Site Reliability Engineer III?

A Site Reliability Engineer III is an experienced technical professional responsible for ensuring the reliability, scalability, and performance of large-scale software systems. They bridge the gap between software development and IT operations by automating processes, monitoring system health, and responding to incidents. As a senior-level SRE, they often mentor junior engineers, lead complex projects, and contribute to the development of best practices for system reliability. Their work helps organizations maintain high uptime, rapid deployments, and efficient incident response.

How do Site Reliability Engineer III roles typically collaborate with development and operations teams to improve system reliability?

As a Site Reliability Engineer III, you often serve as a bridge between development and operations teams, facilitating communication and collaboration to enhance system reliability and performance. You may lead incident response efforts, coordinate post-incident reviews, and work closely with developers to design scalable, fault-tolerant systems. Additionally, you’ll likely participate in on-call rotations, automate operational tasks, and contribute to building robust monitoring and alerting systems. This cross-functional collaboration is crucial for proactively identifying reliability risks and implementing solutions that benefit both engineering and operations.
More about Site Reliability Engineer Iii jobs
What job categories do people searching Site Reliability Engineer Iii jobs look for? The top searched job categories for Site Reliability Engineer Iii jobs are:
Infographic showing various Site Reliability Engineer Iii job openings in the United States as of June 2026, with employment types broken down into 85% Full Time, 14% Part Time, and 1% Contract. Highlights an 87% Physical, 5% Hybrid, and 8% Remote job distribution, with an average salary of $132,583 per year, or $63.7 per hour.

Principal III, SRE

Herbalife

Torrance, CA • On-site

$59.75 - $79.50/hr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 10 days ago


Job description

Overview
THE ROLE
The SRE Principal Engineer III will work a hybrid schedule, with a requirement to be onsite at our Torrance, CA facility at least two days per week or more if needed, while also having the flexibility to work remotely. This role is responsible for leading, designing, and implementing robust Site Reliability Engineering (SRE) practices to ensure high availability, scalability, and resilience of critical business systems and applications. The SRE Principal Engineer III will focus on improving system reliability through automation, monitoring, and performance tuning, working closely with development and operations teams to champion a culture of continuous improvement and operational excellence.
The SRE team consists of:
• SRE Engineers
• Deployment Automation
• Incident Response and Postmortem Analysis
• Observability and Monitoring
This role will drive the adoption of best practices in multi-cloud and hybrid-cloud platforms, managing services from major cloud providers like Microsoft Azure, Amazon AWS, Oracle OCI, Google GCP, and Alibaba Cloud. The SRE Principal Engineer III will focus on automation, incident management, performance monitoring, and optimizing infrastructure to support scalable, reliable systems. The position will also be responsible for fostering collaboration between development, operations, and security teams to streamline system operations across the organization.
HOW YOU WOULD CONTRIBUTE:
• Lead the implementation and optimization of SRE practices, ensuring system reliability, performance, and scalability.
• Architect and maintain automation for infrastructure provisioning, deployment, and incident response.
• Establish and implement SLOs (Service Level Objectives) and SLIs (Service Level Indicators) for key services.
• Collaborate with development teams to design and deliver reliable software systems, ensuring that production environments are optimized for uptime and performance.
• Create and maintain monitoring, alerting, and observability solutions to provide real-time insights into system health and performance.
• Respond to production incidents, perform root cause analysis, and implement corrective measures to prevent recurrence.
• Continuously improve system performance, capacity planning, and reliability through infrastructure tuning and automation.
• Facilitate post-incident reviews, fostering a blameless culture that focuses on learning from incidents.
• Collaborate with security teams to ensure infrastructure meets compliance, security standards, and best practices.
• Champion a collaborative environment across development, operations, and security teams to enhance operational efficiency and knowledge sharing.
• Drive the adoption of automation tools and frameworks to minimize manual intervention and optimize systems.
Qualifications
Skills Required:
• Proven expertise in SRE practices, with a focus on automation, incident management, observability, and infrastructure scalability.
• Extensive knowledge of cloud platforms (Azure, AWS, GCP, Alibaba) and hybrid-cloud environments, with a focus on reliability and performance optimization.
• Experience with automation tools and scripting languages, such as Python, Go, Terraform, or Ansible, for leading infrastructure and incident response.
• Strong understanding of containerization (Docker, Kubernetes) and orchestration systems.
• Solid grasp of monitoring and observability tools (Prometheus, Grafana, Dynatrace, Splunk) to ensure real-time system health monitoring.
• Expertise in capacity planning, performance tuning, and failure management techniques.
• Strong background in incident management, root cause analysis, and postmortem processes to improve system resilience.
• Deep understanding of security and compliance requirements, and the ability to ensure production environments meet industry standards.
• Experience with Agile and DevOps methodologies to ensure fast, reliable delivery of services.
Experience Required:
• 10+ years of experience in IT, with a focus on SRE, DevOps, or infrastructure engineering roles.
• Extensive hands-on experience with cloud infrastructure management and automation tools such as Terraform, CloudFormation, or equivalent.
• Proficiency in scripting and automation languages like Python, Bash, Go, or Ruby for infrastructure automation.
• Proven experience in managing large-scale systems, ensuring reliability, high availability, and scalability.
• Expertise in container orchestration technologies, including Kubernetes, OpenShift, and Docker Swarm.
• Deep knowledge of monitoring and observability platforms (Prometheus, Grafana, ELK, Dynatrace), including experience building and maintaining alerting and dashboard systems.
• Strong understanding of version control systems and CI/CD practices to optimize code deployment as it relates to infrastructure.
• Demonstrated ability to optimize performance in multi-cloud and hybrid-cloud environments, ensuring uptime and performance at scale.
Education Required:
• Bachelor's degree in computer science, Information Technology, or related field, or equivalent experience.
Certificates / Training Preferred:
• Relevant cloud certifications such as AWS Certified Solutions Architect, Azure Solutions Architect Expert, or Google Cloud Professional Cloud Architect.
• SRE-related certifications like Certified Kubernetes Administrator (CKA) or Google Professional Cloud DevOps Engineer.
US Benefits Statement
Herbalife offers a variety of benefits to eligible employees in the U.S. (limited to the 50 States and the District of Columbia), which includes Group Health Programs, other Voluntary Benefit Programs, and Paid Time Off. Group Health Programs include Medical, Dental, Vision, Health Savings Account (HSA), Flexible Spending Accounts (FSA), Basic Life/AD&D; Short-Term and Long-Term Disability, and an Employee Assistance Program (EAP). Other Voluntary Benefit Programs include a 401(k) plan, Wellness Incentive Program, Employee Stock Purchase Plan (ESPP), Supplemental Life/Critical Illness/Hospitalization/Accident Insurance, and Pet Insurance. Paid time off includes Company-observed U.S. Holidays, Floating Holidays, Vacation, Sick Time, a Volunteer Program, Paid Maternity and Paternity Leave, Bereavement Leave, Personal Leave and time off for voting.