1

Site Reliability Engineer Jobs in Raleigh, NC (NOW HIRING)

Site Reliability Engineer

Morrisville, NC

$53.25 - $70.75/hr

Site Reliability Engineer The Company: Varonis(Nasdaq: VRNS) secures AI and the data that powers it. The Varonis platform gives organizations automated visibility and control over their critical data ...

Site Reliability Engineer

Morrisville, NC · On-site

$53.25 - $70.75/hr

Description Site Reliability Engineer The Company: Varonis (Nasdaq: VRNS) secures AI and the data that powers it. The Varonis platform gives organizations automated visibility and control over their ...

New

Site Reliability Engineer

Raleigh, NC · On-site +1

$55.50 - $73.75/hr

The Site Reliability Engineer Role Join our dynamic team at Qlik as a Site Reliability Engineer, where you'll play a crucial role in ensuring the security, stability, and scalability of our Qlik and ...

Site Reliability Engineer

Raleigh, NC

$55.50 - $73.75/hr

The Site Reliability Engineer Role Join our dynamic team at Qlik as a Site Reliability Engineer, where you\'ll play a crucial role in ensuring the security, stability, and scalability of our Qlik ...

As a Site Reliability Engineer (SRE) at Litera, you will play a key role in ensuring our SaaS solutions remain stable, scalable, and resilient. Your primary focus will be on enhancing operational ...

Site Reliability Engineer

Raleigh, NC · On-site

$120K - $150K/yr

As a Site Reliability Engineer (SRE) at Litera, you will play a key role in ensuring our SaaS solutions remain stable, scalable, and resilient. Your primary focus will be on enhancing operational ...

Site Reliability Engineer

Raleigh, NC · On-site

$120K - $150K/yr

As a Site Reliability Engineer (SRE) at Litera, you will play a key role in ensuring our SaaS solutions remain stable, scalable, and resilient. Your primary focus will be on enhancing operational ...

Site Reliability Engineer

Raleigh, NC · On-site

$120K - $150K/yr

As a Site Reliability Engineer (SRE) at Litera, you will play a key role in ensuring our SaaS solutions remain stable, scalable, and resilient. Your primary focus will be on enhancing operational ...

SRE Engineer

Raleigh, NC · On-site

$55.50 - $73.75/hr

Monday- Friday, 9am-5pm The Site Reliability Engineer will be an active contributor responsible for configuring Dynatrace as the main business monitoring platform and SolarWinds as the primary ...

SRE Engineer - PxE Talent

Raleigh, NC

$55.50 - $73.75/hr

As a SRE Engineer you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility projects. Your expertise will be pivotal in delivering solutions that ...

Site Reliability Engineer II

Raleigh, NC · On-site

$55.50 - $73.75/hr

Site Reliability Engineer II The SRE II sits at the intersection of software engineering and platform operations. You will own the reliability, scalability, and operational hygiene of Kastle's core ...

next page

Showing results 1-20

Site Reliability Engineer information

See Raleigh, NC salary details

$10

$61

$89

How much do site reliability engineer jobs pay per hour?

As of Jun 12, 2026, the average hourly pay for site reliability engineer in Raleigh, NC is $61.96, according to ZipRecruiter salary data. Most workers in this role earn between $53.27 and $70.82 per hour, depending on experience, location, and employer.

Will SRE be replaced by AI?

Site Reliability Engineers (SREs) focus on maintaining system reliability, automation, and incident response, and AI tools are increasingly used to assist these tasks. While AI can automate routine processes, SREs' expertise in system design, troubleshooting, and decision-making remains essential, making complete replacement unlikely in the near future.

What Is a Site Reliability Engineer?

A site reliability engineer specializes in site reliability engineering, or SRE, a specific branch of operations first pioneered by Google. You are responsible for ensuring that when a website decides to scale a particular feature for various users to access, it does not break the underlying software or website functions. This means you need to use analytical problem-solving skills to determine how to make specific features on a new software release work on top of existing source code.

What engineers make $300,000 a year?

Senior-level engineers such as Site Reliability Engineers, Software Engineers, and Cloud Infrastructure Engineers can earn $300,000 or more annually, especially with extensive experience, specialized skills, and working at large tech companies or in high-cost-of-living areas. Compensation often includes base salary, bonuses, and stock options, with expertise in automation, cloud platforms, and monitoring tools being highly valued.

What are the key skills and qualifications needed to thrive as a Site Reliability Engineer, and why are they important?

To thrive as a Site Reliability Engineer, you need a strong background in computer science, systems administration, and software engineering, often supported by a degree in a technical field. Familiarity with cloud platforms (like AWS or GCP), container orchestration (such as Kubernetes), infrastructure as code (Terraform or Ansible), and monitoring tools (Prometheus, Grafana) is typically expected. Strong problem-solving skills, effective communication, and a proactive mindset help SREs excel at incident management and cross-functional collaboration. These skills are crucial for maintaining system reliability, minimizing downtime, and driving continuous improvement in complex technical environments.

Is SRE a stressful job?

Site Reliability Engineers (SREs) often work in high-pressure environments where they monitor system performance, troubleshoot outages, and ensure uptime. The role can involve on-call duties and incident response, which may contribute to stress, but it also offers opportunities for automation and process improvements to reduce workload. Overall, stress levels vary depending on the organization, team culture, and individual skills.

What are some of the most common challenges Site Reliability Engineers face when balancing system reliability with rapid software delivery?

Site Reliability Engineers (SREs) often navigate the challenge of maintaining highly reliable systems while supporting fast-paced software releases. This involves managing incidents, automating processes to reduce manual toil, and working closely with development teams to embed reliability into the software development lifecycle. SREs must carefully prioritize their efforts between proactive improvements and urgent, reactive fire-fighting. Effective communication and collaboration with both operations and development teams are crucial to ensuring service uptime without slowing down innovation.

What does a Site Reliability Engineer do?

A Site Reliability Engineer (SRE) is responsible for maintaining and improving the reliability, availability, and performance of software systems. They use automation, monitoring tools, and scripting to prevent outages and resolve issues quickly, often working closely with development teams to ensure scalable infrastructure. SREs typically have skills in systems engineering, coding, and cloud platforms, and may hold certifications like those in cloud services or DevOps practices.

What is the difference between Site Reliability Engineer vs DevOps Engineer?

AspectSite Reliability EngineerDevOps Engineer
CredentialsTypically requires a computer science degree, certifications like AWS, Google Cloud, or KubernetesSimilar credentials, often with cloud certifications and scripting skills
Work EnvironmentFocuses on maintaining and improving system reliability, often in large-scale production environmentsWorks on automation, CI/CD pipelines, and deployment processes across development and operations teams
Industry UsageCommon in tech, cloud services, and large-scale enterprise companiesWidely used in software development, cloud, and IT organizations

Both roles require strong technical skills and cloud knowledge, but SREs focus more on system reliability and uptime, while DevOps engineers emphasize automation and deployment processes. They often collaborate but have distinct primary responsibilities.

What is a Site Reliability Engineer?

A Site Reliability Engineer (SRE) is a professional who applies software engineering principles to infrastructure and operations problems. Their primary goal is to create scalable and highly reliable software systems, often bridging the gap between development and IT operations. SREs automate tasks, monitor system health, respond to incidents, and work to improve system reliability and performance. They also help define service level objectives (SLOs) and ensure systems meet customer expectations for uptime and availability.
What are the most commonly searched types of Site Reliability Engineer jobs in Raleigh, NC? The most popular types of Site Reliability Engineer jobs in Raleigh, NC are:
What job categories do people searching Site Reliability Engineer jobs in Raleigh, NC look for? The top searched job categories for Site Reliability Engineer jobs in Raleigh, NC are:
What cities near Raleigh, NC are hiring for Site Reliability Engineer jobs? Cities near Raleigh, NC with the most Site Reliability Engineer job openings:
Infographic showing various Site Reliability Engineer job openings in Raleigh, NC as of June 2026, with employment types broken down into 76% Full Time, and 24% Contract. Highlights an 82% In-person, 12% Hybrid, and 6% Remote job distribution, with an average salary of $128,882 per year, or $62 per hour.

Principal Site Reliability Engineer

Fidelity Investments

Durham, NC • On-site

$55 - $73.25/hr

Full-time

Posted 2 days ago


Fidelity Investments rating

8.7

Company rating: 8.7 out of 10

Based on 264 frontline employees who took The Breakroom Quiz

14th of 138 rated financial services


Job description

Job Description:
Position Description:
Combines Operational excellence with Development experience to deliver services at high scale, high availability with resilience. Builds reliability into the ecosystem by applying best practices in Resiliency Engineering, Automation, Observability and Chaos Testing. Streamlines and accelerates software delivery cycle by using DevOps practices and toolchain. Integrates Site Reliability Engineering (SRE) practices (Observability and Chaos) with DevOps processes and delivery pipelines to stop bad code from reaching production. Ensures business-critical enterprise systems are continuously available to internal and external customers. Implements technical standardization and process refinements within the engineering organization and for Site Reliability Engineers. Collaborates with production support teams to define and implement processes for the identification, collection, and analysis of incident data. Brings together technical, procedural, and financial data to reduce toil and increase efficiency.
Primary Responsibilities:
  • Develops Chaos Testing capabilities using multiple Chaos Tools (AWS Fault Injection Service (FIS), Chaos Mesh, and Chaosd) and Chaos Toolkit.
  • Develops and enhances organization's internal Chaos Framework to streamline Chaos Executions and reporting.
  • Provides specialized technical expertise in the adoption of Chaos Engineering by application teams.
  • Chaos tests and observes business-critical applications to understand the weaknesses and increase application resiliency.
  • Activates Observability for the critical applications with recommended Service Level Indicators and Service Level Objectives for Latency, Availability, Error Rate etc.
  • Utilizes modern monitoring tools (Datadog, Splunk, Catchpoint etc.) to reduce mean time to detect an issue and improve the response times.
  • Creates CI/CD pipelines with security and quality checks with Application Lifecycle management toolchain. Helps in integrating Chaos and Observability with CI/CD pipelines.
  • Automates repetitive activities using scripting languages (Python, Groovy etc.).
  • Implements and supports solutions based on cloud platforms AWS/Azure and container orchestration Kubernetes.
  • Onboards /Evaluates New Cloud services that help to enhance the Resiliency of cloud ecosystem. Serves as a liaison for vendor engagement.
  • Participates in incident management, problem management and incident postmortems.
  • Takes part in peer code reviews providing qualitative feedback.
  • Builds processes and capabilities to adapt and respond to risks, and disruptions, while maintaining business operations and data recovery with minimal disruptions.
  • Coaches peer SREs and application teams on SRE and DevOps.
  • Implements Agile methodologies in the team's project completion using incremental and iterative steps.

Education and Experience:
Bachelor's degree in Computer Science, Engineering, Information Technology, Information Systems, or a closely related field (or foreign education equivalent) and five (5) years of experience as a Principal Site Reliability Engineer (or closely related occupation) implementing resilient container and cloud-based applications and infrastructure solutions, using DevOps or SRE practices, in a financial services environment.
Or, alternatively, Master's degree (or foreign education equivalent) in Computer Science, Engineering, Information Technology, Information Systems, or a closely related field (or foreign education equivalent) and three (3) years of experience as a Principal Site Reliability Engineer (or closely related occupation) implementing resilient container and cloud-based applications and infrastructure solutions, using DevOps or SRE practices, in a financial services environment.
Skills and Knowledge:
Candidate must also possess:
  • Demonstrated Expertise ("DE") improving application resiliency by implementing chaos engineering to build system's capability to withstand turbulent conditions in production, using Chaos Mesh, Chaosd, Azure Chaos Studio, AWS FIS, or Gremlin; and driving automation to implement scalable approaches for the planning, design, execution, and reporting of chaos testing using Jenkins pipelines, standard frameworks, data visualization, and dashboards.
  • DE implementing advanced observability practices and techniques in production and pre-production environments, at scale using Datadog, Splunk, or Catchpoint; tracking the error budget, proactively identifying issues, minimizing Mean Time to Repair (MTTR); and balancing customer expectations by implementing Service-Level Indicators (SLIs) and Service-Level Objectives (SLOs) using logs, traces, monitors and synthetic tests.
  • DE migrating and maintaining cloud applications and creating cloud solutions using Amazon Web Services (AWS) or Azure cloud services; Implementing infrastructure as code for cloud; Onboarding new AWS or Azure services with required reviews and security controls in non-production and production environments; and researching evolving cloud ecosystem to adopt machine learning based tools (AWS DevOps guru) to boost AIOps abilities.
  • DE implementing CI/CD pipelines in both production and non-production environments using Application Lifecycle Management (ALM) tools (JIRA, GitHub, Jenkins, SonarQube, Artifactory, or uDeploy) to enable faster code delivery, enhanced software quality, reliability, and security; and developing products, and core and common capabilities for the organization to reduce toil and drive standardization, using containerization and orchestration technologies (Docker or Kubernetes), Infrastructure as Code (IaC) tools, scripting languages (Python or Groovy), and engineering best practices.

#PE1M2
#LI-DNI
Certifications:
Category:
Information Technology
Please be advised that Fidelity's business is governed by the provisions of the Securities Exchange Act of 1934, the Investment Advisers Act of 1940, the Investment Company Act of 1940, ERISA, numerous state laws governing securities, investment and retirement-related financial activities and the rules and regulations of numerous self-regulatory organizations, including FINRA, among others. Those laws and regulations may restrict Fidelity from hiring and/or associating with individuals with certain Criminal Histories.

What Fidelity Investments employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom