
Site Reliability Engineer Jobs in Raleigh, NC (NOW HIRING)

Software Engineer

Morrisville, NC · On-site

$148K - $200K/yr

Job Summary As a Cloud Infrastructure / Site Reliability Engineer, you will operate at the intersection of development and operations. You will engage and enhance all aspects of the cloud services ...

As a Platform Network Engineer within our Site Reliability Engineering (SRE) organization, you'll play a pivotal role in building a secure, scalable, and high-performing network foundation that ...

Senior Software Engineer

Raleigh, NC

$119K - $157K/yr

Collaborate across engineering, product, and SRE. Work closely with platform, SRE, security, and product teams to understand their delivery needs and ensure the CI/CD platform supports them ...

DevOps / AWS Engineer

Raleigh, NC · On-site

$51.25 - $70.25/hr

Improve system reliability, scalability, and performance through SRE best practices. Security & Compliance * Implement AWS security best practices including IAM policies, encryption, secrets ...

Lead Infrastructure Engineer

Raleigh, NC · On-site

$104K - $137K/yr

Experience with Agile, CI/CD, DevOps concepts and Site Reliability Engineer (SRE) principles * Proficient on container-based solution services, have handled 2-3 large scale Kubernetes based ...

Site Reliability Engineer information

How much do site reliability engineer jobs pay per hour?

As of Jun 11, 2026, the average hourly pay for site reliability engineer in Raleigh, NC is $61.96, according to ZipRecruiter salary data. Most workers in this role earn between $53.27 and $70.82 per hour, depending on experience, location, and employer.
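For a rough sense of scale, the hourly figures above can be converted to yearly amounts. The sketch below assumes a full-time schedule of 40 hours per week for 52 weeks (2,080 hours); that schedule is an assumption made for illustration, not a ZipRecruiter figure, and `HOURS_PER_YEAR` and `annualize` are illustrative names.

```python
# Assumption: full-time work, 40 hours/week for 52 weeks = 2,080 paid hours.
HOURS_PER_YEAR = 40 * 52


def annualize(hourly_rate: float, hours_per_year: int = HOURS_PER_YEAR) -> float:
    """Convert an hourly rate to a yearly amount, rounded to cents."""
    return round(hourly_rate * hours_per_year, 2)


if __name__ == "__main__":
    # Hourly figures quoted in the paragraph above.
    for label, rate in [("low", 53.27), ("average", 61.96), ("high", 70.82)]:
        print(f"{label}: ${annualize(rate):,.2f}/yr")
```

Under that assumption, the quoted average of $61.96 per hour works out to about $128,877 per year. Contract roles with unpaid time off would come out lower.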

Will SRE be replaced by AI?

Site Reliability Engineers (SREs) focus on maintaining system reliability, automation, and incident response, and AI tools are increasingly used to assist these tasks. While AI can automate routine processes, SREs' expertise in system design, troubleshooting, and decision-making remains essential, making complete replacement unlikely in the near future.

What Is a Site Reliability Engineer?

A site reliability engineer specializes in site reliability engineering (SRE), a branch of operations pioneered by Google. You are responsible for ensuring that when a site rolls a feature out to more users, it does not break the underlying software or existing site functions. This means using analytical problem-solving skills to make the features in a new software release work on top of the existing source code.

What engineers make $300,000 a year?

Senior-level engineers such as Site Reliability Engineers, Software Engineers, and Cloud Infrastructure Engineers can earn $300,000 or more annually, especially with extensive experience, specialized skills, and working at large tech companies or in high-cost-of-living areas. Compensation often includes base salary, bonuses, and stock options, with expertise in automation, cloud platforms, and monitoring tools being highly valued.

What are the key skills and qualifications needed to thrive as a Site Reliability Engineer, and why are they important?

To thrive as a Site Reliability Engineer, you need a strong background in computer science, systems administration, and software engineering, often supported by a degree in a technical field. Familiarity with cloud platforms (like AWS or GCP), container orchestration (such as Kubernetes), infrastructure as code (Terraform or Ansible), and monitoring tools (Prometheus, Grafana) is typically expected. Strong problem-solving skills, effective communication, and a proactive mindset help SREs excel at incident management and cross-functional collaboration. These skills are crucial for maintaining system reliability, minimizing downtime, and driving continuous improvement in complex technical environments.

Is SRE a stressful job?

Site Reliability Engineers (SREs) often work in high-pressure environments where they monitor system performance, troubleshoot outages, and ensure uptime. The role can involve on-call duties and incident response, which may contribute to stress, but it also offers opportunities for automation and process improvements to reduce workload. Overall, stress levels vary depending on the organization, team culture, and individual skills.

What are some of the most common challenges Site Reliability Engineers face when balancing system reliability with rapid software delivery?

Site Reliability Engineers (SREs) often navigate the challenge of maintaining highly reliable systems while supporting fast-paced software releases. This involves managing incidents, automating processes to reduce manual toil, and working closely with development teams to embed reliability into the software development lifecycle. SREs must carefully prioritize their efforts between proactive improvements and urgent, reactive fire-fighting. Effective communication and collaboration with both operations and development teams are crucial to ensuring service uptime without slowing down innovation.

What does a Site Reliability Engineer do?

A Site Reliability Engineer (SRE) is responsible for maintaining and improving the reliability, availability, and performance of software systems. They use automation, monitoring tools, and scripting to prevent outages and resolve issues quickly, often working closely with development teams to ensure scalable infrastructure. SREs typically have skills in systems engineering, coding, and cloud platforms, and may hold certifications like those in cloud services or DevOps practices.

What is the difference between Site Reliability Engineer vs DevOps Engineer?

Aspect | Site Reliability Engineer | DevOps Engineer
Credentials | Typically requires a computer science degree, certifications like AWS, Google Cloud, or Kubernetes | Similar credentials, often with cloud certifications and scripting skills
Work Environment | Focuses on maintaining and improving system reliability, often in large-scale production environments | Works on automation, CI/CD pipelines, and deployment processes across development and operations teams
Industry Usage | Common in tech, cloud services, and large-scale enterprise companies | Widely used in software development, cloud, and IT organizations

Both roles require strong technical skills and cloud knowledge, but SREs focus more on system reliability and uptime, while DevOps engineers emphasize automation and deployment processes. They often collaborate but have distinct primary responsibilities.

What is a Site Reliability Engineer?

A Site Reliability Engineer (SRE) is a professional who applies software engineering principles to infrastructure and operations problems. Their primary goal is to create scalable and highly reliable software systems, often bridging the gap between development and IT operations. SREs automate tasks, monitor system health, respond to incidents, and work to improve system reliability and performance. They also help define service level objectives (SLOs) and ensure systems meet customer expectations for uptime and availability.
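As a concrete illustration of what defining an SLO involves, the sketch below turns an availability target into an error budget. The 99.9% target, the 30-day window, and the request counts are assumed example values, and the function names are illustrative rather than taken from any particular tool.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Downtime allowed by an availability SLO over the window, in minutes."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, total_requests: int,
                     failed_requests: int) -> float:
    """Fraction of the request-based error budget still unspent.

    Returns a negative value when the budget has been overspent.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Assumed example: a 99.9% availability SLO over a 30-day window.
    print(f"{error_budget_minutes(0.999):.1f} minutes of allowed downtime")
    # Assumed example traffic: 1,000,000 requests, 250 of them failed.
    print(f"{budget_remaining(0.999, 1_000_000, 250):.2f} of the budget left")
```

A team might slow feature releases when the remaining budget approaches zero and release more freely when most of it is unspent.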
Infographic showing various Site Reliability Engineer job openings in Raleigh, NC as of June 2026, with employment types broken down into 76% Full Time, and 24% Contract. Highlights an 82% In-person, 12% Hybrid, and 6% Remote job distribution, with an average salary of $128,882 per year, or $62 per hour.
Software Engineer

NetApp, Inc.

Morrisville, NC • On-site

$148K - $200K/yr

Full-time

Medical, Life, Retirement, PTO

Posted 19 days ago


Job description

Job Summary
As a Cloud Infrastructure / Site Reliability Engineer, you will operate at the intersection of development and operations. You will engage and enhance all aspects of the cloud services lifecycle from design through deployment, operation, and refinement. You will be responsible for maintaining these services by measuring and monitoring their availability, latency, and overall system health and building automation for efficient cloud operations management.
You will play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity. As part of your responsibilities, you will administer cloud-based environments that support our SaaS/IaaS offerings implemented on a microservices, container-based architecture (Kubernetes). In addition, you will oversee a portfolio of customer-centric cloud services (SaaS/IaaS), ensuring their overall availability, performance, and security. You will work closely with NetApp and cloud service provider teams (including Azure) from Research Triangle Park (RTP), D.C., Pittsburgh, and more.
Due to the critical nature of the services we support, this position involves participation in a rotation-based on-call schedule as part of our global team. This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of vital cloud services. To be successful in this role, you should be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges.
Responsibilities
  • Automation and Efficiency: Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction. Develop software for deployment automation, packaging, and monitoring visibility.
  • Team Collaboration and Influence: Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure. Consult and influence developers on new feature development and software architecture to ensure scalability.
  • Debugging, Troubleshooting, and Advanced Support: Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack. Additionally, provide advanced tier 2 and 3 support for NetApp's Cloud Data Service solutions.
  • Analysis and Infrastructure Maintenance: Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, Elasticsearch, Grafana, and SolarWinds. Develop strategies to enhance system and application performance, availability, and reliability. In addition, maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure.
  • Incident Response and Troubleshooting: Address and perform Root Cause Analysis (RCA) of complex live production incidents and cross-platform issues involving OS, Networking, and Database in cloud-based SaaS/IaaS environments. Implement SRE best practices for effective resolution.
  • Document system knowledge as you acquire it, create runbooks, and ensure critical system information is readily accessible.
  • Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
  • Issue Tracking and Resolution: Use Atlassian's tool chain along with first party cloud service management tools to track and resolve issues based on their priority.
  • Directly influence the decisions and outcomes related to solution implementation: measure and monitor availability, latency, and overall system health.

Job Requirements
  • 5+ years of experience in scripting and infrastructure automation using tools such as PowerShell, Python, Go, or Ruby.
  • Deep working knowledge of Containers, Kubernetes, Serverless computing implementation, and distributed systems design patterns. Knowledge of DevOps/SRE development methodologies.
  • Proficiency in Linux/Unix and CoreOS. Experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Ability to lead a scrum team, influence stakeholders to effectively maintain a product backlog, and manage sprints.
  • This position includes on-call rotations and may require working odd hours.

Education
  • A Bachelor of Science degree in Computer Science, a master's degree, or equivalent experience is required.

Compensation:
The target salary range for this position is $148,000 - $200,000. The salary offered will be determined by the candidate's location, qualifications, experience, and education and may be outside of this range. Final compensation packages are competitive and in line with industry standards, reflecting a variety of factors, and include a comprehensive benefits package. This may cover Health Insurance, Life Insurance, Retirement or Pension Plans, Paid Time Off, various Leave options, Performance-Based Incentives, an employee stock purchase plan, and/or restricted stock units (RSUs), with all offerings subject to regional variations and governed by local laws, regulations, and company policies. Benefits may vary by country and region, and further details will be provided as part of the recruitment process.