1

Linux Site Reliability Engineer Jobs (NOW HIRING)

Site Reliability Engineer - SRE

Atlanta, GA · On-site +1

$54.25 - $72/hr

Role: Site Reliability Engineer * Location: Atlanta, GA OR Dallas OR Austin, TX * Duration ... Proficient in a Linux or Unix based environment. * Proficiency in supporting a 24x7 operation.

Site Reliability Engineer - SRE

Atlanta, GA · On-site

$54.25 - $72/hr

Role: Site Reliability Engineer * Location: Atlanta, GA OR Dallas OR Austin, TX * Duration ... Proficient in a Linux or Unix based environment. * Proficiency in supporting a 24x7 operation.

As a SRE, you will be responsible for maintaining and improving uptime and availability across ... Linux in-depth knowledge. * Knowledge of one of the programming languages (see Preferable ...

As a SRE, you will be responsible for maintaining and improving uptime and availability across ... Linux in-depth knowledge. * Knowledge of one of the programming languages (see Preferable ...

SRE Engineer

San Jose, CA · On-site

$66.75 - $88.75/hr

San Jose, CA / RTP, NC(Onsite) Job Type: Full Time Must Have Technical/Functional Skills: * SRE, NetApp Storage, Linux Certified, Kubernetes Certified, DevOps, Docker, etc. * Experienced Senior SRE ...

We are looking for the right Site Reliability Engineer to help us take our efforts to the next ... Linux, Python, Docker, Kubernetes, Postgres, Redis, along with operations and monitoring ...

SITE RELIABILITY ENGINEER

Camden, NJ · On-site

$130K - $150K/yr

Site Reliability Engineer (SRE) Engineer Reliability into the Systems That Move the Nation's Food ... Strong Linux and Windows systems administration and troubleshooting skills * Hands-on experience ...

$57.75 - $76.75/hr

Site Reliability Engineer (SRE) Department: Technology Location: Manila Reporting To: Head of Infra ... Linux administration and command-line debugging. * Hands-on with AWS (preferred) or GCP cloud ...

Site Reliability Engineer

Frederick, MD · On-site

$56.75 - $75.25/hr

Must have 4+ years of Hands-on Linux experience that includes Ubuntu/CentOS/Red Hat operating ... and SRE use cases * Must have proficiency to debug or troubleshoot and/or deploying SQL and/or ...

next page

Showing results 1-20

Linux Site Reliability Engineer information

See salary details

$10

$63

$91

How much do linux site reliability engineer jobs pay per hour?

As of Jun 21, 2026, the average hourly pay for linux site reliability engineer in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are some common challenges faced by Linux Site Reliability Engineers when scaling infrastructure, and how can they be addressed?

Linux Site Reliability Engineers often encounter challenges related to maintaining system stability and performance as infrastructure scales. Issues such as configuration drift, automation bottlenecks, and monitoring gaps can arise when managing numerous servers or services. Addressing these challenges typically involves implementing robust configuration management tools, investing in automated deployment pipelines, and enhancing observability through comprehensive monitoring and alerting solutions. Collaboration with development and operations teams is essential to ensure that scalability solutions align with business needs and technical requirements.

What are the key skills and qualifications needed to thrive as a Linux Site Reliability Engineer, and why are they important?

To thrive as a Linux Site Reliability Engineer, you need deep expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking concepts, usually backed by a computer science degree or equivalent experience. Familiarity with configuration management tools (like Ansible, Puppet, or Chef), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, or Azure) is typically required, along with relevant certifications like RHCE or AWS Certified SysOps Administrator. Strong problem-solving skills, effective communication, and the ability to work under pressure are crucial soft skills for this role. These competencies ensure the reliability, scalability, and security of complex infrastructure, minimizing downtime and supporting seamless operations.

Who gets paid more, SRE or DevOps?

Generally, Site Reliability Engineers (SREs) tend to have higher salaries than DevOps engineers due to their specialized focus on system reliability, automation, and incident management. Both roles require strong skills in cloud platforms, scripting, and monitoring tools, but SREs often have more advanced expertise in reliability engineering practices, which can lead to higher compensation.

Will AI replace SRE jobs?

AI is expected to augment the work of Linux Site Reliability Engineers by automating routine tasks such as monitoring, incident response, and log analysis. However, SRE roles require complex problem-solving, system design, and decision-making that currently cannot be fully replaced by AI, making human expertise essential. SREs will likely focus more on overseeing automation tools and managing system reliability rather than being replaced entirely.

What engineer makes $500,000 a year?

A senior Linux Site Reliability Engineer or similar high-level engineering roles in cloud infrastructure and large-scale systems can earn $500,000 or more annually, especially with bonuses and stock options. These positions typically require extensive experience, advanced skills in automation, scripting, and cloud platforms, and often involve leadership responsibilities.

What engineers make $300,000 a year?

Senior Linux Site Reliability Engineers with extensive experience, advanced skills in automation, cloud platforms, and monitoring tools can earn $300,000 or more annually, especially in high-cost-of-living areas or large tech companies. Achieving this salary often requires specialized certifications, leadership roles, and a strong track record of managing complex infrastructure at scale.

What is the difference between Linux Site Reliability Engineer vs Linux DevOps Engineer?

AspectLinux Site Reliability EngineerLinux DevOps Engineer
CredentialsLinux certifications, SRE-specific trainingLinux certifications, DevOps tools certifications
Work EnvironmentFocus on system reliability, monitoring, incident responseFocus on automation, CI/CD pipelines, deployment
Employer & IndustryTech companies, cloud providers, large enterprisesStartups, tech firms, software development teams
Search & Comparison IntentUnderstanding reliability roles, incident managementAutomation, deployment, continuous integration

While both roles involve Linux expertise, a Linux Site Reliability Engineer primarily focuses on maintaining system reliability, monitoring, and incident response. In contrast, a Linux DevOps Engineer emphasizes automation, continuous integration, and deployment processes. Both roles require Linux skills and often overlap, but their core responsibilities differ based on organizational needs.

What is a Linux Site Reliability Engineer?

A Linux Site Reliability Engineer (SRE) is an IT professional responsible for ensuring the reliability, scalability, and performance of systems running on the Linux operating system. They bridge the gap between software development and operations by automating processes, monitoring infrastructure, and managing incidents. Linux SREs focus on system availability, building tools for deployment and monitoring, and improving system robustness through best practices and automation. Their work helps organizations deliver reliable online services and quickly recover from outages or system failures.
More about Linux Site Reliability Engineer jobs
What cities are hiring for Linux Site Reliability Engineer jobs? Cities with the most Linux Site Reliability Engineer job openings:
What states have the most Linux Site Reliability Engineer jobs? States with the most job openings for Linux Site Reliability Engineer jobs include:
What job categories do people searching Linux Site Reliability Engineer jobs look for? The top searched job categories for Linux Site Reliability Engineer jobs are:
Sr. IT Linux Site Reliability Engineer

Sr. IT Linux Site Reliability Engineer

SpaceX

Cape Canaveral, FL • On-site

$48.25 - $64.25/hr

Other

Posted 8 days ago


SpaceX rating

8.7

Company rating: 8.7 out of 10

Based on 144 frontline employees who took The Breakroom Quiz

13th of 60 rated aerospace companies


Job description

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.
SR. LINUX SITE RELIABILITY ENGINEER
SpaceX is looking for an experienced engineer with deep working knowledge of Kubernetes and related containerized technologies. This employee will be a member of the Information Technology Linux Infrastructure team and will provide expertise in Kubernetes design, maintenance, scaling and optimization in support of critical business functions. The ideal candidate will be flexible and flourish in a fast paced and challenging environment. They should be a self-starter, self-motivator and possess ingenuity to excel at this position.
RESPONSIBILITIES:
  • Build, install, manage, scale and optimize Kubernetes and RKE clusters using Ansible, Terraform and adjacent technologies in production environments.
  • Work closely with other SpaceX engineers to gather requirements, research, evaluate, design, plan, deploy, and support software platforms and related technologies running in Kubernetes within a world-class environment that meets the needs of the demanding SpaceX engineering teams. Build highly resilient, high-performance, scalable, and robust systems.
  • Exercise a high degree of personal responsibility for the processes, systems, and tools you create and manage; all supporting the goal of making humanity an interplanetary species.
  • Make recommendations, justify, and implement improvements using an accepted change control methodology.
  • Work within a diverse group to design and deliver creative solutions and resolve problems in a timely and proactive manner by interacting with internal business units.
  • Define, document and follow standards and best practices for systems design, testing, and implementation.
  • Foster an environment of collaboration and cross-training, upskilling the team in Kubernetes expertise and ensuring peers are developed into capable engineers.
  • Drive scripting, self-service and automation to develop solutions to reduce administrative overhead and TOIL.
  • Participate in on-call rotation to handle urgent after-hours work when necessary.

BASIC QUALIFICATIONS:
  • Bachelor's degree in Computer Science or a STEM discipline and 5+ years of systems engineering experience; OR 7+ years of systems engineering experience in lieu of a degree.
  • Experience deploying and supporting Linux servers in physical and virtualized environments (e.g. VMware via automation).
  • Experience with the Linux shell as well as configuring and extending Linux instances (e.g. kernel modules, cgroups, pki, iptables, interfaces).
  • Experience supporting and scaling containerized applications in Linux environments.
  • Experience using automation frameworks (e.g. Ansible, Terraform) to manage provisioning and post-provisioning lifecycles of infrastructure and Kubernetes installations.

PREFERRED SKILLS AND EXPERIENCE:
  • Expertise in creating repeatable, reliable, scalable systems architectures, with high availability, fault tolerance, performance tuning, monitoring, and statistics/metrics collection.
  • Expertise in source code version control tools such as Git and Subversion and collaborating on source code via Pull Requests and other Git-based workflows.
  • Strong understanding of Linux Container Runtime.
  • Experience implementing configuration management provisioning and workflow automation solutions via Infrastructure as Code, CI/CD and GitOps (e.g. Ansible, AWX/Tower, Vagrant, Puppet, Redfish, Jenkins, cloud-init, ArgoCD, etc).
  • Experience writing test automation to ensure backwards compatibility of feature and change development for automation processes and Kubernetes deployments.
  • Experience with programming and scripting languages such as Python and Golang to develop software solutions and integrate with external systems to implement automation against RESTful API services.
  • Experience installing, configuring and troubleshooting Kubernetes internals, CNI, CRI and CSI plugins (e.g. Docker, Cri-O, Ceph, Cilium), load balancing (e.g. MetalLB), Service Mesh (e.g. Istio) and software-defined storage (e.g. rook-ceph) in cloud or on-premise environments.
  • Experience developing solutions using Kubernetes patterns to extend system functionality and solve custom use cases (e.g. webhooks, controllers, operators, sidecars).
  • Experience implementing proactive alert/monitoring workflows and dashboards for Linux systems and Kubernetes deployments using Prometheus, Grafana, InfluxDB or similar technologies.
  • Experience with dynamic system configuration templating using Jinja, Jsonnet, YAML and Helm.

ADDITIONAL REQUIREMENTS:
  • Must be willing to work extended hours and weekends as needed.
  • Ability to pass Air Force background check for Cape Canaveral.

ITAR REQUIREMENTS:
    Learn more about the ITAR here.

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.
Applicants wishing to view a copy of SpaceX's Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process should reach out to

What SpaceX employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom