1

Linux Site Reliability Engineer Jobs (NOW HIRING)

Site Reliability Engineer (SRE)

Austin, TX · On-site

$56.50 - $75/hr

Site Reliability Engineer (SRE) Location: Austin, TX Job Type: Full Time Job Summary - Seasoned ... Linux/Unix. · Monitor the application/Services/batch availability. · Act quickly on the ...

Sr. IT Linux Site Reliability Engineer

Hawthorne, CA · On-site

$57.75 - $76.75/hr

SpaceX is actively developing technologies to enable human life on Mars, and they are seeking a Sr. IT Linux Site Reliability Engineer to join their Information Technology Linux Infrastructure team.

SRE

Austin, TX · On-site

$56.50 - $75/hr

Job Title: SRE Location: Austin, TX/ Sunnyvale, CA Skills * Core Java * Advanced Java * Advanced ... Provide deep Linux systems expertise , including performance tuning, debugging, and incident ...

Site Reliability Engineer SRE - ML platform Location: Austin, TX OR Sunnyvale, CA Type: FTE Salary ... Proficient with Linux administration. * Knowledge of ML models and LLM. * Ability to understand ...

New

Site Reliability Engineer

Austin, TX · On-site

$56.50 - $75/hr

Site Reliability Engineer SRE - ML platform Location: Austin, TX OR Sunnyvale, CA Title: Site ... Proficient with Linux administration. * Knowledge of ML models and LLM. * Ability to understand ...

Site Reliability Engineer

Irondale, AL · On-site

$48.25 - $64/hr

Site Reliability Engineer Site Reliability Engineer (SRE) Hybrid Opportunity | Enterprise Cloud ... Architecture-level knowledge of Linux and Windows systems * Experience with CI/CD pipelines and ...

Site Reliability Engineer

Chicago, IL · On-site

$58.75 - $78/hr

NET workloads running on Windows and Linux containers in AWS environments. The role is focused on the applications and technology underpinning the PartsTrader customer-facing products. The SRE will ...

Site Reliability Engineer

Sterling, VA · On-site

$56.50 - $75/hr

The Site Reliability Engineer (SRE) collaboratively works closely with the contract leadership ... Linux/Unix Systems Administration: Strong knowledge of Linux/Unix operating systems, including ...

Troubleshoot and optimize Linux-based systems in production environments * Support production systems including on-call, incident response, and RCA * Collaborate with SRE and Security teams to ensure ...

SRE

Framingham, MA

$62 - $82.50/hr

Role: SRE Location: Framingham, MA [Onsite] W2 Only 7+ years of IT experience with strong exposure ... Strong understanding of Linux systems, networking fundamentals, and application runtime diagnostics.

Site Reliability Engineer

Chicago, IL

$58.75 - $78/hr

NET workloads running on Windows and Linux containers in AWS environments. The role is focused on the applications and technology underpinning the PartsTrader customer-facing products. The SRE will ...

Site Reliability Engineer

Sterling, VA · On-site

$56.50 - $75/hr

Site Reliability Engineer Location: Sterling, VA Clearance: TS/SCI Poly **This position is ... Linux/Unix Systems Administration: Strong knowledge of Linux/Unix operating systems, including ...

next page

Showing results 1-20

Linux Site Reliability Engineer information

See salary details

$10

$63

$91

How much do linux site reliability engineer jobs pay per hour?

As of Jun 21, 2026, the average hourly pay for linux site reliability engineer in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are some common challenges faced by Linux Site Reliability Engineers when scaling infrastructure, and how can they be addressed?

Linux Site Reliability Engineers often encounter challenges related to maintaining system stability and performance as infrastructure scales. Issues such as configuration drift, automation bottlenecks, and monitoring gaps can arise when managing numerous servers or services. Addressing these challenges typically involves implementing robust configuration management tools, investing in automated deployment pipelines, and enhancing observability through comprehensive monitoring and alerting solutions. Collaboration with development and operations teams is essential to ensure that scalability solutions align with business needs and technical requirements.

What are the key skills and qualifications needed to thrive as a Linux Site Reliability Engineer, and why are they important?

To thrive as a Linux Site Reliability Engineer, you need deep expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking concepts, usually backed by a computer science degree or equivalent experience. Familiarity with configuration management tools (like Ansible, Puppet, or Chef), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, or Azure) is typically required, along with relevant certifications like RHCE or AWS Certified SysOps Administrator. Strong problem-solving skills, effective communication, and the ability to work under pressure are crucial soft skills for this role. These competencies ensure the reliability, scalability, and security of complex infrastructure, minimizing downtime and supporting seamless operations.

Who gets paid more, SRE or DevOps?

Generally, Site Reliability Engineers (SREs) tend to have higher salaries than DevOps engineers due to their specialized focus on system reliability, automation, and incident management. Both roles require strong skills in cloud platforms, scripting, and monitoring tools, but SREs often have more advanced expertise in reliability engineering practices, which can lead to higher compensation.

Will AI replace SRE jobs?

AI is expected to augment the work of Linux Site Reliability Engineers by automating routine tasks such as monitoring, incident response, and log analysis. However, SRE roles require complex problem-solving, system design, and decision-making that currently cannot be fully replaced by AI, making human expertise essential. SREs will likely focus more on overseeing automation tools and managing system reliability rather than being replaced entirely.

What engineer makes $500,000 a year?

A senior Linux Site Reliability Engineer or similar high-level engineering roles in cloud infrastructure and large-scale systems can earn $500,000 or more annually, especially with bonuses and stock options. These positions typically require extensive experience, advanced skills in automation, scripting, and cloud platforms, and often involve leadership responsibilities.

What engineers make $300,000 a year?

Senior Linux Site Reliability Engineers with extensive experience, advanced skills in automation, cloud platforms, and monitoring tools can earn $300,000 or more annually, especially in high-cost-of-living areas or large tech companies. Achieving this salary often requires specialized certifications, leadership roles, and a strong track record of managing complex infrastructure at scale.

What is the difference between Linux Site Reliability Engineer vs Linux DevOps Engineer?

AspectLinux Site Reliability EngineerLinux DevOps Engineer
CredentialsLinux certifications, SRE-specific trainingLinux certifications, DevOps tools certifications
Work EnvironmentFocus on system reliability, monitoring, incident responseFocus on automation, CI/CD pipelines, deployment
Employer & IndustryTech companies, cloud providers, large enterprisesStartups, tech firms, software development teams
Search & Comparison IntentUnderstanding reliability roles, incident managementAutomation, deployment, continuous integration

While both roles involve Linux expertise, a Linux Site Reliability Engineer primarily focuses on maintaining system reliability, monitoring, and incident response. In contrast, a Linux DevOps Engineer emphasizes automation, continuous integration, and deployment processes. Both roles require Linux skills and often overlap, but their core responsibilities differ based on organizational needs.

What is a Linux Site Reliability Engineer?

A Linux Site Reliability Engineer (SRE) is an IT professional responsible for ensuring the reliability, scalability, and performance of systems running on the Linux operating system. They bridge the gap between software development and operations by automating processes, monitoring infrastructure, and managing incidents. Linux SREs focus on system availability, building tools for deployment and monitoring, and improving system robustness through best practices and automation. Their work helps organizations deliver reliable online services and quickly recover from outages or system failures.
More about Linux Site Reliability Engineer jobs
What cities are hiring for Linux Site Reliability Engineer jobs? Cities with the most Linux Site Reliability Engineer job openings:
What states have the most Linux Site Reliability Engineer jobs? States with the most job openings for Linux Site Reliability Engineer jobs include:
What job categories do people searching Linux Site Reliability Engineer jobs look for? The top searched job categories for Linux Site Reliability Engineer jobs are:

Site Reliability Engineer (SRE)

Hirekeyz Inc

Austin, TX • On-site

$56.50 - $75/hr

Contractor

Posted 21 days ago


Job description

Job Title: Site Reliability Engineer (SRE)

Location: Austin, TX

Job Type: Full Time

Job Description:

Job Summary –

Seasoned Site Reliability Engineer (SRE) with 7+ years of experience in supporting complex, large-scale distributed systems. Highly skilled in managing production failures, conducting root cause analysis, and driving effective remediation. Strong communicator with expertise in ing, monitoring, and release management, complemented by automation proficiency and a keen ability to learn quickly.

This role involves providing 24/7 support as part of the SRE team, ensuring the reliability and performance of mission-critical Java, .NET, and Batch applications deployed across GCP, PCF, and on-premises environments.

Years of experience needed –

Candidate experience – 7+ Years

Technical Skills:

· Expertise in understanding large scale production systems and technologies, for example load balancing, monitoring, distributed systems, microservices, and configuration management.

· Should have solid hands-on experience in troubleshooting and fixing application failures, application Performance degradation, Code issues, cloud platform issues, Batch Failures, infra failures, DB failures, Network failures.

· Hands-on experience in performing Production deployments using CI/CD and exposure to deployment strategies.

· Experience in troubleshooting of Linux/Unix.

· Monitor the application/Services/batch availability.

· Act quickly on the application s(Performance, Availability) and Batch Job failures

· Perform the required analysis (Code/Log) and escalate to the Engineering team as required.

· Initiate and drive the Techlines in case of outages/major incidents/Batch abends and ensure Service Restoration in the least time possible.

· Effectively handle the Incident, Problem, Release and Change management.

· Own and deliver the user stories assigned as part of the sprint.

o The user stories range from application code Debugging, Issue analysis, Code fix, Knowledge base creation, documentation of SOP’s, Production Deployments, Pre & Post Patching/Maintenance activities, Service Requests.

o Build monitoring solutions using APM tools like Splunk, Appdynamics, Thousand Eyes, ITRS, AppMetrics, MoogSoft, Kafka etc.

o Automate of day-day operational tasks.

o Be part of the Exit reviews to ensure the best practices are followed to have the right code deployed to Production systems

o Provide feedback/recommend improvements to the system which would enable highly stable systems.

· Strong understanding of Networking Concepts (TCP/IP, SSL/TLS, IPSec, VPN etc), Firewall and Load Balancers.

· Experience in Scripting – Shell/Powershell/Python

· Strong Experience in working with any Cloud-based infrastructure (PCF, GCP, AWS, Azure Cloud or others)