Linux SRE Jobs (NOW HIRING)

Site Reliability Engineer

Chicago, IL · On-site

$58.75 - $78/hr

.NET workloads running on Windows and Linux containers in AWS environments. The role is focused on the applications and technology underpinning the PartsTrader customer-facing products. The SRE will ...

Site Reliability Engineering

Charlotte, NC · On-site

$55.75 - $74/hr

Job Title: Site Reliability Engineer Location: Charlotte, NC (Onsite) Experience: 10+ Years Job ... Strong Linux/Unix administration skills Key Responsibilities * Monitor and maintain application and ...

Site Reliability Engineer

Sterling, VA · On-site

$56.50 - $75/hr

Site Reliability Engineer Location: Sterling, VA Clearance: TS/SCI Poly **This position is ... Linux/Unix Systems Administration: Strong knowledge of Linux/Unix operating systems, including ...

Site Reliability Engineer - SRE

Atlanta, GA · On-site +1

$54.25 - $72/hr

Role: Site Reliability Engineer * Location: Atlanta, GA OR Dallas OR Austin, TX * Duration ... Proficient in a Linux or Unix based environment. * Proficiency in supporting a 24x7 operation.

As an SRE, you will be responsible for maintaining and improving uptime and availability across ... Linux in-depth knowledge. * Knowledge of one of the programming languages (see Preferable ...

SRE Engineer

San Jose, CA · On-site

$66.75 - $88.75/hr

San Jose, CA / RTP, NC(Onsite) Job Type: Full Time Must Have Technical/Functional Skills: * SRE, NetApp Storage, Linux Certified, Kubernetes Certified, DevOps, Docker, etc. * Experienced Senior SRE ...

We are looking for the right Site Reliability Engineer to help us take our efforts to the next ... Linux, Python, Docker, Kubernetes, Postgres, Redis, along with operations and monitoring ...

SITE RELIABILITY ENGINEER

Camden, NJ · On-site

$130K - $150K/yr

Site Reliability Engineer (SRE) Engineer Reliability into the Systems That Move the Nation's Food ... Strong Linux and Windows systems administration and troubleshooting skills * Hands-on experience ...

$57.75 - $76.75/hr

Site Reliability Engineer (SRE) Department: Technology Location: Manila Reporting To: Head of Infra ... Linux administration and command-line debugging. * Hands-on with AWS (preferred) or GCP cloud ...

Site Reliability Engineer (SRE)

San Francisco, CA · On-site

$67.25 - $89.25/hr

The Site Reliability Engineer (SRE) will contribute to the stability and performance of Mithril ... Willing to pick up new languages as needed. • Linux fundamentals: strong command of Linux systems ...

Linux SRE information

How much do Linux SRE jobs pay per hour?

As of Jun 8, 2026, the average hourly pay for a Linux SRE in the United States is $63.74, according to ZipRecruiter salary data. Most workers in this role earn between $54.81 and $72.84 per hour, depending on experience, location, and employer.

What are some common challenges faced by Linux SREs when managing large-scale deployments?

Linux Site Reliability Engineers (SREs) often encounter challenges such as maintaining high availability across distributed systems, automating repetitive tasks, and ensuring smooth deployments with minimal downtime. Troubleshooting incidents under pressure and balancing on-call responsibilities can also be demanding. Effective collaboration with development and operations teams is crucial to proactively identify and resolve performance bottlenecks, security vulnerabilities, and scalability issues.

What is a Linux SRE?

A Linux SRE (Site Reliability Engineer) is a professional responsible for maintaining, automating, and improving the reliability, scalability, and performance of systems running on Linux. They bridge the gap between software development and IT operations, ensuring that services remain available and efficient. Linux SREs use their expertise in Linux systems, automation tools, and monitoring to proactively prevent issues and respond quickly to incidents. Their work often involves coding, infrastructure management, and implementing best practices for system reliability.

What is the difference between a Linux SRE and a Linux System Administrator?

Aspect | Linux SRE | Linux System Administrator
Certifications | Linux Foundation certifications, AWS, Kubernetes | CompTIA Linux+, LPIC, RHCSA
Work Environment | Cloud, automation, scripting, large-scale systems | On-premises, server management, user support
Responsibilities | Automation, reliability, monitoring, incident response | System setup, maintenance, user management

Linux SRE and Linux System Administrator roles share foundational Linux skills and certifications. However, the Linux SRE role focuses more on automation, scalability, and system reliability in cloud environments, while Linux System Administrators primarily handle server setup and maintenance. Both roles are vital in IT, but the Linux SRE role often involves more scripting and proactive system management.

What are the key skills and qualifications needed to thrive as a Linux SRE, and why are they important?

To thrive as a Linux SRE (Site Reliability Engineer), you need strong expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking and cloud infrastructure, often supported by a degree in computer science or related certifications like RHCE. Familiarity with configuration management tools (e.g., Ansible, Puppet), CI/CD pipelines, monitoring solutions (e.g., Prometheus, Grafana), and version control systems like Git is typically required. Problem-solving, effective communication, and a proactive mindset are crucial soft skills for excelling in this role. These skills ensure high system reliability, rapid incident response, and efficient collaboration, which are vital for maintaining robust and scalable production environments.

What states have the most Linux SRE jobs?

States with the most job openings for Linux SRE jobs include:

[Infographic: Linux SRE job openings in the United States as of May 2026. Employment types: 88% Full Time, 6% Part Time, 6% Contract. Work setting: 77% Physical, 7% Hybrid, 16% Remote. Average salary: $132,583 per year, or $63.7 per hour.]

Site Reliability Engineer

Enlyte

Chicago, IL • On-site

$58.75 - $78/hr

Full-time

Medical, Dental, Vision, Life, Retirement

Posted 24 days ago


Job description

Company Overview

At Enlyte, we combine innovative technology, clinical expertise, and human compassion to help people recover after workplace injuries or auto accidents. We support their journey back to health and wellness through our industry-leading solutions and services. Whether you're supporting a Fortune 500 client or a local business, developing cutting-edge technology, or providing clinical services you'll work alongside dedicated professionals who share your commitment to excellence and make a meaningful impact. Join us in fueling our mission to protect dreams and restore lives, while building your career in an environment that values collaboration, innovation, and personal growth.

Be part of a team that makes a real difference.


Job Description

The Site Reliability Engineer (SRE) is responsible for ensuring the reliability, availability, and performance of critical technology services and platforms. This role emphasizes proactive response, service level management, and technical leadership in observability, with a particular focus on supporting .NET workloads running on Windows and Linux containers in AWS environments. The role is focused on the applications and technology underpinning the PartsTrader customer-facing products. The SRE will collaborate closely with technology teams to identify and remediate risks, drive continuous improvement, and maintain operational excellence in an evolving microservices architecture that requires a high degree of availability.

Expected Hours: Monday - Friday (8am to 5pm), with flexibility to meet with New Zealand stakeholders as needed.

Environment: Onsite 4 days/week; Remote Fridays

Key Accountabilities and Responsibilities:

Main Responsibilities

  • Incident Response & Management: Lead and participate in the full incident lifecycle, including detection, triage, escalation, resolution, and post-incident reviews. Maintain readiness for high-priority incidents and ensure timely communication and documentation.
  • Observability & Monitoring: Implement, maintain, and optimize observability tools such as New Relic for distributed microservices. Develop and refine dashboards, alerts, and analytics to proactively detect issues and improve system reliability.
  • Service Level Management: Define, measure, and report on Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs). Provide regular reporting on service-health and performance to stakeholders.
  • AI-Driven Operations: Design and operate AI-enabled SRE workflows, including LLM-assisted incident triage, post-incident analysis, and runbook automation. Explore agentic approaches to reduce manual toil and improve speed and consistency of operational responses.

General

  • Technical Support & Troubleshooting: Provide expert support for .NET workloads deployed on Windows and Linux containers, with a focus on AWS infrastructure. Troubleshoot complex issues across applications, platforms, and network layers.
  • Continuous Improvement: Collaborate with engineering and DevOps teams to identify opportunities for automation, reliability enhancements, and process improvements. Participate in root cause analysis and implement corrective actions.
  • Documentation & Knowledge Sharing: Create and maintain technical documentation, incident records, runbooks, and best practices for operational processes.
  • Collaboration: Work effectively with cross-functional teams, including developers, QA, product managers, and business stakeholders, to ensure alignment on reliability goals and incident action plans.
  • Maintain a high level of professionalism with regard to attitude, conduct, appearance, confidentiality and service excellence.
  • Effectively engage with internal customers via email, telephone, and in person to provide guidance and support.
  • Demonstrate sense of urgency, initiative, responsiveness and attention to detail.

Technical Support

  • Support the technology teams in optimizing .NET applications deployed on Windows and Linux containers in AWS cloud environments to enhance reliability and supportability.
  • Configure, maintain, and enhance observability tooling frameworks for monitoring microservices, logging, and tracing.
  • Assist with deployment, scaling, and maintenance of containerized workloads using AWS ECS.
  • Serve as a technical escalation point for production issues, ensuring rapid resolution and minimal business impact.
  • Maintain and improve CI/CD pipelines and automation supporting reliable application delivery.

Qualifications
  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent professional experience.
  • 5 years of proven experience in site reliability engineering, incident response, and operational support for cloud-based applications.
  • Demonstrated expertise with observability and monitoring tools in microservice architectures.
  • Strong proficiency with AWS services, including EC2, ECS/EKS, CloudWatch, IAM, and networking.
  • Expert communicator in written, verbal, and diagrammatic mediums, able to effectively interact with and present to all levels of the organization.
  • Ability to get up to speed quickly in new technical or business domains.
  • Ability to work after hours or weekends as required.

Preferred Skills:

  • Extensive experience in incident management, escalation procedures, and service level reporting.
  • Strong commitment to delivering exceptional service and operational excellence.
  • Ability to anticipate potential impacts, think strategically, and proceed proactively during high priority incidents.
  • Exceptional interpersonal and “soft” skills, demonstrated by building strong relationships, influencing peers and senior stakeholders, and navigating conflict to achieve successful outcomes.
  • Advanced problem analysis and solving skills for complex technical issues.
  • Familiarity with CI/CD tools, infrastructure-as-code, and automation frameworks.
  • Knowledge of container orchestration platforms (e.g., Kubernetes) and related AWS services.
  • Familiarity with AI tooling that can assist in incident response and site reliability activities.

Benefits

We’re committed to supporting your ultimate well-being through our total compensation package offerings that support your health, wealth and self. These offerings include Medical, Dental, Vision, Health Savings Accounts / Flexible Spending Accounts, Life and AD&D Insurance, 401(k), Tuition Reimbursement, and an array of resources that encourage a lifetime of healthier living. Benefits eligibility may differ depending on full-time or part-time status. Compensation depends on the applicable US geographic market. The expected base pay for this position ranges from $91,000 - $110,000 annually, and will be based on a number of additional factors including skills, experience, and education.  

The Company is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, gender, gender identity, sexual orientation, age, status as a protected veteran, among other things, or status as a qualified individual with disability.  

Don’t meet every single requirement? Studies have shown that women and underrepresented minorities are less likely to apply to jobs unless they meet every single qualification. We are dedicated to building a diverse, inclusive, and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.
