Linux Site Reliability Engineer Jobs in Tennessee

Service Reliability Engineer

Nashville, TN

$55 - $73.25/hr

... SRE best practices (SLOs, error budgets) into the design and development lifecycle. Job Requirements: Required Experience & Skills: A strong background in systems administration (Linux/Windows) in a ...

Description We are looking for a Senior Site Reliability Engineer to join our OCI team. This role is part of a globally distributed team responsible for detecting, triaging, and mitigating OCI ...

The ideal candidate will possess deep expertise in OpenShift, Kubernetes, Red Hat Linux, AWS infrastructure, DevOps practices, and Site Reliability Engineering (SRE), along with strong leadership ...

... a Site Reliability Engineer • Open-source contributions • Code generation frameworks • GraphQL schema design • Linux internals and CLI tooling • Experience with LLMs, LangChain/LangGraph ...

Reliability Engineer

Oak Ridge, TN

$88K - $111K/yr

Spectra Tech, Inc. is hiring for a Reliability Engineer (RE) in Oak Ridge, Tennessee Spectra Tech, ... Site-wide collaboration skills; proven problem-solving/analytical ability; capacity to multi-task ...

Reliability Engineer

Oak Ridge, TN · On-site

$88K - $111K/yr

Spectra Tech, Inc. is hiring for a Reliability Engineer (RE) in Oak Ridge, Tennessee Spectra Tech, ... Site-wide collaboration skills; proven problem-solving/analytical ability; capacity to multi-task ...

... SRE framework. They will lead the post-mortem reviews and the timely production of RCAs. They address issues with impact beyond their own team based on knowledge of related disciplines.

Linux Site Reliability Engineer information

What are some common challenges faced by Linux Site Reliability Engineers when scaling infrastructure, and how can they be addressed?

Linux Site Reliability Engineers often encounter challenges related to maintaining system stability and performance as infrastructure scales. Issues such as configuration drift, automation bottlenecks, and monitoring gaps can arise when managing numerous servers or services. Addressing these challenges typically involves implementing robust configuration management tools, investing in automated deployment pipelines, and enhancing observability through comprehensive monitoring and alerting solutions. Collaboration with development and operations teams is essential to ensure that scalability solutions align with business needs and technical requirements.

What are the key skills and qualifications needed to thrive as a Linux Site Reliability Engineer, and why are they important?

To thrive as a Linux Site Reliability Engineer, you need deep expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking concepts, usually backed by a computer science degree or equivalent experience. Familiarity with configuration management tools (like Ansible, Puppet, or Chef), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, or Azure) is typically required, along with relevant certifications like RHCE or AWS Certified SysOps Administrator. Strong problem-solving skills, effective communication, and the ability to work under pressure are crucial soft skills for this role. These competencies ensure the reliability, scalability, and security of complex infrastructure, minimizing downtime and supporting seamless operations.

Who gets paid more, SRE or DevOps?

Generally, Site Reliability Engineers (SREs) tend to have higher salaries than DevOps engineers due to their specialized focus on system reliability, automation, and incident management. Both roles require strong skills in cloud platforms, scripting, and monitoring tools, but SREs often have more advanced expertise in reliability engineering practices, which can lead to higher compensation.

Will AI replace SRE jobs?

AI is expected to augment the work of Linux Site Reliability Engineers by automating routine tasks such as monitoring, incident response, and log analysis. However, SRE roles require complex problem-solving, system design, and decision-making that currently cannot be fully replaced by AI, making human expertise essential. SREs will likely focus more on overseeing automation tools and managing system reliability rather than being replaced entirely.

What engineer makes $500,000 a year?

Senior Linux Site Reliability Engineers, and engineers in similar high-level roles in cloud infrastructure and large-scale systems, can earn $500,000 or more annually, especially with bonuses and stock options. These positions typically require extensive experience, advanced skills in automation, scripting, and cloud platforms, and often involve leadership responsibilities.

What engineers make $300,000 a year?

Senior Linux Site Reliability Engineers with extensive experience, advanced skills in automation, cloud platforms, and monitoring tools can earn $300,000 or more annually, especially in high-cost-of-living areas or large tech companies. Achieving this salary often requires specialized certifications, leadership roles, and a strong track record of managing complex infrastructure at scale.

What is the difference between a Linux Site Reliability Engineer and a Linux DevOps Engineer?

Aspect | Linux Site Reliability Engineer | Linux DevOps Engineer
Credentials | Linux certifications, SRE-specific training | Linux certifications, DevOps tools certifications
Work Environment | Focus on system reliability, monitoring, incident response | Focus on automation, CI/CD pipelines, deployment
Employer & Industry | Tech companies, cloud providers, large enterprises | Startups, tech firms, software development teams
Search & Comparison Intent | Understanding reliability roles, incident management | Automation, deployment, continuous integration

While both roles involve Linux expertise, a Linux Site Reliability Engineer primarily focuses on maintaining system reliability, monitoring, and incident response. In contrast, a Linux DevOps Engineer emphasizes automation, continuous integration, and deployment processes. Both roles require Linux skills and often overlap, but their core responsibilities differ based on organizational needs.

What is a Linux Site Reliability Engineer?

A Linux Site Reliability Engineer (SRE) is an IT professional responsible for ensuring the reliability, scalability, and performance of systems running on the Linux operating system. They bridge the gap between software development and operations by automating processes, monitoring infrastructure, and managing incidents. Linux SREs focus on system availability, building tools for deployment and monitoring, and improving system robustness through best practices and automation. Their work helps organizations deliver reliable online services and quickly recover from outages or system failures.

Site Reliability Engineer (Rustici) US, Franklin, Remote

LTG

Franklin, TN • On-site, Remote

$56.25 - $74.75/hr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 23 days ago


Job description

We are looking for a Site Reliability Engineer to join our team here at Rustici Software. Come work alongside our software development teams, deploying the applications we create in our AWS hosted infrastructure. We are a remote/in-office hybrid company located in Franklin, TN. While we give preference to local candidates, we are open to qualified remote candidates residing in the United States.
The Site Reliability Engineer (SRE) at Rustici Software contributes to the success of the Site Reliability team in deploying, monitoring, and maintaining multiple large applications along with hundreds of customer environments hosted in AWS. The SRE is an individual contributor reporting to the Director of DevSecOps and will participate in an on-call rotation schedule.

US based only, direct hire only, no recruiters, no contracting agencies, please.

What will you be doing?
  • Assist in the improvement of internal automation specifically through the use of "Infrastructure as Code" tools
  • Assist in the maintenance of and addition of new features to the infrastructure control plane
  • Act as the primary contact to monitor, troubleshoot, and resolve production issues as part of an on-call rotation of roughly one week per month to adhere to a 24/7/365 SLA
  • Collaborate with the Director of DevSecOps as well as other members of the SRE team to explore, plan, and implement or improve the security posture, reliability, performance, and cost of hosted resources
  • Collaborate with one or more product development teams on application development direction as it relates to deployment and operational factors
  • Collaborate with members of the support and integration teams to assist with the support of customer environments as it relates to deployment and operational factors
  • Continuously improve knowledge of best practices in site reliability and technical skills related to security, automation, networking, and system operations

Successful candidates

Successful candidates have a mix of skills in the technology space centering on the deployment and upkeep of web application products. We look for the following, but if you don't have experience with all of it, we'd still like to hear from you.

  • Extensive experience with one or more Unix CLIs, including tools such as zsh/bash, non-graphical text editor(s), git, and various other common shell utilities for system administration
  • Extensive experience using AWS resources in the deployment of highly available web application platforms
  • Experience with implementing Infrastructure as Code
  • Experience with the application of security tools, controls, and policies
  • Experience with CI/CD pipeline configuration and orchestration
  • Experience with deployment and orchestration of containerized resources (Docker, Kubernetes, etc.)

Additional Technical Background

  • Broad knowledge of AWS resources including EC2, ECS, S3, RDS, CloudFront, ElastiCache, SQS, Route 53, ELB, Lambda, etc.
  • Broad knowledge of DNS, CDNs, load balancers, web servers, application servers, databases (MySQL), and networking concepts like the basics of TCP/UDP, RFC 1918 subnets, and NAT
  • Familiarity with Terraform, CloudFormation, Ansible or similar tools
  • Familiarity with GitHub Actions, Jenkins, or other automated task management tools
  • Familiarity with web application (HTML, CSS) development using contemporary frameworks in Java, Python, JavaScript/TypeScript, or similar, particularly in extensible, scalable, performant, and secure implementations
  • Familiarity with non-web based scripting language(s) such as Python, JavaScript (Node.js), Go, etc.

About our work

Every day, millions of people around the world access valuable learning and training content powered by Rustici Software's products. If you've ever taken an online course, there's a good chance our software was running behind the scenes. We specialize in helping software vendors and organizations solve problems specific to implementing eLearning standards, such as SCORM, xAPI and cmi5. Since 2002, we've been sharing our expertise with our customers and the industry by providing resources for creating, delivering and distributing eLearning content. We are proud to be known as the "SCORM folks" and "eLearning nerds" and, most recently, for productizing AI to help our customers better understand and deliver training.

How we're different

Rustici Software isn't your average workplace. There's a reason why we have been named Best Place to Work by Nashville Business Journal for 15+ years.

Over the last 20+ years, we've created a unique environment where people want to work and look forward to Monday. We strive not to be static. We desperately want to grow, change, and do our work better year over year. This is your chance to work with a group of people that want you to be opinionated about the work we do and how we do it. You won't always win the arguments we participate in, but you'll know that we deeply value your input and that your coworkers are as passionate as you.

Rustici benefits

We also take great care of the people that work here, and our benefits are unrivaled.

    • Flexible work environment: Rustici Software offers the best of all worlds when it comes to where you work. Remote from your home office, a private office in Franklin, TN if you prefer, or a mix of both. We care more about the work that you do than where you do that work.
    • Untracked PTO
    • Medical, Dental, and Vision insurance
    • HSA and FSA plans
    • Short-term and Long-term disability
    • Company paid life insurance
    • 401k/Retirement vesting+matching on day 1
    • Performance-based bonuses
    • Office perks: Concierge services, gym equipment, yoga room and stocked kitchen with snacks and drinks

How to Apply

Check out "An Open Letter" (https://rusticisoftware.com/jo...) from our Managing Director, Tammy Rutherford. It says a lot about what you need to know before applying to this job opening. You might also want to read up on our answers to the Joel Test (https://rusticisoftware.com/ou...) to see how we approach software development.

Make sure that what we get from you makes it apparent that you are the right person and that this job is important to you, and that you want to work here, not just somewhere.

You will also want to spend some time on our website, learn how we think, what we do, and why we have been named a Best Place to Work by Nashville Business Journal for 15+ years. Get to know us if you want us to get to know you.

Each time we hire, we wait until we find exactly the right person. If that's you, we really hope you'll apply. Don't forget to include more about why you're the right person to join our team and your answer to our developer test. Like really, we won't look at applications that do not include the developer test source code.

We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, colour, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.

Employment Type: Full-time

About LTG

Sourced by ZipRecruiter

Industry: Machinery manufacturing
Company size: 1 - 10 Employees
Headquarters location: Spartanburg, SC, US
Year founded: 1924