Linux Site Reliability Engineer Jobs in Georgia (NOW HIRING)

Site Reliability Engineer - SRE

Atlanta, GA · On-site +1

$54.25 - $72/hr

Role: Site Reliability Engineer * Location: Atlanta, GA OR Dallas OR Austin, TX * Duration ... Proficient in a Linux or Unix based environment. * Proficiency in supporting a 24x7 operation.

Site Reliability Engineer - SRE

Atlanta, GA · On-site

$54.25 - $72/hr

Role: Site Reliability Engineer * Location: Atlanta, GA OR Dallas OR Austin, TX * Duration ... Proficient in a Linux or Unix based environment. * Proficiency in supporting a 24x7 operation.

Site Reliability Engineer

Alpharetta, GA · On-site

$55.75 - $74/hr

I have an opportunity for a "Site Reliability Engineer" - Alpharetta, GA (Onsite), and I am ... Linux, Windows servers. • Experience with Web service technologies, including REST, SOAP, JSON ...

Site Reliability Engineer

Atlanta, GA · On-site

$54.75 - $72.75/hr

Position : SRE Duration : 6 to 12 Months Location : Atlanta or St. Louis - Day Onsite Job ... Linux, Windows servers. • Experience with Web service technologies, including REST, SOAP, JSON ...

Site Reliability Engineer (SRE)

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Site Reliability Engineer (SRE) Location: Atlanta, GA (Hybrid - 3 days office, 2 days WFH) Duration: C2H : Dynatrace, AppDynamics ACI (Advanced Computing International) is a Global Technology ...

Site Reliability Engineer (AWS)

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

This position is under our CTO org to support SRE functions for innovation and growth for the ... Strong experience in Linux systems and networking. * Hands-on experience with AWS Cloud Platform.

Site Reliability Engineer

Atlanta, GA · Remote

$54.75 - $72.75/hr

Site Reliability Engineer Company: AutoRABIT Work Type: Remote Employment: Full Time Location: US Seniority: Mid Level Technologies: AWS, GCP, Azure, Kubernetes, Docker, Ansible, Jenkins, Terraform ...

Linux Site Reliability Engineer information

What are some common challenges faced by Linux Site Reliability Engineers when scaling infrastructure, and how can they be addressed?

Linux Site Reliability Engineers often encounter challenges related to maintaining system stability and performance as infrastructure scales. Issues such as configuration drift, automation bottlenecks, and monitoring gaps can arise when managing numerous servers or services. Addressing these challenges typically involves implementing robust configuration management tools, investing in automated deployment pipelines, and enhancing observability through comprehensive monitoring and alerting solutions. Collaboration with development and operations teams is essential to ensure that scalability solutions align with business needs and technical requirements.
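As a rough illustration of how configuration drift might be detected, here is a minimal Python sketch. It assumes config file contents have already been collected from each host by some other means. The function names (`fingerprint`, `find_drift`), the host names, and the sample file contents are hypothetical, and real configuration management tools such as Ansible, Puppet, or Chef handle this far more thoroughly.

```python
import hashlib
from typing import Dict, List


def fingerprint(config_text: str) -> str:
    """Return a SHA-256 digest of a config file's contents."""
    return hashlib.sha256(config_text.encode("utf-8")).hexdigest()


def find_drift(
    baseline: Dict[str, str],
    observed: Dict[str, Dict[str, str]],
) -> Dict[str, List[str]]:
    """Compare each host's config files against a baseline.

    baseline: path -> expected file content
    observed: host -> {path -> actual file content}
    Returns host -> sorted list of paths that differ or are missing.
    Hosts with no drift are omitted from the result.
    """
    expected = {path: fingerprint(text) for path, text in baseline.items()}
    drift: Dict[str, List[str]] = {}
    for host, files in observed.items():
        changed = sorted(
            path
            for path, digest in expected.items()
            if path not in files or fingerprint(files[path]) != digest
        )
        if changed:
            drift[host] = changed
    return drift


if __name__ == "__main__":
    # Hypothetical sample data: web-2 has drifted from the baseline.
    baseline = {"/etc/ssh/sshd_config": "PermitRootLogin no\n"}
    observed = {
        "web-1": {"/etc/ssh/sshd_config": "PermitRootLogin no\n"},
        "web-2": {"/etc/ssh/sshd_config": "PermitRootLogin yes\n"},
    }
    print(find_drift(baseline, observed))
```

In practice a check like this would run on a schedule and feed an alerting system, so that drift is caught before it causes an incident.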

What are the key skills and qualifications needed to thrive as a Linux Site Reliability Engineer, and why are they important?

To thrive as a Linux Site Reliability Engineer, you need deep expertise in Linux system administration, scripting (such as Bash or Python), and a solid understanding of networking concepts, usually backed by a computer science degree or equivalent experience. Familiarity with configuration management tools (like Ansible, Puppet, or Chef), containerization (Docker, Kubernetes), and cloud platforms (AWS, GCP, or Azure) is typically required, along with relevant certifications like RHCE or AWS Certified SysOps Administrator. Strong problem-solving skills, effective communication, and the ability to work under pressure are crucial soft skills for this role. These competencies ensure the reliability, scalability, and security of complex infrastructure, minimizing downtime and supporting seamless operations.

Who gets paid more, SRE or DevOps?

Site Reliability Engineers (SREs) tend to earn more than DevOps engineers because of their specialized focus on system reliability, automation, and incident management. Both roles require strong skills in cloud platforms, scripting, and monitoring tools, but SREs often have more advanced expertise in reliability engineering practices, which can lead to higher compensation.

Will AI replace SRE jobs?

AI is expected to augment the work of Linux Site Reliability Engineers by automating routine tasks such as monitoring, incident response, and log analysis. However, SRE roles require complex problem-solving, system design, and decision-making that currently cannot be fully replaced by AI, making human expertise essential. SREs will likely focus more on overseeing automation tools and managing system reliability rather than being replaced entirely.

What engineer makes $500,000 a year?

A senior Linux Site Reliability Engineer, or someone in a similar high-level engineering role in cloud infrastructure and large-scale systems, can earn $500,000 or more annually, especially with bonuses and stock options. These positions typically require extensive experience and advanced skills in automation, scripting, and cloud platforms, and they often involve leadership responsibilities.

What engineers make $300,000 a year?

Senior Linux Site Reliability Engineers with extensive experience and advanced skills in automation, cloud platforms, and monitoring tools can earn $300,000 or more annually, especially in high-cost-of-living areas or at large tech companies. Reaching this salary often requires specialized certifications, a leadership role, and a strong track record of managing complex infrastructure at scale.

What is the difference between Linux Site Reliability Engineer vs Linux DevOps Engineer?

Aspect | Linux Site Reliability Engineer | Linux DevOps Engineer
Credentials | Linux certifications, SRE-specific training | Linux certifications, DevOps tools certifications
Work Environment | Focus on system reliability, monitoring, incident response | Focus on automation, CI/CD pipelines, deployment
Employer & Industry | Tech companies, cloud providers, large enterprises | Startups, tech firms, software development teams
Search & Comparison Intent | Understanding reliability roles, incident management | Automation, deployment, continuous integration

While both roles involve Linux expertise, a Linux Site Reliability Engineer primarily focuses on maintaining system reliability, monitoring, and incident response. In contrast, a Linux DevOps Engineer emphasizes automation, continuous integration, and deployment processes. Both roles require Linux skills and often overlap, but their core responsibilities differ based on organizational needs.

What is a Linux Site Reliability Engineer?

A Linux Site Reliability Engineer (SRE) is an IT professional responsible for ensuring the reliability, scalability, and performance of systems running on the Linux operating system. They bridge the gap between software development and operations by automating processes, monitoring infrastructure, and managing incidents. Linux SREs focus on system availability, building tools for deployment and monitoring, and improving system robustness through best practices and automation. Their work helps organizations deliver reliable online services and quickly recover from outages or system failures.

$54.75 - $72.75/hr

Other

Posted 24 days ago


Job description

What You'll Bring to the Team:

We are seeking a Site Reliability Engineer (SRE) to join one of our Scrum teams and help ensure the reliability, scalability, and performance of the Florence platform. AI-driven tooling and automation are a cornerstone of how we build, operate, and scale our systems.

In this role, you will work closely with product engineers while actively leveraging AI to improve observability, incident response, automation, and overall platform reliability. Coding assignments in this role will require working with AI-assisted development workflows as a core part of how solutions are designed and delivered.

You Will:
  • Be an embedded member of a Scrum team, participating in planning, refinement, reviews, and retrospectives
  • Use AI-powered tools to enhance system reliability, operational efficiency, and developer productivity
  • Design, build, and operate reliable, scalable cloud infrastructure supporting platform and product services
  • Apply AI-assisted analysis to monitoring, alerting, and observability data to detect, predict, and prevent incidents
  • Define and maintain SLOs, SLIs, and error budgets to guide reliability decisions
  • Collaborate with software engineers to embed reliability and AI-driven automation into the software development lifecycle
  • Lead and participate in incident response, root cause analysis, and postmortems, leveraging AI insights where appropriate
  • Automate operational tasks and reduce toil through AI-enabled and traditional automation approaches
  • Contribute to disaster recovery planning, testing, and operational readiness
  • Produce and maintain documentation such as runbooks, operational guides, and system diagrams
  • Contribute code as a secondary responsibility, with coding assignments focused on building reliability tooling, automation, and integrations using AI-assisted development practices
An Ideal Candidate Is / Has:
  • Passionate about building reliable, scalable systems using modern, AI-enabled approaches
  • Strong understanding of cloud-native and distributed system architectures
  • Experience applying SRE principles in a production environment
  • Hands-on experience with cloud platforms (AWS preferred)
  • Experience using AI-assisted tools for coding, debugging, automation, or operational analysis
  • Strong background in Linux, networking, and system operations
  • Experience with infrastructure-as-code and automation tools (e.g., Terraform, CI/CD pipelines)
  • Familiarity with modern observability practices (metrics, logs, tracing), including AI-enhanced analysis
  • Comfortable working as part of an agile, cross-functional Scrum team
  • Strong problem-solving, communication, and collaboration skills
  • 4+ years of experience in SRE, DevOps, or similar roles
  • Experience supporting production systems at scale