Reliability Engineer Manager Jobs in Georgia (NOW HIRING)

Staff Site Reliability Engineer

Atlanta, GA · Remote

$54.75 - $72.75/hr

As a Lead SRE, you'll be a technical and operational leader for reliability across Develocity. You ... Build and maintain comprehensive observability for all managed services, including logging, metrics ...

Senior SRE

Atlanta, GA · On-site

$54.75 - $72.75/hr

ABOUT THIS POSITION: We are looking for a talented and driven Sr. Site Reliability Engineer (SRE) to support our engineering team, which manages the infrastructure and services that power our ...

Site Reliability Engineer II

Atlanta, GA · On-site

$54.75 - $72.75/hr

ABOUT THIS POSITION: We are looking for a talented and driven Site Reliability Engineer (SRE) to support our engineering team, which manages the infrastructure and services that power our Waystar ...

Manager, SRE Engineer - PxE ERM

Atlanta, GA · On-site

$54.75 - $72.75/hr

As a Manager, SRE Engineer, you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility projects. Your expertise will be pivotal in delivering ...

$156K - $288K/yr

Take a leading role in incident management, including coordinating response efforts ... Reliability Engineering or similar DevOps roles focused on system reliability and incident ...

Senior Site Reliability Engineer II

Buford, GA · On-site +1

$104.90K - $174.70K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Mentor other engineers and help set SRE standards and best practices Required Qualifications * 5+ ...

ServiceNow SRE Engineering Manager

Atlanta, GA · On-site

$54.75 - $72.75/hr

As a Manager, ServiceNow SRE Engineer, you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility projects. Your expertise will be pivotal in ...

Sr. Site Reliability Engineer

Atlanta, GA · On-site

$54.75 - $72.75/hr

Extensive/Strong AWS experience: experience in designing, deploying, and managing scalable/reliable ... Drive the adoption of SRE best practices and ensure adherence to reliability and performance ...

Senior Site Reliability Engineer II

Alpharetta, GA · On-site +1

$104.90K - $174.70K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Mentor other engineers and help set SRE standards and best practices Required Qualifications * 5+ ...

Reliability Engineer Manager information

Georgia salary details: $52.5K, $111.4K (average), $133.7K

How much do reliability engineer manager jobs pay per year?

As of May 30, 2026, the average yearly pay for a reliability engineer manager in Georgia is $111,361, according to ZipRecruiter salary data. Most workers in this role earn between $97,400 and $121,900 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Reliability Engineer Manager, and why are they important?

To thrive as a Reliability Engineer Manager, you need a strong background in engineering principles, reliability analysis, and maintenance strategies, typically supported by a degree in engineering and experience in reliability roles. Familiarity with reliability-centered maintenance (RCM), failure mode and effects analysis (FMEA), and asset management software such as SAP or Maximo is common, along with certifications like Certified Reliability Engineer (CRE). Leadership, problem-solving, and effective communication are vital soft skills for managing teams and driving cross-functional initiatives. These competencies are crucial for minimizing downtime, optimizing equipment performance, and ensuring long-term operational efficiency.

What are some common challenges Reliability Engineer Managers face when balancing long-term reliability improvements with immediate operational demands?

Reliability Engineer Managers often need to prioritize urgent maintenance issues while also driving long-term reliability initiatives. Balancing these competing demands can be challenging, as immediate equipment failures may require quick fixes that temporarily interrupt ongoing improvement projects. Effective managers work closely with operations, maintenance, and engineering teams to communicate priorities, allocate resources, and implement sustainable solutions that address root causes rather than just symptoms. This role typically involves using data-driven decision-making and fostering a culture of proactive maintenance and continuous improvement.

What does a Reliability Engineer Manager do?

A Reliability Engineer Manager oversees teams responsible for improving the reliability and performance of systems, machinery, or processes within an organization. They develop maintenance strategies, lead root cause analyses of failures, and implement best practices to minimize downtime and costs. Additionally, they collaborate with other departments to ensure that reliability goals align with business objectives and compliance standards. Their role is crucial in industries such as manufacturing, energy, and technology, where system uptime and safety are critical.

What is the difference between a Reliability Engineer Manager and a Reliability Engineer?

| Aspect | Reliability Engineer | Reliability Engineer Manager |
| --- | --- | --- |
| Required Credentials | Bachelor's in Engineering or related field; certifications like CRC, CRE | Same as Reliability Engineer, plus leadership experience |
| Work Environment | Design, analyze, and improve system reliability; often in teams | Oversees Reliability Engineers; manages projects and teams |
| Employer & Industry Usage | Manufacturing, aerospace, energy, automotive | Same industries, with added managerial responsibilities |
| Common Search & Comparison | Focuses on technical skills and hands-on reliability tasks | Focuses on leadership, team management, and strategic planning |

The main difference between a Reliability Engineer and a Reliability Engineer Manager lies in their responsibilities. The Reliability Engineer focuses on technical analysis and system improvements, while the Reliability Engineer Manager oversees teams, manages projects, and develops strategies to enhance reliability across the organization.

Cloud Infrastructure Site Reliability Engineer (SRE) at Alpharetta, GA or Berkeley Heights, NJ

BI-Commercial

Alpharetta, GA • On-site

$55.75 - $74/hr

Contractor

Posted 15 days ago


Job description

Title: Cloud Infrastructure Site Reliability Engineer (SRE)

Location: Alpharetta, GA or Berkeley Heights, NJ (5 Days Onsite)

Job Description:

As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise across multiple public cloud platforms, you will be responsible for managing and operating cloud infrastructure in alignment with the principles of Google’s SRE model. Your role will focus on ensuring the reliability, availability, and performance of our cloud services, while driving automation and continuous improvement across production environments. You will collaborate closely with cross-functional teams to strengthen our cloud reliability posture and streamline operations through innovative automation solutions.

Key Responsibilities:

  • Design, build, and maintain highly available, scalable, and secure cloud infrastructure on platforms such as AWS, GCP, or Azure.
  • Develop and implement automation for provisioning, monitoring, scaling, and incident response using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation, Ansible).
  • Monitor system reliability, capacity, and performance; proactively detect and address issues before they impact users.
  • Respond to production incidents, participate in on-call rotations, and lead post-incident reviews to drive root cause analysis and reliability improvements.
  • Collaborate with software engineering and security teams to ensure new services and features are production-ready and meet reliability standards.
  • Build and maintain tools for deployment, monitoring, and operations; automate manual processes to reduce toil.
  • Document operational processes and system architectures to ensure knowledge sharing and repeatability.
  • Continuously evaluate and implement new technologies to improve system reliability, security, and efficiency.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
  • 3+ years of experience in software development with proficiency in at least one programming language (e.g., Python, Go, Java, C++).
  • Experience administering cloud platforms (AWS, GCP, Azure), including networking, security, containerization, storage, data management, and serverless technologies.
  • Solid understanding of Linux systems, networking fundamentals, virtualized and distributed systems, file systems, and system processes and configurations.
  • Deep understanding of observability (monitoring, alerting, and logging) tools in cloud environments. Ability to set up and maintain monitoring dashboards, alerts, and logs.
  • Familiarity with Continuous Integration/Continuous Deployment (CI/CD) tools for automated testing, deployments, provisioning, and observability.
  • Ability to manage and respond to incidents, perform root cause analysis, and conduct post-mortem reviews.
  • Understanding of setting, monitoring, and maintaining Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) for system reliability.
  • Additional Qualifications a Plus: Experience working with enterprise-scale financial services or other regulated industries

Certifications: Certified Engineer, DevOps, SRE, CSREF