1

Reliability Engineer Manager Jobs in Georgia (NOW HIRING)

SRE Lead/ Architect

Atlanta, GA · On-site

$54.75 - $72.75/hr

Deep understanding and practical application of SRE principles (SLIs/SLOs, error budgets, toil reduction, automation, incident management, postmortems) * Expertise in cloud computing platforms (e.g ...

Reliability Engineer

Dudley, GA

$87.40K - $110K/yr

... reliability and lower operating costs ... Utilize the mill's computerized maintenance management system to manage lubrication and other ...

Manager, ServiceNow SRE Engineer

Atlanta, GA · On-site

$54.75 - $72.75/hr

Manager, ServiceNow SRE Engineer Role Overview: As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility ...

Reliability Engineer

Hartwell, GA

$95.30K - $120K/yr

As a Reliability Engineer you'll provide leadership, coaching, and technical expertise to support ... Bring your expertise to an industry leader in pet care are you manage continuous improvement of ...

Reliability Engineer

Hartwell, GA

$95.30K - $120K/yr

As a Reliability Engineer you?ll provide leadership, coaching, and technical expertise to support ... Bring your expertise to an industry leader in pet care are you manage continuous improvement of ...

Reliability Engineer

Dudley, GA · On-site

$87.40K - $110K/yr

... reliability and lower operating costs ... Utilize the mill's computerized maintenance management system to manage lubrication and other ...

Site Reliability Engineer

Atlanta, GA · On-site +1

$100K - $120K/yr

Strong knowledge of SRE best practices and incident management protocols * Deep experience using and/or configuring New Relic, Data Dog, SumoLogic or similar observability tools * Proficiency in ...

Lead Site Reliability Engineer

Atlanta, GA · Remote

$54.75 - $72.75/hr

Partner with InfoSec to define hardening standards, manage perimeter defense (WAF/DDoS), and ... & Operations * Deep Observability: Proven experience designing monitoring solutions (Datadog, New ...

Site Reliability Engineer

Atlanta, GA · On-site

$117K - $209.33K/yr

Participate in on-call support and incident management, ensuring timely resolution and clear ... DevOps/SRE experience with cloud-based applications * Advanced hands-on experience Linux ...

next page

Showing results 1-20

Reliability Engineer Manager information

See Georgia salary details

$52.5K

$111.4K

$133.7K

How much do reliability engineer manager jobs pay per year?

As of May 29, 2026, the average yearly pay for reliability engineer manager in Georgia is $111,361.00, according to ZipRecruiter salary data. Most workers in this role earn between $97,400.00 and $121,900.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Reliability Engineer Manager, and why are they important?

To thrive as a Reliability Engineer Manager, you need a strong background in engineering principles, reliability analysis, and maintenance strategies, typically supported by a degree in engineering and experience in reliability roles. Familiarity with reliability-centered maintenance (RCM), failure mode and effects analysis (FMEA), and asset management software such as SAP or Maximo is common, along with certifications like Certified Reliability Engineer (CRE). Leadership, problem-solving, and effective communication are vital soft skills for managing teams and driving cross-functional initiatives. These competencies are crucial for minimizing downtime, optimizing equipment performance, and ensuring long-term operational efficiency.

What are some common challenges Reliability Engineer Managers face when balancing long-term reliability improvements with immediate operational demands?

Reliability Engineer Managers often need to prioritize urgent maintenance issues while also driving long-term reliability initiatives. Balancing these competing demands can be challenging, as immediate equipment failures may require quick fixes that temporarily interrupt ongoing improvement projects. Effective managers work closely with operations, maintenance, and engineering teams to communicate priorities, allocate resources, and implement sustainable solutions that address root causes rather than just symptoms. This role typically involves using data-driven decision-making and fostering a culture of proactive maintenance and continuous improvement.

What does a Reliability Engineer Manager do?

A Reliability Engineer Manager oversees teams responsible for improving the reliability and performance of systems, machinery, or processes within an organization. They develop maintenance strategies, lead root cause analyses of failures, and implement best practices to minimize downtime and costs. Additionally, they collaborate with other departments to ensure that reliability goals align with business objectives and compliance standards. Their role is crucial in industries such as manufacturing, energy, and technology, where system uptime and safety are critical.

What is the difference between Reliability Engineer Manager vs Reliability Engineer?

AspectReliability EngineerReliability Engineer Manager
Required CredentialsBachelor's in Engineering or related field; certifications like CRC, CRESame as Reliability Engineer, plus leadership experience
Work EnvironmentDesign, analyze, and improve system reliability; often in teamsOversees Reliability Engineers; manages projects and teams
Employer & Industry UsageManufacturing, aerospace, energy, automotiveSame industries, with added managerial responsibilities
Common Search & ComparisonFocuses on technical skills and hands-on reliability tasksFocuses on leadership, team management, and strategic planning

The main difference between a Reliability Engineer and a Reliability Engineer Manager lies in their responsibilities. The Reliability Engineer focuses on technical analysis and system improvements, while the Reliability Engineer Manager oversees teams, manages projects, and develops strategies to enhance reliability across the organization.

What are the most commonly searched types of Reliability Engineer jobs in Georgia? The most popular types of Reliability Engineer jobs in Georgia are:
What cities in Georgia are hiring for Reliability Engineer Manager jobs? Cities in Georgia with the most Reliability Engineer Manager job openings:

SRE Lead/ Architect

Vish Consulting IT

Atlanta, GA • On-site

$54.75 - $72.75/hr

Contractor

Posted 26 days ago


Job description

Job Title: SRE Lead/Architect

Location: Atlanta, GA  - Hybrid (Thur to next wed (Alternate weeks))

Contract Role

Role Summary: Mandatory skills are Observability, Resiliency, Chaos engineering, strong python, and Dynatrace

As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems and practices that ensure the reliability, scalability, performance, and efficiency of our critical services. Moving beyond day-to-day operations, you will focus on the strategic architectural direction of SRE function, defining standards, blueprints, and frameworks that enable development teams and fellow SRE operations team to build and operate highly resilient systems. Leverage deep expertise in software engineering, distributed systems, cloud infrastructure, and SRE principles to influence technology choices, establish best practices, and foster a proactive culture of reliability across the organization and much beyond observability pillar.

Key Responsibilities:

  1. Reliability Strategy & Design:
    • Architect and design highly available, scalable, secure, and cost-effective infrastructure and application patterns on AWS
    • Define and evangelize SRE best practices, standards, and blueprints for service design, deployment, monitoring, and operational readiness across the engineering organization
    • Review current observability implementation to identify gaps and define steps to reach next level maturity of observability setup  to provide deep insights into system health and behaviour
    • With overall maturity lead the definition and implementation strategy for Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets for critical services
  1. Platform Architecture & Automation:
    • Design solutions to systematically reduce operational toil through automation and improved system design
    • Evaluate current SRE tools and automation frameworks (e.g., CI/CD pipelines, Infrastructure as Code modules, automated incident remediation, chaos engineering platforms) and suggest enhancement that will help overall enhancement of capability
    • Evaluate, prototype, and recommend new technologies, tools, and methodologies to enhance system reliability, developer productivity, and operational efficiency
  1. Technical Leadership & Consultation:
    • Act as a senior technical advisor and subject matter expert on reliability, scalability, and performance for development and platform teams
    • Provide architectural guidance during the design phase of new services and features to ensure reliability principles are embedded early (shift-left)
    • Mentor and coach other SREs and engineers, fostering technical excellence and adherence to SRE principles
    • Lead architectural reviews and production readiness assessments for critical systems
  1. Resilience:
    • Lead blameless postmortems for significant incidents, ensuring root causes are identified and systemic architectural improvements are prioritized and implemented
    • Architect and advocate for resilience patterns (e.g., circuit breaking, rate limiting, graceful degradation, chaos engineering) within applications and infrastructure

Required Qualifications:

  • Proven experience in an architectural role, designing solutions for reliability, scalability, and performance
  • Deep understanding and practical application of SRE principles (SLIs/SLOs, error budgets, toil reduction, automation, incident management, postmortems)
  • Expertise in cloud computing platforms (e.g., AWS) including infrastructure, networking, and security services
  • Strong experience with containerization and orchestration technologies (Kubernetes, Docker, serverless computing)
  • Solid experience designing and implementing observability solutions (e.g., Dynatrace, Prometheus, Grafana, ELK/EFK Stack, Jaeger, OpenTelemetry)
  • Strong programming/scripting skills (e.g., Python, Go, Bash) for automation and tool development
  • Excellent analytical, problem-solving, and strategic thinking skills.
  • Strong communication, collaboration, and leadership skills with the ability to influence technical direction across teams

Preferred Qualifications:

  • Experience designing and implementing chaos engineering practices and platforms

Thanks & Regards,

Vivek Sharma 

Account Manager

Cell: (904) 481-0481

Fax: (619)-333-1294

Email: vivek@vishusa.com

Vish Consulting Services, Inc

www.vishusa.com