Log In

1

Process Reliability Manager Jobs in Atlanta, GA (NOW HIRING)

Senior Site Reliability Engineer (SRE) - Dynatrace & Azure Observability Expert

$54.25 - $72/hr

... processing systems. API & Integration Monitoring * Monitor and troubleshoot Azure API Management ... Site Reliability Engineering (SRE) * Perform deep technical analysis across systems by correlating ...

Senior Site Reliability Engineer (SRE) - Dynatrace & Azure Observability Expert

$54.25 - $72/hr

... processing systems. API & Integration Monitoring * Monitor and troubleshoot Azure API Management ... Site Reliability Engineering (SRE) * Perform deep technical analysis across systems by correlating ...

Crew Career Center

Site Reliability Engineer II

Atlanta, GA · On-site

$54.75 - $72.75/hr

Design and implement reliability solutions for data ingestion, processing, and delivery pipelines. * Define and maintain SLIs/SLOs for data licensing services and manage error budgets. * Build ...

Crew Career Center

Site Reliability Engineer II

Atlanta, GA · On-site

$54.75 - $72.75/hr

Design and implement reliability solutions for data ingestion, processing, and delivery pipelines. * Define and maintain SLIs/SLOs for data licensing services and manage error budgets. * Build ...

Lexisnexis Risk Solutions

Senior Site Reliability Engineer II

Alpharetta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Lexisnexis Risk Solutions

Senior Site Reliability Engineer II

Alpharetta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Senior Site Reliability Engineer II

Alpharetta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Senior Site Reliability Engineer II

Alpharetta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Senior Site Reliability Engineer II

Buford, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Senior Site Reliability Engineer II

Buford, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Lexisnexis Risk Solutions

Senior Site Reliability Engineer II

Buford, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Lexisnexis Risk Solutions

Senior Site Reliability Engineer II

Buford, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Lexisnexis Risk Solutions

Senior Site Reliability Engineer II

Atlanta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Lexisnexis Risk Solutions

Senior Site Reliability Engineer II

Atlanta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Site Reliability Engineer II

Atlanta, GA · On-site

$54.75 - $72.75/hr

Design and implement reliability solutions for data ingestion, processing, and delivery pipelines. * Define and maintain SLIs/SLOs for data licensing services and manage error budgets. * Build ...

Site Reliability Engineer II

Atlanta, GA · On-site

$54.75 - $72.75/hr

Design and implement reliability solutions for data ingestion, processing, and delivery pipelines. * Define and maintain SLIs/SLOs for data licensing services and manage error budgets. * Build ...

Senior Site Reliability Engineer

Atlanta, GA · On-site +1

$109K/yr

Build, implement, iterate over CI/CD pipelines * Assist with the Management, Development, Design ... Identify opportunities for improvement around observability and process * Standardization and ...

Senior Site Reliability Engineer

Atlanta, GA · On-site +1

$109K/yr

Build, implement, iterate over CI/CD pipelines * Assist with the Management, Development, Design ... Identify opportunities for improvement around observability and process * Standardization and ...

Senior Site Reliability Engineer

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Build, implement, iterate over CI/CD pipelines * Assist with the Management, Development, Design ... Identify opportunities for improvement around observability and process * Standardization and ...

Quick apply

Senior Site Reliability Engineer

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Build, implement, iterate over CI/CD pipelines * Assist with the Management, Development, Design ... Identify opportunities for improvement around observability and process * Standardization and ...

Site Reliability Engineer (AWS)

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Manage deployment pipelines and configuration management for consistent and reliable app ... response processes. * Participate in on-call rotations and provide 24/7 support for critical ...

Site Reliability Engineer (AWS)

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Manage deployment pipelines and configuration management for consistent and reliable app ... response processes. * Participate in on-call rotations and provide 24/7 support for critical ...

Site Reliability Engineer

Atlanta, GA · On-site

$54.75 - $72.75/hr

... manual processes using Python, Ruby, Unix Shell (bash, ksh), perl, Ant, etc. • Installing ... QA, Product Management, and Production Ops teams to make sure Product Releases on-time with ...

Site Reliability Engineer

Atlanta, GA · On-site

$54.75 - $72.75/hr

... manual processes using Python, Ruby, Unix Shell (bash, ksh), perl, Ant, etc. • Installing ... QA, Product Management, and Production Ops teams to make sure Product Releases on-time with ...

Senior Site Reliability Engineer

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Build, implement, iterate over CI/CD pipelines * Assist with the Management, Development, Design ... Identify opportunities for improvement around observability and process * Standardization and ...

Senior Site Reliability Engineer

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Build, implement, iterate over CI/CD pipelines * Assist with the Management, Development, Design ... Identify opportunities for improvement around observability and process * Standardization and ...

Senior Site Reliability Engineer II

Atlanta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Senior Site Reliability Engineer II

Atlanta, GA · On-site +1

$125K - $209K/yr

Define and manage SLOs, SLIs, and error budgets * Build and improve CI/CD pipelines and operational ... Strong written communication skills with experience documenting systems and processes in Confluence

Site Reliability Engineer (AWS)

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Manage deployment pipelines and configuration management for consistent and reliable app ... response processes. * Participate in on-call rotations and provide 24/7 support for critical ...

Site Reliability Engineer (AWS)

Atlanta, GA · Hybrid

$54.75 - $72.75/hr

Manage deployment pipelines and configuration management for consistent and reliable app ... response processes. * Participate in on-call rotations and provide 24/7 support for critical ...

Cloud Security Engineer - SRE

Alpharetta, GA · On-site

$60 - $63/hr

Role: Cloud Security Engineer - SRE Location: Alpharetta, GA or Berkeley Heights, NJ Duration: 12 ... processes and teams • Proficient with Project Management tools

Quick apply

Cloud Security Engineer - SRE

Alpharetta, GA · On-site

$60 - $63/hr

Role: Cloud Security Engineer - SRE Location: Alpharetta, GA or Berkeley Heights, NJ Duration: 12 ... processes and teams • Proficient with Project Management tools

Lorven Technologies

Senior Site reliability Engineer - Remote

Sandy Springs, GA · On-site +1

$54.75 - $72.75/hr

Design and manage hub-and-spoke architecture, including VNet peering, routing, and segmentation ... Automate CI/CD pipelines using Azure DevOps (YAML, PowerShell) for build and release processes.

Lorven Technologies

Senior Site reliability Engineer - Remote

Sandy Springs, GA · On-site +1

$54.75 - $72.75/hr

Design and manage hub-and-spoke architecture, including VNet peering, routing, and segmentation ... Automate CI/CD pipelines using Azure DevOps (YAML, PowerShell) for build and release processes.

Sr. Site Reliability Engineer

$128K - $216K/yr

... management. With over 15 billion transactions processed annually, Clover empowers merchants ... As a Senior Site Reliability Engineer, you are responsible for managing, deploying and architecting ...

Sr. Site Reliability Engineer

$128K - $216K/yr

... management. With over 15 billion transactions processed annually, Clover empowers merchants ... As a Senior Site Reliability Engineer, you are responsible for managing, deploying and architecting ...

Braves Technologies

Site Reliability Engineer (DevOps) (Remote)

Woodstock, GA · Remote

$51.50 - $68.25/hr

Contribute to the continuous improvement of the SRE team processes and practices. * Assist delivery teams with understanding, managing, and optimizing cloud-based application costs using AWS Well ...

Braves Technologies

Site Reliability Engineer (DevOps) (Remote)

Woodstock, GA · Remote

$51.50 - $68.25/hr

Contribute to the continuous improvement of the SRE team processes and practices. * Assist delivery teams with understanding, managing, and optimizing cloud-based application costs using AWS Well ...

North American Electric Reliability Corporation

Manager Power Risk Issues and Strategic Management

Atlanta, GA · On-site

... process. * Direct and manage staff responsible for developing ERO positions around existing and emerging threats to BPS reliability leveraging industry expertise in conjunction with data and ...

North American Electric Reliability Corporation

Manager Power Risk Issues and Strategic Management

Atlanta, GA · On-site

... process. * Direct and manage staff responsible for developing ERO positions around existing and emerging threats to BPS reliability leveraging industry expertise in conjunction with data and ...

1

2

3

Showing results 1-20

Process Reliability Manager Jobs in Atlanta, GA

Process Reliability Manager information

See Atlanta, GA salary details

$59.6K

$113K

$162K

How much do process reliability manager jobs pay per year?

As of Jul 13, 2026, the average yearly pay for process reliability manager in Atlanta, GA is $112,983.00, according to ZipRecruiter salary data. Most workers in this role earn between $90,900.00 and $134,600.00 per year, depending on experience, location, and employer.

What is a Process Reliability Manager?

A Process Reliability Manager is a professional responsible for ensuring that manufacturing or production processes operate efficiently, consistently, and with minimal downtime. They analyze process data, identify areas for improvement, and implement strategies to enhance equipment reliability and overall process performance. By collaborating with maintenance, engineering, and operations teams, they help reduce failures, optimize productivity, and maintain quality standards. Their work is crucial for minimizing costs and ensuring that production targets are met safely and reliably.

What is the difference between Process Reliability Manager vs Maintenance Engineer?

Aspect	Process Reliability Manager	Maintenance Engineer
Certifications	Reliability certifications, Six Sigma, PMP	Mechanical/Electrical certifications, HVAC, PLC certifications
Work Environment	Manufacturing plants, industrial facilities	Factories, equipment maintenance sites
Industry Usage	Focus on reliability, uptime, and process optimization	Focus on equipment repair, preventive maintenance

The Process Reliability Manager primarily focuses on improving equipment reliability and process efficiency through data analysis and strategic planning. In contrast, Maintenance Engineers handle the hands-on repair and maintenance of machinery. Both roles are essential in manufacturing, but the Process Reliability Manager emphasizes proactive reliability strategies, while Maintenance Engineers focus on reactive and preventive maintenance tasks.

How does a Process Reliability Manager typically collaborate with maintenance and production teams to achieve operational goals?

A Process Reliability Manager works closely with both maintenance and production teams to identify areas of improvement in equipment reliability and process efficiency. This often involves facilitating cross-functional meetings, analyzing downtime data, and implementing preventive maintenance strategies. Clear communication and teamwork are key, as the role requires aligning the objectives of different departments to minimize unplanned outages and optimize production output. By fostering a proactive culture and sharing best practices, the Process Reliability Manager helps ensure the plant operates smoothly and efficiently.

What are the key skills and qualifications needed to thrive as a Process Reliability Manager, and why are they important?

To thrive as a Process Reliability Manager, you need a strong background in engineering, process optimization, and reliability analysis, often supported by a degree in engineering and experience in manufacturing or industrial settings. Familiarity with reliability-centered maintenance (RCM), root cause analysis tools, and data analysis software such as SAP or Maximo is typically required. Exceptional problem-solving, leadership, and communication skills help drive cross-functional teams and foster a culture of continuous improvement. These skills are crucial to ensure equipment reliability, minimize downtime, and optimize operational efficiency within complex production environments.

What are popular job titles related to Process Reliability Manager jobs in Atlanta, GA? For Process Reliability Manager jobs in Atlanta, GA, the most frequently searched job titles are:

What job categories do people searching Process Reliability Manager jobs in Atlanta, GA look for? The top searched job categories for Process Reliability Manager jobs in Atlanta, GA are:

What cities near Atlanta, GA are hiring for Process Reliability Manager jobs? Cities near Atlanta, GA with the most Process Reliability Manager job openings:

McDonough

Process Reliability Manager jobs near you

Senior Site Reliability Engineer (SRE) - Dynatrace & Azure Observability Expert

Atlanta, GA

Apply

$54.25 - $72/hr

Full-time

Re-posted 2 days ago

RaceTrac rating

4.7

Based on 195 frontline employees who took The Breakroom Quiz

37th of 48 rated convenience stores

Job description

We are seeking a highly experienced Site Reliability Engineer (SRE) with deep expertise in Dynatrace, observability engineering, and Azure cloud technologies. This role will be exclusively focused on building, enhancing, and managing enterprise observability, telemetry, monitoring, and proactive reliability engineering practices across critical digital platforms.

The ideal candidate must possess advanced hands-on expertise in Dynatrace, especially Dynatrace Query Language (DQL), along with strong knowledge of Azure Monitor, Azure KQL, Application Insights, Azure Functions, APIM, and distributed telemetry concepts. The candidate should have a strong understanding of .NET application architecture and the ability to read and analyze .NET code to support troubleshooting, root cause analysis, and observability implementation within Azure environments. Experience enabling observability for mobile platforms such as iOS and Android is also required.

This is a highly technical, hands-on role requiring a proactive engineering mindset, strong analytical capabilities, and the ability to collaborate across engineering, cloud, mobile, and business teams.

What You'll Do

Dynatrace & Observability Engineering

Serve as the primary Dynatrace SME across the organization.
Design, develop, and optimize enterprise observability solutions using Dynatrace.
Develop advanced Dynatrace DQL queries, dashboards, workflows, alerts, and analytics.
Implement intelligent monitoring strategies for applications, APIs, integrations, Azure services, mobile platforms, and distributed systems.
Continuously improve observability maturity through telemetry standardization, proactive monitoring, and automation.
Configure and tune alerting mechanisms to improve signal-to-noise ratio and reduce alert fatigue.
Leverage Dynatrace Davis AI, anomaly detection, and AI-driven root cause analysis capabilities.
Enable and enhance observability for mobile applications across iOS and Android platforms.

Azure Monitoring & Cloud Operations

Build and maintain monitoring solutions using:
- Azure Monitor
- Application Insights
- Azure Log Analytics
- Azure KQL
Monitor and troubleshoot Azure Function Apps, App Services, APIs, integrations, and backend services.
Analyze telemetry, traces, logs, metrics, and distributed transactions to identify root causes and performance bottlenecks.
Troubleshoot cloud-native applications and Azure infrastructure issues.
Develop proactive monitoring for cloud services, integrations, APIs, and backend processing systems.

API & Integration Monitoring

Monitor and troubleshoot Azure API Management (APIM), API Gateways, API endpoints, and integrations.
Understand end-to-end API transaction flows and dependency mapping.
Build observability solutions for APIs, middleware platforms, and integration services.
Diagnose latency issues, transaction failures, authentication issues, and backend service degradation.

Mobile Application Observability

Enable telemetry, monitoring, tracing, and performance analysis for iOS and Android applications.
Analyze mobile-to-backend transaction flows and end-user experience metrics.
Troubleshoot mobile application latency, crash analytics, API failures, and connectivity issues.
Correlate mobile telemetry with backend application and infrastructure monitoring data.

Application Engineering & Troubleshooting

Utilize prior .NET development experience to troubleshoot application behavior, performance, and deployment issues.
Read and understand .NET application code to support root cause analysis and observability implementation.
Work closely with development teams to understand application logic, API flows, dependencies, and exception handling.
Support Azure Function deployments, configuration management, scaling, and runtime troubleshooting.
Collaborate with development teams during architecture reviews and production releases.
Ensure observability and monitoring readiness before deployments go live.

Site Reliability Engineering (SRE)

Perform deep technical analysis across systems by correlating logs, metrics, traces, and application telemetry.
Conduct root cause analysis (RCA) for recurring incidents and systemic issues.
Partner with engineering and operations teams to implement preventive improvements and automation.
Develop KPI-driven reliability improvements focused on system stability, performance, and operational excellence.
Proactively identify risks, bottlenecks, failure patterns, and reliability concerns before business impact occurs.

Continuous Improvement & Automation

Automate operational workflows and monitoring processes wherever possible.
Improve operational efficiency using AI-driven insights and automation capabilities.
Build reusable monitoring frameworks, dashboards, and telemetry standards.
Drive observability best practices across engineering teams.

What We're Looking For

Mandatory Technical Skills

10+ years of overall IT experience.
Expert-level hands-on experience with Dynatrace.
Advanced expertise in Dynatrace Query Language (DQL).
Strong hands-on expertise in Azure Kusto Query Language (KQL).
Deep understanding of telemetry, observability, distributed tracing, metrics, and logging concepts.
Strong Azure cloud experience with emphasis on:
- Azure Monitor
- Application Insights
- Azure Functions
- Azure API Management (APIM)
- Azure Log Analytics
- App Services
Strong understanding of API architectures, API Gateways, and backend integrations.
Prior hands-on experience developing .NET applications.
Strong ability to read, analyze, and understand .NET application code.
Experience troubleshooting and deploying Azure Functions and cloud-native applications.
Experience enabling observability and telemetry for mobile applications on iOS and Android.
Understanding of mobile telemetry, crash analytics, API monitoring, and end-user experience monitoring.
Strong understanding of distributed systems and enterprise application architectures.

Preferred Skills

Experience with OpenTelemetry implementation and instrumentation.
Experience with CI/CD pipelines and DevOps practices.
Knowledge of AI-driven observability and AIOps concepts.
Experience monitoring high-volume enterprise digital platforms.
Familiarity with ServiceNow and incident management workflows.
Experience with Databricks, SQL platforms, and integration technologies.

Core Competencies

Strong analytical and troubleshooting skills.
Excellent communication and stakeholder management abilities.
Ability to work independently and drive proactively.
Strong collaboration skills across engineering, cloud, SRE, mobile, and business teams.
Ability to quickly adapt to new technologies and evolving environments.

Success Criteria

Reduction in recurring incidents through proactive monitoring and RCA.
Improved observability coverage across enterprise systems, APIs, and mobile applications.
Faster incident detection and resolution.
Reduction in monitoring noise and false positives.
Increased automation and operational efficiency.
Improved reliability and performance of critical systems and APIs.
Strong partnership with engineering teams to ensure production readiness and operational excellence.

Fueled by Growth, Driven by You

At RaceTrac, our people make the difference. Whether you’re working in a store, at our corporate office, or on the road, you’ll be part of a team that brings energy, innovation, and a passion for serving others every day. We support each other, celebrate wins big and small, and create opportunities for growth at every level. With four operating divisions RaceTrac, RaceWay, Energy Dispatch, and Gulf - there’s always a new challenge to take on and a new path to pursue. Join us and discover how far your career can go.

To see what #LifeatRaceTrac is like, visit our LinkedIn, Facebook, and Instagram pages.

All qualified applicants will receive consideration for employment with RaceTrac without regard to their race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.

What RaceTrac employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom

About RaceTrac

Sourced by ZipRecruiter

Industry

Retail

Company size

5,001 - 10,000 Employees

Headquarters location

Atlanta, GA, US

Year founded

1934

Website

Social media

View All RaceTrac Jobs

Apply