1

Observability Manager Jobs in Alabama (NOW HIRING)

Contribute to managing observability platforms that are shared resources used by infrastructure, development, and operations teams * Help establish metrics, dashboards, and alerting strategies ...

Data Quality Engineer

Montgomery, AL · On-site

$113.30K - $136K/yr

Data Testing & Observability: Design and deploy automated data testing at scale; use observability ... Multi-cloud Management Services : Helping businesses digitally transform with smart cloud ...

New

Site Reliability Engineer

Birmingham, AL · On-site

$53.50 - $71/hr

Build and manage CI/CD pipelines to facilitate smooth deployments and automate workflows ... Implement observability solutions to gain insights into system performance and user experience.

Site Reliability Engineer

Birmingham, AL · On-site

$53.50 - $71/hr

Build and manage CI/CD pipelines to facilitate smooth deployments and automate workflows ... Implement observability solutions to gain insights into system performance and user experience.

Site Reliability Engineer

Birmingham, AL

$53.50 - $71/hr

Build and manage CI/CD pipelines to facilitate smooth deployments and automate workflows ... Implement observability solutions to gain insights into system performance and user experience.

Manager, ServiceNow SRE Engineer

Birmingham, AL · On-site

$53.50 - $71/hr

... modern observability practices, and SLAs. The ideal candidate will be a role model and an ... Manager, ServiceNow SRE Engineer Role Overview: As a Manager, ServiceNow SRE Engineer , you will ...

Manager, ServiceNow SRE Engineer

Huntsville, AL · On-site

$56.25 - $74.75/hr

... modern observability practices, and SLAs. The ideal candidate will be a role model and an ... Manager, ServiceNow SRE Engineer Role Overview: As a Manager, ServiceNow SRE Engineer , you will ...

ServiceNow SRE Engineering Manager

Huntsville, AL · On-site

$56.25 - $74.75/hr

As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking ... modern observability practices, and SLAs. The ideal candidate will be a role model and an ...

ServiceNow SRE Engineering Manager

Birmingham, AL · On-site

$53.50 - $71/hr

As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking ... modern observability practices, and SLAs. The ideal candidate will be a role model and an ...

... and observability * Design scalable systems for distributed training, data processing, feature and model lifecycle management, and production inference * Own platform-level technical outcomes from ...

Manager, SRE Engineer - PxE ERM

Birmingham, AL · On-site

$53.50 - $71/hr

As a Manager, SRE Engineer , you will actively engage in your engineering craft, taking a hands-on ... modern observability practices, and SLAs. The ideal candidate will be a role model and an ...

Manager, SRE Engineer - PxE ERM

Huntsville, AL · On-site

$56.25 - $74.75/hr

As a Manager, SRE Engineer , you will actively engage in your engineering craft, taking a hands-on ... modern observability practices, and SLAs. The ideal candidate will be a role model and an ...

AI Data Engineer - Senior Consultant

Huntsville, AL · Hybrid

$103K - $141.40K/yr

... observability, and cost/performance discipline. Recruiting for this role ends on August 30, 2026 ... prompt/version management). * 4+ years of cloud experience on AWS/Azure/GCP (one or more ...

next page

Showing results 1-20

Observability Manager information

What is the difference between Observability Manager vs Site Reliability Engineer?

AspectObservability ManagerSite Reliability Engineer
CredentialsTypically requires experience in monitoring, logging, and cloud tools; certifications like AWS, Google Cloud, or Kubernetes are commonRequires strong background in systems engineering, scripting, and cloud platforms; certifications like AWS, GCP, or Linux are often preferred
Work EnvironmentFocuses on overseeing observability tools, data analysis, and team coordination in tech environmentsHands-on role involving system automation, incident response, and infrastructure reliability
Industry UsageUsed across tech companies to improve system visibility and performanceCommon in DevOps and SRE teams to ensure system reliability and uptime

The Observability Manager primarily oversees monitoring and logging strategies, ensuring system visibility, while the Site Reliability Engineer is more hands-on, focusing on automating infrastructure and maintaining system reliability. Both roles require technical expertise and often collaborate closely but differ in scope and daily responsibilities.

What are the most commonly searched types of Observability jobs in Alabama? The most popular types of Observability jobs in Alabama are:
What are popular job titles related to Observability Manager jobs in Alabama? For Observability Manager jobs in Alabama, the most frequently searched job titles are:
What job categories do people searching Observability Manager jobs in Alabama look for? The top searched job categories for Observability Manager jobs in Alabama are:
What cities in Alabama are hiring for Observability Manager jobs? Cities in Alabama with the most Observability Manager job openings:
Manager of Site Reliability Engineering (SRE)

Manager of Site Reliability Engineering (SRE)

Genuine Parts Company

Birmingham, AL • On-site

$53.50 - $71/hr

Full-time

Posted 24 days ago


Genuine Parts Company rating

6.8

Company rating: 6.8 out of 10

Based on 57 frontline employees who took The Breakroom Quiz

215th of 333 rated retail wholesalers


Job description

SUMMARY:
The Manager of Site Reliability Engineering leads and develops a team of SRE practitioners focused on delivering highly reliable, scalable, and performant cloud-based infrastructure and services. This role ensures the implementation of SRE principles, drives automation, observability, and incident management practices to enhance system reliability, and collaborates across development and operations teams to support continuous delivery and robust cloud platform operations.
You must be eligible to work in the US without Visa Sponsorship
JOB DUTIES
• Lead, mentor, and grow a high-performing team of Site Reliability Engineers, fostering a culture of ownership, continuous improvement, and operational excellence.
• Implement and champion Site Reliability Engineering principles and DevOps best practices within the team to ensure service reliability, availability, and performance.
• Define and track key SRE metrics such as service uptime, incident response and resolution times.
• Drive automation efforts including CI/CD pipeline enhancements, infrastructure-as-code practices, and self-service infrastructure provisioning to increase deployment velocity while reducing manual toil.
• Own and continuously improve observability practices including system monitoring, logging, alerting, and diagnostics to ensure rapid issue detection and resolution.
• Participate in incident response processes including incident management, root cause analysis, post-mortems, and continuous improvement to enhance system resilience.
• Partner closely with software engineering, product management, architecture, and security teams to embed reliability and security early in the software development lifecycle (SDLC).
• Oversee the management and scalability of cloud infrastructure environments, primarily on Google Cloud Platform (GCP), with a focus on Kubernetes, container orchestration, and hybrid cloud integrations.
• Advocate for and apply best practices in performance tuning, capacity planning, and system design for high availability.
• Develop and execute a long-term roadmap for our hybrid cloud platform, aligning with evolving business objectives and technology trends.
• Establish and monitor key performance indicators (KPIs) service level indicators (SLIs) and service level objectives (SLOs) to drive system health and stability.
EDUCATION & EXPERIENCE
Typically requires a bachelor's degree and 7 years of experience in a technology and/or software engineering role or an equivalent combination
KNOWLEDGE, SKILLS, ABILITIES
Experience & Leadership
• Proven experience working in large, complex enterprise environments (Fortune 500 or equivalent).
Site Reliability Engineering & DevOps Practices
• Strong understanding and demonstrated implementation of Site Reliability Engineering (SRE) principles at scale.
• Hands-on experience with infrastructure-as-code (IaC) tools such as Terraform, and ArgoCD.
• In-depth knowledge and practical experience with CI/CD pipelines and automation of software delivery.
• Championing DevOps practices and embedding reliability early in the SDLC.
• Significant hands-on experience in Site Reliability Engineering or related roles focused on cloud infrastructure reliability.
• Strong software engineering background with proficiency in infrastructure-as-code tools (e.g., Terraform, ArgoCD) and CI/CD automation.
• Deep knowledge of cloud platforms, specifically Google Cloud Platform (GCP), Kubernetes, container orchestration, and cloud-native architecture.
• Familiarity with monitoring and observability tools such as Dynatrace, Datadog, or equivalents.
• Experience managing high-availability systems in 24/7 operational environments.
• Ability to collaborate cross-functionally and drive alignment across engineering, product, and security teams.
Tools & Monitoring
• Experience with monitoring, logging, and observability platforms.
• Familiarity with incident management and performance monitoring tools, including Dynatrace and Datadog.
• Proficient in cloud deployment tooling and automation frameworks.
• Experience with Azure DevOps (ADO) or equivalent CI/CD tools.
Core Technical Skills
• Strong software engineering and infrastructure background.
• Solid understanding of Kubernetes, container orchestration, cluster management, and elastic scalability.
• Experience with API-driven, event driven and microservices architectures.
• • Skilled in performance diagnostics, capacity planning, tuning, and system architecture for high-availability systems.
Not the right fit? Let us know you're interested in a future opportunity by joining our Talent Community on jobs.genpt.com or create an account to set up email alerts as new job postings become available that meet your interest!
GPC conducts its business without regard to sex, race, creed, color, religion, marital status, national origin, citizenship status, age, pregnancy, sexual orientation, gender identity or expression, genetic information, disability, military status, status as a veteran, or any other protected characteristic. GPC's policy is to recruit, hire, train, promote, assign, transfer and terminate employees based on their own ability, achievement, experience and conduct and other legitimate business reasons.

What Genuine Parts Company employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom