1

Reliability Manager Jobs in Michigan (NOW HIRING)

Site Reliability Engineer

Northville, MI · On-site

$53.75 - $71.25/hr

Site Reliability Engineer (SRE) About Liveline Liveline enables dramatic improvements in ... Our focus is on automating complex processes, not simply providing dashboards for managers and ...

System Reliability Controller (All Levels)

Novi, MI · On-site

$96.60K - $121.50K/yr

Monitors the transmission system security using the Energy Management System (EMS) and other reliability related applications. * Performs power flow studies when necessary to develop plans to address ...

Site Reliability Engineer

Northville, MI · On-site

$53.75 - $71.25/hr

Site Reliability Engineer (SRE) About Liveline Liveline enables dramatic improvements in ... Our focus is on automating complex processes, not simply providing dashboards for managers and ...

SRE Engineer

Dearborn, MI

$52.75 - $70/hr

Incident Management: Proven experience managing high-severity incidents. You understand the ... Reliability Framework: Define and track meaningful Service Level Indicators (SLIs) and Objectives ...

Manager, ServiceNow SRE Engineer

Grand Rapids, MI · On-site

$54.75 - $72.75/hr

Manager, ServiceNow SRE Engineer Role Overview: As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility ...

Manager, ServiceNow SRE Engineer

Midland, MI · On-site

$49 - $65/hr

Manager, ServiceNow SRE Engineer Role Overview: As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility ...

Manager, ServiceNow SRE Engineer

Detroit, MI · On-site

$56.50 - $75/hr

Manager, ServiceNow SRE Engineer Role Overview: As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility ...

Linux Site Reliability Engineer

Livonia, MI

$50.50 - $67/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

Linux Site Reliability Engineer

Livonia, MI · On-site

$53.25 - $71/hr

Linux SRE will administer and support Linux-based applications, Linux-based data systems, Linux-based infrastructures, data center technologies, and other hosted/managed technologies. Additionally ...

ServiceNow SRE Engineering Manager

Midland, MI · On-site

$49 - $65/hr

As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility projects. Your expertise will be pivotal in ...

ServiceNow SRE Engineering Manager

Grand Rapids, MI · On-site

$54.75 - $72.75/hr

As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility projects. Your expertise will be pivotal in ...

Senior Site Reliability Engineer

Detroit, MI · On-site

$56.25 - $74.75/hr

... • Manage and optimize containerized environments using Kubernetes • Support and maintain ... reliability engineering • Experience operating cloud platforms (AWS, Azure, or GCP) • Strong ...

Software Engineer - SRE

Warren, MI · Hybrid

$53.50 - $71.25/hr

As part of Site Reliability Engineering (SRE) database group at General Motors, you'll join a ... We leverage engineering principles to manage operations effectively and build solutions that enable ...

ServiceNow SRE Engineering Manager

Detroit, MI · On-site

$56.50 - $75/hr

As a Manager, ServiceNow SRE Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility projects. Your expertise will be pivotal in ...

next page

Showing results 1-20

Reliability Manager information

See Michigan salary details

$54K

$102.4K

$146.9K

How much do reliability manager jobs pay per year?

As of May 29, 2026, the average yearly pay for reliability manager in Michigan is $102,402.00, according to ZipRecruiter salary data. Most workers in this role earn between $82,400.00 and $122,000.00 per year, depending on experience, location, and employer.

What does a Reliability Manager do?

A Reliability Manager is responsible for ensuring that equipment, processes, and systems operate efficiently and consistently to minimize downtime and maximize performance. They develop and implement reliability strategies, conduct root cause analyses, and oversee preventive and predictive maintenance programs. Their role involves working closely with maintenance teams, engineers, and production staff to improve asset reliability and extend equipment lifespan. Additionally, they analyze failure data, recommend improvements, and help optimize operational costs through reliability-centered maintenance practices.

What are the key skills and qualifications needed to thrive in the Reliability Manager position, and why are they important?

A Reliability Manager needs strong analytical skills, a solid background in engineering or maintenance, and experience with reliability-centered maintenance methodologies. Familiarity with tools like Failure Mode and Effects Analysis (FMEA), Root Cause Analysis (RCA), and certifications such as Certified Reliability Engineer (CRE) are often required. Leadership, problem-solving, and the ability to communicate complex technical information clearly are crucial soft skills for this role. These skills help ensure equipment uptime, optimize maintenance processes, and foster a culture of continuous improvement within the organization.

What are some typical daily responsibilities of a Reliability Manager?

A Reliability Manager typically spends their day analyzing equipment performance data, identifying trends, and implementing strategies to reduce downtime and improve asset reliability. They lead investigations into failures using proven methodologies like Root Cause Analysis, and work closely with maintenance, engineering, and operations teams to develop maintenance plans and improvements. Regular responsibilities also include managing reliability projects, training staff in best practices, and ensuring compliance with safety and regulatory standards. This role often requires a balance of hands-on technical work and cross-functional collaboration to drive operational excellence.
What are the most commonly searched types of Reliability jobs in Michigan? The most popular types of Reliability jobs in Michigan are:
What cities in Michigan are hiring for Reliability Manager jobs? Cities in Michigan with the most Reliability Manager job openings:
Infographic showing various Reliability Manager job openings in Michigan as of May 2026, with employment types broken down into 98% Full Time, and 2% Contract. Highlights an 91% In-person, 2% Hybrid, and 7% Remote job distribution, with an average salary of $102,402 per year, or $49.2 per hour.
Site Reliability Engineer

Site Reliability Engineer

Cooper Standard

Northville, MI • On-site

$53.75 - $71.25/hr

Full-time

Posted 9 days ago


Cooper Standard rating

5.7

Company rating: 5.7 out of 10

Based on 35 frontline employees who took The Breakroom Quiz

498th of 511 rated manufacturers


Job description

Job Description:
Site Reliability Engineer (SRE)
About Liveline
Liveline enables dramatic improvements in manufacturing performance thorough a unique application of artificial intelligence to provide real-time process control and predictive assistants for plant personnel. Our focus is on automating complex processes, not simply providing dashboards for managers and operators.
Our team combines experts in AI with world-class process engineers who can focus on the "last mile" with customers: Extracting data from the process and implementing controls on the shop floor. We speak the language of AI but also industrial controllers.
Our hardware and software offerings are scalable and cost-effective whether customers have one production line or hundreds, delivering an ROI that's attractive to small and medium-sized enterprises.
We are passionate about democratizing the power of analytics and advanced automation for manufacturers of almost any size. Through our approach, producers can de-mystify complex processes and free up valuable technicians to focus on more advanced tasks instead of constantly monitoring and adjusting equipment parameters.
A Liveline Technologies SRE is responsible for the reliability, performance, observability, and operational excellence of Liveline's production services. This spans from the factory-floor edge systems to AWS cloud components. You will help build and run resilient infrastructure, automate repetitive work with code (Terraform, Bash, Python), implement monitoring and alerting (Prometheus/Grafana), and participate in incident response/on-call to ensure uptime for mission-critical manufacturing systems. You'll collaborate closely with controls engineers, data scientists, and software teams to safely deploy changes, define SLIs/SLOs, and continuously improve availability and latency for real-time process control.
Primary Responsibilities
  • Operate Production Systems: Maintain high availability, performance, and security of Liveline's production stack across AWS and plant/edge environments.
  • Observability & Monitoring: Stand up, tune, and maintain Prometheus/Grafana dashboards, alerts, recording rules, and runbooks. Implement logs/traces (e.g., OpenTelemetry) and actionable alerting.
  • Infrastructure as Code: Build and manage reproducible infrastructure with Terraform (VPC, IAM, EC2/EKS/ECS, RDS, S3, CloudWatch, CloudTrail). Apply version control, code reviews, and plan/apply workflows.
  • Automation & Tooling: Write Bash and Python scripts and small services to automate operational tasks, health checks, failover routines, backup/restore, and environment bootstrapping.
  • NOC / Incident Response: Participate in a follow-the-sun/on-call rotation; triage and resolve incidents, lead initial comms, and produce blameless postmortems with clear corrective actions.
  • SLIs/SLOs/Error Budgets: Define and instrument SLIs (availability, latency, error rate, freshness), set SLOs with stakeholders, and manage error budgets to guide release velocity and reliability tradeoffs.
  • Networking & Connectivity: Support secure, reliable connectivity between factory networks and cloud (site-to-site VPNs, routing, DNS, TLS, private subnets, security groups, network ACLs).
  • Databases & Storage: Operate and tune PostgreSQL/TimescaleDB, InfluxDB, or similar time-series/relational stores; manage backups, PITR, replication, partitioning, and performance baselining.
  • CI/CD & Release Engineering: Contribute to build/deploy pipelines (e.g., GitHub Actions/GitLab CI), implement canaries/blue-green strategies, and enforce change management and rollback plans.
  • Security & Compliance: Enforce least-privilege IAM, secret management (AWS Secrets Manager/SSM), encryption, artifact signing, and basic hardening for Linux and Kubernetes workloads.
  • Edge & OT Collaboration: Partner with process/controls engineers to ensure reliable data ingestion from PLCs/industrial gateways (e.g., OPC UA/Modbus), and safe deploys to plant edge nodes.
  • Cost, Capacity & Performance: Right-size compute/storage, set budgets/alerts, forecast capacity, and optimize resource utilization without compromising SLOs.
  • Documentation & Runbooks: Author and maintain runbooks, architecture diagrams, operational playbooks, and disaster recovery procedures.

Education and Qualifications:
  • Bachelor's Degree in IT, Computer Science, or Computer Engineering (or equivalent experience).
  • 5+ years of experience in a corporate IT or startup setting
  • Familiar with containers (Docker) and orchestration (Kubernetes or ECS).
  • Experience running production workloads, participating in on-call, and writing postmortems.
  • Strong communication skills with the ability to explain tradeoffs to non-SRE stakeholders.
  • Intellectual curiosity, ownership mindset, and bias for automation.
  • Willingness and ability to travel to customer sites and plants, as necessary.

Nice to Have
  • Kubernetes (EKS), Helm, Kustomize.
  • Service Mesh/Ingress (Envoy, NGINX, ALB).
  • Logging/Tracing: OpenSearch/ELK, Loki, OpenTelemetry.
  • Config Management: Ansible.
  • Secrets & PKI: HashiCorp Vault, mTLS.
  • Edge/Industrial Protocols: OPC UA, Modbus, MQTT; experience with industrial gateways.
  • Compliance exposure (SOC 2, ISO 27001) and change management (ITIL).

Position Type:
Regular
Additional Locations:
Additional Information:
Cooper Standard is proud of its diverse workforce and committed to providing equal employment opportunities to applicants and employees without regard to race, color, religion, sex, national origin, genetic information, physical or mental disability, age, veteran or military status, or any other characteristic protected by applicable law. We are dedicated to creating an environment at work that not only values diversity but also encourages inclusion and a sense of belonging. We firmly believe that a diverse workplace fosters an environment where our employees can flourish and provide superior service to our customers. Because we recognize and value the range of ways in which people acquire experiences, whether personal, professional, or via education or volunteerism, we invite interested applicants to evaluate the key duties and requirements and apply for any opportunities that fit your experience and qualifications. Applicants with disabilities may be entitled to reasonable accommodations under the Americans with Disabilities Act, as well as certain state and/or local laws. If you believe you require such assistance to complete our online application or to participate in an interview, you (or someone on your behalf) may request assistance by emailing recruitment@cooperstandard.com with a description of the accommodation you seek. Application materials submitted to this email address will not be considered.
Remote Status:
Hybrid

What Cooper Standard employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Cooper Standard logo

About Cooper Standard

Sourced by ZipRecruiter

Cooper Standard, headquartered in Northville, Mich. USA, with locations in 21 countries, is a leading global supplier of sealing and fluid handling systems and components. Utilizing our materials science and manufacturing expertise, we create innovative and sustainable engineered solutions for diverse transportation and industrial markets. Cooper Standard's approximately 23,000 employees are at the heart of our success, continuously improving our business and surrounding communities.

Industry

Transportation equipment manufacturing

Company size

10,000+ Employees

Headquarters location

Novi, MI, US

Year founded

1960

Social media