2

Remote Reliability Engineer Jobs in Michigan (NOW HIRING)

Senior Site Reliability Engineer

Grand Rapids, MI · On-site +1

$54.75 - $72.75/hr

We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements ... Remote worker reimbursement of $300/year * Professional development reimbursement * Competitive pay

Senior Site Reliability Engineer

Grand Rapids, MI · On-site +1

$54.75 - $72.75/hr

We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements ... Remote worker reimbursement of $300/year * Professional development reimbursement * Competitive pay

This is a permanent, full-time, remote position. US Pay Band - $110K - $150K Actual compensation ... This role sits at the intersection of Operations and Engineering, bringing structure to incident ...

This is a permanent, full-time, remote position. US Pay Band - $110K - $150K Actual compensation ... This role sits at the intersection of Operations and Engineering, bringing structure to incident ...

This is a permanent, full-time, remote position. US Pay Band - $110K - $150K Actual compensation ... This role sits at the intersection of Operations and Engineering, bringing structure to incident ...

This is a permanent, full-time, remote position. US Pay Band - $110K - $150K Actual compensation ... This role sits at the intersection of Operations and Engineering, bringing structure to incident ...

This is a permanent, full-time, remote position. US Pay Band - $110K - $150K Actual compensation ... This role sits at the intersection of Operations and Engineering, bringing structure to incident ...

Senior Network Engineer

Dearborn, MI · Remote

$99.60K - $192.90K/yr

Knowledge and Experience in SRE (Site Reliability Engineering) principles * Familiarity with ... Remote #LI-PS2 * Bachelor of Engineering or equivalent - Requirement * 10+ years of relevant ...

New

next page

Showing results 1-20

Remote Reliability Engineer information

What are the key skills and qualifications needed to thrive as a Remote Reliability Engineer, and why are they important?

To thrive as a Remote Reliability Engineer, you need a strong background in systems engineering, software development, and infrastructure management, often supported by a degree in computer science or a related field. Proficiency with cloud platforms (such as AWS, Azure, or GCP), monitoring tools (like Prometheus, Grafana), and relevant certifications (e.g., AWS Certified DevOps Engineer) is highly valuable. Excellent problem-solving, communication, and collaboration skills are crucial for working effectively across distributed teams and responding to incidents. These abilities ensure system reliability, quick incident resolution, and seamless remote teamwork, which are vital for maintaining high service uptime and user satisfaction.

How do Remote Reliability Engineers typically collaborate with on-site teams to address urgent technical issues?

Remote Reliability Engineers often utilize a combination of video conferencing, instant messaging, and collaborative monitoring tools to stay closely connected with on-site teams. When urgent technical issues arise, they participate in real-time troubleshooting sessions, analyze system logs remotely, and may guide on-site staff through step-by-step resolution procedures. Building strong communication channels and regular check-ins are essential to ensure swift and effective collaboration, even across different time zones. This structure allows Remote Reliability Engineers to contribute significantly to system uptime while working from a distance.

What is a Remote Reliability Engineer?

A Remote Reliability Engineer is a professional who works from a remote location to ensure that systems, applications, or infrastructure are reliable, available, and performing well. Their responsibilities typically include monitoring system health, diagnosing issues, implementing preventative measures, and collaborating with teams to improve system reliability. They often use tools for automation, incident response, and performance monitoring, all while working offsite. This role is critical in minimizing downtime and ensuring a smooth user experience, especially for companies with complex technical environments. Remote Reliability Engineers must have strong problem-solving skills and be proficient in cloud technologies, automation, and incident management.

What is the difference between Remote Reliability Engineer vs Remote Site Reliability Engineer?

AspectRemote Reliability EngineerRemote Site Reliability Engineer
CredentialsTypically requires certifications like AWS Certified Solutions Architect, Linux Foundation certificationsSimilar credentials, often with additional focus on site-specific tools and monitoring
Work EnvironmentPrimarily remote, focusing on cloud infrastructure and system reliabilityRemote with some on-site responsibilities, focusing on infrastructure and operational stability
Industry UsageUsed across tech, cloud providers, SaaS companiesCommon in data centers, cloud providers, and large enterprise IT
Search & Comparison IntentOften compared due to overlapping roles in system reliability and cloud infrastructureCompared for on-site vs remote operational responsibilities

The main difference is that Remote Reliability Engineers focus on cloud and system reliability remotely, while Remote Site Reliability Engineers may have some on-site duties related to infrastructure. Both roles require similar skills and certifications but differ in their work environment and specific responsibilities.

What are the most commonly searched types of Reliability Engineer jobs in Michigan? The most popular types of Reliability Engineer jobs in Michigan are:
What job categories do people searching Remote Reliability Engineer jobs in Michigan look for? The top searched job categories for Remote Reliability Engineer jobs in Michigan are:
What cities in Michigan are hiring for Remote Reliability Engineer jobs? Cities in Michigan with the most Remote Reliability Engineer job openings:
Infographic showing various Remote Reliability Engineer job openings in Michigan as of May 2026, with employment types broken down into 84% Full Time, 10% Part Time, 1% Temporary, 4% Contract, and 1% Nights. Highlights an 81% Physical, 7% Hybrid, and 12% Remote job distribution.
Senior Site Reliability Engineer

Senior Site Reliability Engineer

CertifID

Grand Rapids, MI • On-site, Remote

$54.75 - $72.75/hr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 20 days ago


Job description

Cybercrime is rising, reaching record highs in 2024. According to the FBI's IC3 report, total losses exceeded $16 billion. With investment fraud and BEC scams at the forefront, the message is clear: the real estate sector remains a lucrative target for cybercriminals. At CertifID, we take this threat seriously and provide a secure platform that verifies the identities of parties involved in transactions, authenticates wire transfer instructions, and detects potential fraud attempts. Our technology is designed to mitigate risks and ensure that every transaction is conducted with confidence and peace of mind.

We know we couldn't take on this challenge without our incredible team. We have been recognized as one of the Best Startups to Work for in Austin, made the Inc. 5000 list, and won Best Culture by Purpose Jobs three years in a row. We are guided by our core values and our vision of a world without wire fraud. We offer a dynamic work environment where you can contribute to meaningful impact and be part of a team dedicated to enhancing security and fighting fraud.

We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements across our production SaaS environment. You'll play a critical role in building scalable infrastructure patterns, advancing observability, improving incident response, and partnering with engineering teams to embed reliability into system design and delivery.
 
This role is ideal for an experienced Sr. SRE who enjoys solving complex operational problems, building automation, and mentoring others.
What You'll Do
  • Reliability & Platform Operations: Own and improve the reliability, availability, and performance of production systems while defining and operationalizing SLIs/SLOs and error budgets.
  • AI Agent Enablement:  Design and implement autonomous and semi-autonomous AI agents for monitoring distributed systems and applications. Build agents capable of consuming multi-source observability data (metrics, logs, traces, etc.).
  • Incident Response: Participate in and help lead an on-call rotation, serving as an escalation point for major incidents and facilitating blameless postmortems.
  • Automation & Infrastructure: Build automated workflows to eliminate manual work and design/maintain Infrastructure-as-Code with Terraform.
  • Observability: Improve metrics, logs, traces, and alerting using tools like Datadog or Prometheus to reduce noise and increase signal.
  • Collaboration & Mentorship: Partner with application teams to implement reliability best practices and mentor junior engineers to foster a culture of knowledge sharing.
Who You Are
  • Strategic Architect: You look beyond the "what" to understand the "why," providing insights that influence our GTM and technical direction.
  • Startup Veteran: You are comfortable moving fast and staying proactive in an environment where the playbook is still being written.
  • Relatable & Adaptable: You can navigate different personalities across the organization, from high-energy sales teams to analytical engineering partners.
  • Lifelong Learner: You have a thirst for learning, keeping up with emerging technologies and industry trends.
What We're Looking For
  • Experience: 5+ years in SRE, DevOps, Platform Engineering, or Infrastructure Engineering.
  • Cloud Expertise: Proven experience supporting production SaaS systems in Azure (preferred), AWS, or GCP.
  • Technical Stack: Strong Linux, networking, and distributed systems troubleshooting skills.
  • Containers: Strong experience with containers and orchestration (Kubernetes/EKS/AKS).
  • IaC & Tooling: Expertise with Infrastructure-as-Code (Terraform strongly preferred).
  • Programming: Strong scripting/programming skills in Python, Go, Bash, or C#/.NET.
  • Observability: Hands-on experience with Datadog, Prometheus/Grafana, or OpenTelemetry.
What We Offer
  • Flexible vacation
  • 12 company-paid holidays
  • 10 paid sick days
  • No work on your birthday
  • Health, dental, and vision Insurance (including a $0 option)
  • 401(k) with matching, and no waiting period
  • Equity
  • Life insurance
  • Generous parental paid leave
  • Wellness reimbursement of $300/year
  • Remote worker reimbursement of $300/year
  • Professional development reimbursement
  • Competitive pay
  • An award-winning culture
Not sure if you check all the boxes? Apply anyway! 

We know that great talent comes in many forms, and we value potential just as much as experience. If you're excited about this role and believe you can grow into it, we'd love to hear from you. We're looking for people who are eager to learn, adapt, and solve challenges-so if that sounds like you, don't let a checklist hold you back!

Change doesn't happen overnight, and the same goes for us here at CertifID. We evolve collectively and individually as we grow by leaning into the core values that define us. As we grow, we embody GRIT-collectively and individually-to raise the bar and influence outcomes in everything we do. Guard the Customer - Raise the Bar - Influence Outcomes - Teamwork Wins
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
apply for this job