Remote Reliability Engineer Jobs in Ontario (NOW HIRING)

Document & share - Keep diagrams, runbooks, and change records current so the wider Ops and SRE ... Employee Resource Groups EEO/VEVRAA #LI-BV1 #LI-REMOTE

Design and own the SRE function for Level 1 data ingestion across all GCP deployments: alert policy ... Comfort with distributed teams and experience managing offshore or remote team members * Ability to ...

In this role, you'll be instrumental in developing, maintaining, and optimizing the infrastructure, services, and tooling that empower our development and SRE teams to rapidly and reliably deliver ...

What You Bring * 3+ years of DevOps, SRE, platform engineering, cloud infrastructure, or systems ... Strong Terraform skills, including modules, remote state concepts, review practices, and safe ...

Platform Engineer, Databases

Toronto, ON · Remote

CA$137K - CA$157K/yr

Readiness to work with the rest of the SRE team and develop their MySQL knowledge * A portfolio of ... ProxySQL and the Percona toolkit #LI-Remote What you will find here: Compensation is one of the ...

Manage and support a team of site reliability engineers, focusing on technical guidance, mentoring, and career development. * Participate in hands-on technical problem solving, design reviews, and ...

Infrastructure Engineer

Toronto, ON · Remote

CA$140K - CA$240K/yr

... reliability, security, or compliance. This role contributes to the development and evolution of ... This is a fully remote position that offers a competitive salary range of $140,000 to $240,000 USD ...

Senior Infrastructure Engineer

Toronto, ON · Remote

CA$170K - CA$220K/yr

Remote - Remote - Based In ET+2 / -3, NY Preferred Remote | Full-time Compensation: $170K - $220K ... reliability, and speed. This individual will bring a software engineering mindset to infrastructure ...

Lead Cloud Engineer

Mississauga, ON · On-site +1

CA$122K - CA$162K/yr

Minimum 8+ years of experience in a Software Engineering or Infrastructure team in a Cloud, PaaS, and DevOps environment. * 5+ years of full-stack software development, or SRE experience. * 5+ years ...

Remote Reliability Engineer information

What is the difference between Remote Reliability Engineer vs Remote Site Reliability Engineer?

| Aspect | Remote Reliability Engineer | Remote Site Reliability Engineer |
| --- | --- | --- |
| Credentials | Typically requires certifications like AWS Certified Solutions Architect, Linux Foundation certifications | Similar credentials, often with additional focus on site-specific tools and monitoring |
| Work Environment | Primarily remote, focusing on cloud infrastructure and system reliability | Remote with some on-site responsibilities, focusing on infrastructure and operational stability |
| Industry Usage | Used across tech, cloud providers, SaaS companies | Common in data centers, cloud providers, and large enterprise IT |
| Search & Comparison Intent | Often compared due to overlapping roles in system reliability and cloud infrastructure | Compared for on-site vs remote operational responsibilities |

The main difference is that Remote Reliability Engineers focus on cloud and system reliability remotely, while Remote Site Reliability Engineers may have some on-site duties related to infrastructure. Both roles require similar skills and certifications but differ in their work environment and specific responsibilities.

What are the key skills and qualifications needed to thrive as a Remote Reliability Engineer, and why are they important?

To thrive as a Remote Reliability Engineer, you need a strong background in systems engineering, software development, and infrastructure management, often supported by a degree in computer science or a related field. Proficiency with cloud platforms (such as AWS, Azure, or GCP), monitoring tools (like Prometheus, Grafana), and relevant certifications (e.g., AWS Certified DevOps Engineer) is highly valuable. Excellent problem-solving, communication, and collaboration skills are crucial for working effectively across distributed teams and responding to incidents. These abilities ensure system reliability, quick incident resolution, and seamless remote teamwork, which are vital for maintaining high service uptime and user satisfaction.

How do Remote Reliability Engineers typically collaborate with on-site teams to address urgent technical issues?

Remote Reliability Engineers often utilize a combination of video conferencing, instant messaging, and collaborative monitoring tools to stay closely connected with on-site teams. When urgent technical issues arise, they participate in real-time troubleshooting sessions, analyze system logs remotely, and may guide on-site staff through step-by-step resolution procedures. Building strong communication channels and regular check-ins are essential to ensure swift and effective collaboration, even across different time zones. This structure allows Remote Reliability Engineers to contribute significantly to system uptime while working from a distance.

What is a Remote Reliability Engineer?

A Remote Reliability Engineer is a professional who works from a remote location to ensure that systems, applications, or infrastructure are reliable, available, and performing well. Their responsibilities typically include monitoring system health, diagnosing issues, implementing preventative measures, and collaborating with teams to improve system reliability. They often use tools for automation, incident response, and performance monitoring, all while working offsite. This role is critical in minimizing downtime and ensuring a smooth user experience, especially for companies with complex technical environments. Remote Reliability Engineers must have strong problem-solving skills and be proficient in cloud technologies, automation, and incident management.

Senior / Staff Software Engineer (Observability / SRE)

Waabi

Toronto, ON • On-site, Remote

CA$148K - CA$249K/yr

Full-time

Medical, Dental, Vision, PTO

Posted 3 days ago


Job description

Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis. Waabi is backed by and partners with world leaders in AI, automotive, logistics, and deep tech.

With offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. To learn more visit: www.waabi.ai

You will:
- Design and lead the architecture and development of Waabi's monitoring and observability stack, used to monitor the health and performance of cloud and on-prem environments.
- Develop and extend workloads and benchmarks (compute, storage, network, ML/AI) and integrate stress, chaos, and regression tests to validate hardware and platform choices.
- Analyze and optimize end-to-end performance across hardware, firmware, Linux kernel, runtimes, and distributed services using advanced profiling tools (perf, eBPF, flamegraphs, tracing frameworks).
- Build automation and observability tooling (Go/Python/Java, Kubernetes/Docker) for CI/CD-based performance regression detection, telemetry, alerting, and anomaly detection.
- Work with client teams to support their applications' observability requirements.
- Influence system architecture and tooling decisions that improve how Waabi builds, monitors, and scales its infrastructure.
- Drive execution and quality, writing design docs, setting milestones, mentoring ICs, and communicating insights and results to stakeholders and leadership.
Qualifications:
- 5+ years software engineering or systems/performance engineering experience (BS in CS/EE or related), with demonstrated end-to-end ownership of complex projects.
- Proficient in at least one of: Python, Rust, C/C++; strong CS fundamentals and system design skills.
- Hands-on with Linux internals (CPU scheduling, memory, I/O, networking) and perf tooling (perf, eBPF, flamegraphs, tracing frameworks).
- Experience with Kubernetes, microservices, and distributed systems; comfort building production services and pipelines.
- Proven track record of clear communication, writing design docs, and leading cross-functional efforts.
Bonus:
- Experience deploying and managing observability platforms (OpenTelemetry, Grafana OSS).
- Performance tuning for databases/streaming/batch/ML platforms; GPU/xPU or Arm performance exposure.
- Experience tuning stream processing, batch or ML platforms (e.g. Argo Workflows, PyTorch).
- Familiarity with microservices debugging and distributed tracing (OpenTelemetry, Prometheus).
The US yearly salary range for this role is: $148,000 - $249,000 USD in addition to competitive perks & benefits. Waabi (US) Inc.'s yearly salary ranges are determined based on several factors in accordance with the Company's compensation practices. The salary base range is reflective of the minimum and maximum target for new hire salaries for the position across all US locations. Note: The Company provides additional compensation for employees in this role, including equity incentive awards and an annual performance bonus.

Perks/Benefits:
- Competitive compensation and equity awards.
- Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).
- Unlimited Vacation.
- Flexible hours and Work from Home support.
- Daily drinks, snacks and catered meals (when in office).
- Regularly scheduled team building activities and social events both on-site, off-site & virtually.
- As we grow, this list continues to evolve!

Waabi is a technology start-up building technologies to transform the way the world moves. Join our talented team to be a part of the future and to make an impact!

Waabi is an equal opportunity employer. We celebrate diversity and are committed to creating a supportive, inclusive, and accessible workplace for all our employees. We seek applicants of all backgrounds and identities, across race, color, ethnicity, national origin or ancestry, age, citizenship, religion, sex, sexual orientation, gender identity or expression, military or veteran status, marital status, pregnancy or parental status, caregiver status, disability, or any other characteristic protected by law. We make workplace accommodations for qualified individuals with disabilities as required by applicable law. If reasonable accommodation is needed to participate in the job application or interview process please let our recruiting team know.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.