2

Remote Chaos Engineering Jobs (NOW HIRING)

... chaos engineering techniques such as edge cases, failure modes and design review • Advise on capacity planning and provide continuous assessments on systems behavior and consumption • Work with ...

New

Pre/Post Sales Solutions Architect

$64.50 - $85/hr

As the industry leader in Chaos Engineering and reliability testing, we work with hundreds of the ... But as a remote company, teamwork and collaboration won't happen by accident. We approach every ...

Staff AI Architect, Remote

Charleston, WV · Remote

$58.25 - $76.75/hr

Experience establishing SRE practices including SLO definition, error budgets, runbooks, chaos ... Flexible work environment, ability to work remote, hybrid or in-office * Flexible time off ...

$58.25 - $76.75/hr

Experience establishing SRE practices including SLO definition, error budgets, runbooks, chaos ... Flexible work environment, ability to work remote, hybrid or in-office * Flexible time off ...

Experience establishing SRE practices including SLO definition, error budgets, runbooks, chaos ... Flexible work environment, ability to work remote, hybrid or in-office * Flexible time off ...

Associate Site Reliability Engineer

OR · Remote

$57 - $75.75/hr

... chaos engineering and more. The SRE role is a blend of infrastructure, networking, operating ... Apply now and help us build the future of IT! #LI-Remote #LI-AA1

... remote global workforce. If you are passionate about working on business problems that can be ... chaos engineering, performance engineering, toil reduction, reliability engineering etc

$44 - $58.50/hr

... including chaos engineering practices) * Improve developer experience by enabling self-service ... Remote-first work model with global collaboration * Opportunity to work on high-impact systems ...

Experience with Chaos Engineering methodologies in a public cloud environment #LI-Remote #LI-YC2 Zscaler's salary ranges are benchmarked and are determined by role and level. The range displayed on ...

next page

Showing results 1-20

Remote Chaos Engineering information

See salary details

$73K

$194.7K

$254K

How much do remote chaos engineering jobs pay per year?

As of Jun 6, 2026, the average yearly pay for remote chaos engineering in the United States is $194,709.00, according to ZipRecruiter salary data. Most workers in this role earn between $141,500.00 and $253,000.00 per year, depending on experience, location, and employer.

What is Remote Chaos Engineering?

Remote Chaos Engineering is the practice of testing distributed systems' resilience by intentionally introducing failures and disruptions in remote or cloud environments. The goal is to identify weaknesses and improve system reliability by simulating real-world incidents, such as network outages or server crashes, in a controlled manner. This approach helps teams understand how their applications behave under stress and develop strategies to mitigate future incidents. Remote Chaos Engineering is particularly valuable for organizations leveraging cloud infrastructure and remote services, ensuring robust performance even under unexpected conditions.

What are some common challenges faced by professionals working in remote chaos engineering roles?

Professionals in remote chaos engineering often encounter challenges such as coordinating experiments across distributed teams, ensuring clear communication about system vulnerabilities, and managing the complexity of large-scale systems without direct, on-site access. Establishing robust monitoring and rollback procedures is essential to minimize risk during remote testing. Additionally, building trust with development and operations teams is key, as chaos engineering often involves intentionally introducing failures to improve system resilience.

What are the key skills and qualifications needed to thrive as a Remote Chaos Engineer, and why are they important?

To thrive as a Remote Chaos Engineer, you need a strong background in software engineering, systems architecture, and site reliability, often supported by a degree in computer science or a related field. Familiarity with chaos engineering platforms (such as Gremlin or Chaos Monkey), cloud environments (AWS, Azure, GCP), and automation tools is typically required. Strong problem-solving abilities, clear communication, and a collaborative mindset help you effectively identify weaknesses and drive reliability improvements across distributed teams. These skills are crucial for proactively uncovering system vulnerabilities, ensuring system resilience, and maintaining high availability in complex, remote-first infrastructures.

What is the difference between Remote Chaos Engineering vs Remote Site Reliability Engineer?

AspectRemote Chaos EngineeringRemote Site Reliability Engineer
Primary FocusDesigning and executing chaos experiments to improve system resilienceEnsuring system reliability, availability, and performance through monitoring and automation
Skills & CertificationsKnowledge of chaos engineering tools, scripting, cloud platformsMonitoring tools, scripting, cloud infrastructure, SRE certifications
Work EnvironmentCollaborates with development and operations teams, often in DevOps cultureWorks closely with engineering teams to maintain system health and SLAs

While both roles focus on system stability, Remote Chaos Engineering specializes in testing system resilience through chaos experiments, whereas Remote Site Reliability Engineers focus on maintaining overall system reliability and performance. Both roles require scripting skills and cloud knowledge, but their core objectives differ: one proactively tests, the other maintains system health.

More about Remote Chaos Engineering jobs
What cities are hiring for Remote Chaos Engineering jobs? Cities with the most Remote Chaos Engineering job openings:
What are the most commonly searched types of Chaos Engineering jobs? The most popular types of Chaos Engineering jobs are:
What states have the most Remote Chaos Engineering jobs? States with the most job openings for Remote Chaos Engineering jobs include:
What job categories do people searching Remote Chaos Engineering jobs look for? The top searched job categories for Remote Chaos Engineering jobs are:
Infographic showing various Remote Chaos Engineering job openings in the United States as of May 2026, with employment types broken down into 90% Full Time, 6% Part Time, 3% Contract, and 1% Nights. Highlights an 89% Physical, 3% Hybrid, and 8% Remote job distribution, with an average salary of $194,709 per year, or $93.6 per hour.
Senior Reliability Engineer (Remote)

Senior Reliability Engineer (Remote)

Kohl's

Remote

Full-time

Posted 2 days ago


Kohl's rating

5.8

Company rating: 5.8 out of 10

Based on 1,435 frontline employees who took The Breakroom Quiz

12th of 21 rated department stores


Job description

Job Summary:
Kohl's is seeking a Senior Reliability Engineer to ensure the resilience and availability of their systems and applications. The role involves collaborating with development teams, conducting risk assessments, implementing monitoring mechanisms, and driving operational excellence through automation.
Responsibilities:
• Drive error budget and Service Level Objective (SLO) adoption across products
• Drive incident response efforts, perform root cause analysis and implement preventative measures to enhance system reliability
• Establish consistent practices that elevate Kohl’s operational excellence through automation and process improvements
• Follow software lifecycle and drive reliability, observability, and efficiency across product teams within an assigned domain
• Identify repeated toil and find opportunities for automation and risk reduction
• On-call on a rotation to respond to production incidents and conduct blameless retros and root-cause analyses (RCAs) to drive a culture of continuous improvements
• Proactively identifies failures before they cause outages using chaos engineering techniques such as edge cases, failure modes and design review
• Advise on capacity planning and provide continuous assessments on systems behavior and consumption
• Work with product managers to identify and prioritize work for reliability best practices (i.e., leveraging SLIs/SLOs/Error Budgets)
• Mentors and assists engineers on the team
• Additional tasks may be assigned
Qualifications:
Required:
• Bachelor's Degree or equivalent in MIS, Computer Science or related field
• 4+ years of experience in software development
• Strong programming skills in one or more languages (Java, Python, Go or Node.js)
• In-depth knowledge of systems architecture, operating system internals and network fundamentals
• In-depth knowledge of application design patterns, event-driven architecture, database schemas, and testing strategies
• Experience with multi-region application troubleshooting and performance tuning
• Working experience with one cloud platform (GCP, AWS, or Azure)
• Working experience with monitoring techniques and tools (e.g., CloudWatch, Grafana, Prometheus, OpenTelemetry, Tracing)
Preferred:
• In-depth knowledge of containerization and container orchestration (e.g., Docker, Kubernetes, Rancher)
• Experience with one or more configuration management systems (e.g., Chef, Ansible, Puppet)
• Passion for and experience with AI and ML methodologies (MLOps)
• Experience writing Infrastructure as code (e.g., Terraform, OpenTofu)
Company:
Kohl’s is a leading omnichannel retailer with more than 1,100 stores in 49 states. Founded in 1988, the company is headquartered in Menomonee Falls, USA, with a team of 10001+ employees. The company is currently Late Stage.

What Kohl's employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom