Experience with leveraging a chaos engineering vendor such as Gremlin, Harness, or something ... The majority of our roles are remote and you can work almost anywhere within the country of ...
Site Reliability Engineer
San Francisco, CA · Remote
$67.25 - $89.25/hr
Remote (US) Department: Cloud Platform Engineering / SRE/Reliability Position summary The Site ... Drive chaos engineering, game days, and reliability testing programs * Produce SLA performance ...
Mission Engineering at CHAOS turns threat and CONEMP-informed simulation output into product ... C. office; or Work full time off-site (remote, work from home) in Huntsville, AL. * Light ( Minimum ...
Chaos engineering. • Communication: Technical teaching. Influence without authority. Clear written communication. Stakeholder management. Preferred : • Developer platforms or CLI tools. • DORA ...
Chaos engineering. * Communication: Technical teaching. Influence without authority. Clear written communication. Stakeholder management. Nice-to-Haves * Developer platforms or CLI tools. DORA/SPACE ...
Director, Engineering
CA · Remote
Lead the organization from reactive firefighting to a predictive, self-healing culture through the aggressive adoption of Chaos Engineering and AIOps. Job Designation Remote: Employee is not required ...
USA-Remote Job type: Contract Design, deploy, and maintain Cisco PCCE (15.x or newer) in high ... and chaos engineering exercises. • Collaborate with business and clinical stakeholders.
Distinguished Engineer (Remote - Eligible)
McLean, VA · On-site +1
Deep practical knowledge of Site Reliability Engineering (SRE) principles, chaos engineering, and ... Remote (Regardless of Location): $244,700 - $279,200 for Distinguished Engineer Cambridge, MA: $269 ...
Remote Chaos Engineering information
| Annual salary range | Share of jobs |
|---|---|
| $73K - $89.5K | 3% |
| $89.5K - $105.9K | 5% |
| $105.9K - $122.4K | 6% |
| $122.4K - $138.8K | 9% |
| $138.8K - $155.3K | 11% |
| $155.3K - $171.7K | 7% |
| $171.7K - $188.2K | 7% |
| $188.2K - $204.6K | 6% |
| $204.6K - $221.1K | 3% |
| $221.1K - $237.5K | 1% |
| $237.5K - $254K | 40% |

$140.1K is the 25th percentile. Wages below this are outliers.
The median wage is $189.6K / yr.
$243.7K is the 75th percentile. Wages above this are outliers.
Chart axis markers: $73K, $194.7K, $254K.
What is the difference between Remote Chaos Engineering and Remote Site Reliability Engineer?
| Aspect | Remote Chaos Engineering | Remote Site Reliability Engineer |
|---|---|---|
| Primary Focus | Designing and executing chaos experiments to improve system resilience | Ensuring system reliability, availability, and performance through monitoring and automation |
| Skills & Certifications | Knowledge of chaos engineering tools, scripting, cloud platforms | Monitoring tools, scripting, cloud infrastructure, SRE certifications |
| Work Environment | Collaborates with development and operations teams, often in DevOps culture | Works closely with engineering teams to maintain system health and SLAs |
While both roles focus on system stability, Remote Chaos Engineering specializes in testing system resilience through chaos experiments, whereas Remote Site Reliability Engineers focus on maintaining overall system reliability and performance. Both roles require scripting skills and cloud knowledge, but their core objectives differ: one proactively tests, the other maintains system health.

Full-time
Medical, Dental, Vision
Posted 23 hours ago
Job description
We are seeking a seasoned Engineering Manager to lead our Resilience Engineering team. This role is critical in ensuring the safety and reliability of our production systems through proactive validation techniques, including production load testing and chaos engineering.
You will lead the development of systems and practices that allow engineers to safely test system behavior under stress and failure conditions in production, ensuring issues are discovered and mitigated before they impact real users.
What you'll do
Leadership & Strategy
- Define and drive the vision for resilience engineering at Affirm, with a focus on production load testing and chaos engineering as first-class engineering practices.
- Lead and mentor a team of engineers building platforms and tooling for safe production experimentation.
- Partner with infrastructure, product, and security leadership to embed resilience validation into the software development lifecycle.
- Establish best practices for safely testing system limits and failure scenarios in production.
Systems & Operations
- Own the design and evolution of platforms that enable safe, controlled production load testing and fault injection.
- Ensure strong safeguards are in place, including isolation boundaries, approval workflows, and automated rollback mechanisms to protect real users.
- Build systems that provide end-to-end observability, traceability, and auditability for all resilience experiments.
- Drive reliability improvements by systematically identifying weaknesses through load testing and chaos experiments.
- Establish monitoring, alerting, and incident response practices tailored to proactive resilience validation.
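As a rough illustration of the safeguards listed above (approval workflow, isolation boundary, automated rollback, audit trail), here is a minimal Python sketch. Every name in it (`Experiment`, `run_experiment`, `MAX_TARGET_FRACTION`, the 5% cap) is hypothetical and chosen for this example; it does not describe Affirm's platform or any vendor's API.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

# Assumed organization-wide cap on the share of traffic an experiment may touch.
MAX_TARGET_FRACTION = 0.05


@dataclass
class Experiment:
    """A hypothetical fault-injection experiment and its safety limits."""
    name: str
    approved_by: Optional[str]   # approval workflow: must be set before running
    target_fraction: float       # isolation boundary: share of traffic affected
    max_error_rate: float        # abort threshold checked during the run
    audit_log: List[str] = field(default_factory=list)


def run_experiment(
    exp: Experiment,
    inject: Callable[[], None],
    rollback: Callable[[], None],
    read_error_rate: Callable[[], float],
    checks: int = 5,
) -> str:
    """Run an experiment behind guardrails; return its final status."""
    if exp.approved_by is None:
        exp.audit_log.append("rejected: no approval")
        return "rejected"
    if exp.target_fraction > MAX_TARGET_FRACTION:
        exp.audit_log.append("rejected: blast radius too large")
        return "rejected"

    exp.audit_log.append(f"started (approved by {exp.approved_by})")
    inject()
    try:
        for _ in range(checks):
            rate = read_error_rate()
            if rate > exp.max_error_rate:
                exp.audit_log.append(f"aborted: error rate {rate:.2%}")
                return "aborted"
        exp.audit_log.append("completed")
        return "completed"
    finally:
        # Rollback always runs, whether the run completed, aborted, or raised.
        rollback()
        exp.audit_log.append("rolled back")


if __name__ == "__main__":
    state = {"fault_active": False}
    readings = iter([0.01, 0.02, 0.09])  # simulated error-rate samples
    exp = Experiment("latency-on-checkout", "alice", 0.01, 0.05)
    status = run_experiment(
        exp,
        inject=lambda: state.update(fault_active=True),
        rollback=lambda: state.update(fault_active=False),
        read_error_rate=lambda: next(readings),
    )
    print(status, state, exp.audit_log)
```

Placing the rollback in a `finally` block is one simple way to make sure the fault is removed on every exit path. A real platform would likely also need timeouts and an out-of-band kill switch.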
Collaboration & Enablement
- Work closely with engineering teams to design and execute production load tests and chaos experiments safely.
- Partner with infrastructure teams to build guardrails around tests and experiments.
- Enable teams to adopt resilience practices by providing reusable tooling, frameworks, and standardized workflows.
- Identify systemic weaknesses and lead cross-functional efforts to improve reliability and fault tolerance.
- Evangelize a culture of "test failure before failure tests you" across the organization.
What we look for
- Proven experience leading engineering teams in reliability, infrastructure, or distributed systems.
- Hands-on experience with production load testing, chaos engineering, or large-scale system validation.
- Experience using a chaos engineering vendor such as Gremlin, Harness, or a similar tool.
- Strong understanding of failure modes in distributed systems, including latency, partial failure, and cascading outages.
- Experience building or operating systems with strong safety guarantees (isolation, rate limiting, guardrails, auditability).
- Familiarity with cloud-native environments (AWS, Kubernetes) and observability tooling.
- Strong programming background (e.g., Python, Kotlin, Java, or similar).
- Excellent problem-solving skills and the ability to balance long-term resilience investments with immediate business needs.
- Strong communication and leadership skills, with a track record of influencing engineering practices across teams.
- This position requires either equivalent practical experience or a Bachelor's degree in a related field.
Base Pay Grade - P
Equity Grade - 13
Employees new to Affirm typically come in at the start of the pay range. Affirm focuses on providing a simple and transparent pay structure which is based on a variety of factors, including location, experience and job-related skills.
Base pay is part of a total compensation package that may include equity rewards, monthly stipends for health, wellness and tech spending, and benefits (including 100% subsidized medical coverage, dental and vision for you and your dependents).
USA base pay range (CA, WA, NY, NJ, CT) per year: $230,000 - $290,000
USA base pay range (all other U.S. states) per year: $204,000 - $264,000
Affirm is proud to be a remote-first company! The majority of our roles are remote and you can work almost anywhere within the country of employment. Affirmers in proximal roles have the flexibility to work remotely, but will occasionally be required to work out of their assigned Affirm office. A limited number of roles remain office-based due to the nature of their job responsibilities.
We're extremely proud to offer competitive benefits that are anchored to our core value of "people come first." Some key highlights of our benefits package include:
- Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents
- Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses
- Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge
- ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount
We believe It's On Us to provide an inclusive interview experience for all, including people with disabilities. We are happy to provide reasonable accommodations to candidates in need of individualized support during the hiring process.
[For U.S. positions that could be performed in Los Angeles or San Francisco] Pursuant to the San Francisco Fair Chance Ordinance and Los Angeles Fair Chance Initiative for Hiring Ordinance, Affirm will consider for employment qualified applicants with arrest and conviction records.
By clicking "Submit Application," you acknowledge that you have read Affirm's Global Candidate Privacy Notice and hereby freely and unambiguously give informed consent to the collection, processing, use, and storage of your personal information as described therein.
About Affirm
Sourced by ZipRecruiter
Industry
Finance and insurance
Company size
51 - 200 Employees
Headquarters location
San Francisco, CA, US
Year founded
2012