1

Resilience Engineer Jobs (NOW HIRING)

We're looking for a Staff Cyber Resilience Engineer to lead our defense against the attacks that matter most: ransomware, destructive wipes, and data loss at scale. This is a hands-on technical ...

We're looking for a Staff Cyber Resilience Engineer to lead our defense against the attacks that matter most: ransomware, destructive wipes, and data loss at scale. This is a hands-on technical ...

next page

Showing results 1-20

Resilience Engineer information

See salary details

$50.5K

$110.7K

$152K

How much do resilience engineer jobs pay per year?

As of Jun 7, 2026, the average yearly pay for resilience engineer in the United States is $110,698.00, according to ZipRecruiter salary data. Most workers in this role earn between $84,000.00 and $135,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Resilience Engineer, and why are they important?

Thriving as a Resilience Engineer requires strong knowledge of systems engineering, reliability analysis, and incident response, often supported by a degree in computer science or a related field. Familiarity with infrastructure monitoring tools, cloud platforms, chaos engineering frameworks, and certifications like AWS Certified Solutions Architect or Certified Reliability Engineer are typically valuable. Excellent problem-solving skills, collaboration, and clear communication help individuals excel at anticipating risks and working with cross-functional teams. These skills are crucial for minimizing downtime, ensuring system robustness, and maintaining business continuity in complex technical environments.

What is a Resilience Engineer?

A Resilience Engineer is a professional responsible for ensuring that systems and organizations can withstand, recover from, and adapt to disruptions, failures, or unexpected changes. They analyze existing infrastructure, identify potential risks or vulnerabilities, and implement strategies to improve reliability and robustness. Resilience Engineers often work closely with development, operations, and security teams to design systems that continue functioning smoothly during incidents. Their ultimate goal is to minimize downtime and data loss, ensuring business continuity and a positive user experience.

What is the difference between Resilience Engineer vs Reliability Engineer?

AspectResilience EngineerReliability Engineer
CertificationsISO 22301, Business Continuity certificationsASQ Reliability Engineer, Six Sigma
Work EnvironmentIT systems, infrastructure, complex systemsManufacturing, industrial systems, equipment
Industry UsageTechnology, finance, critical infrastructureManufacturing, aerospace, energy
FocusBuilding resilience against disruptions and failuresEnsuring systems operate reliably over time

Resilience Engineers focus on designing systems and processes to withstand and recover from disruptions, emphasizing overall system robustness. Reliability Engineers concentrate on maintaining consistent system performance and minimizing failures. While both roles aim to improve system stability, Resilience Engineers have a broader scope including disaster recovery and business continuity, whereas Reliability Engineers focus more on product and system reliability metrics.

How does a Resilience Engineer typically collaborate with other teams during incident response?

Resilience Engineers play a crucial role in incident response by working closely with development, operations, and security teams to identify the root causes of system failures and implement solutions that improve system reliability. During incidents, they often facilitate blameless post-mortems, coordinate communication between stakeholders, and drive the adoption of best practices to prevent future issues. This collaborative approach ensures that knowledge is shared across teams and that long-term improvements are made to the organization's infrastructure.
What states have the most Resilience Engineer jobs? States with the most job openings for Resilience Engineer jobs include:
Infographic showing various Resilience Engineer job openings in the United States as of May 2026, with employment types broken down into 100% Full Time. Highlights an 100% In-person job distribution, with an average salary of $110,698 per year, or $53.2 per hour.
Staff Cyber Resilience Engineer

Staff Cyber Resilience Engineer

Xometry

Denver, CO โ€ข On-site

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 15 days ago


Job description

Xometry (NASDAQ: XMTR) powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry's digital marketplace gives manufacturers the critical resources they need to grow their business while also making it easy for buyers at Fortune 1000 companies to tap into global manufacturing capacity.
We're looking for a Staff Cyber Resilience Engineer to lead our defense against the attacks that matter most: ransomware, destructive wipes, and data loss at scale. This is a hands-on technical leadership role. You will own the design and engineering of our Isolated Recovery Environment, set the standard for Infrastructure as Code across the organization, and ensure that if our AWS environment is ever compromised, we can restore operations with certainty and speed.
You will work with a high-caliber engineering team, have direct influence on our security architecture, and lead recovery exercises that test the organization end-to-end.
What You'll Do
Own Our Recovery Architecture
  • Design and build our Isolated Recovery Environment - a hardened AWS account with immutable vaults that break the attacker's kill chain before it reaches our data.
  • Threat model our environment with a deep understanding of cloud-native attack patterns: IAM privilege escalation, backup deletion, ransomware persistence, and lateral movement across accounts.
  • Validate and continuously improve backup configurations to ensure recoverability, not just existence.

Standardize and Automate Infrastructure
  • Lead our transition to 100% Infrastructure as Code. Every asset (VPCs, IAM roles, security groups) must be defined in Terraform so we can redeploy the entire stack into a clean account via automated pipeline.
  • Build automated recovery workflows that can tear down a compromised environment and bootstrap a fresh, hardened one from verified code and clean data.
  • Write and maintain executable recovery playbooks that detail the exact API calls and CLI commands needed to restore the application - tested, versioned, and runnable, not static documents.

Validate, Test, and Lead Exercises
  • Develop automated scripts (Python or Go) to smoke test recovered data and validate integrity post-restoration.
  • Lead regular hands-on recovery drills that simulate total loss of a critical environment and full recovery into a secondary clean account. Own the after-action process and drive improvements.

Drive Engineering Standards
  • Act as the resilience authority for the engineering organization - shaping high-availability architecture decisions, influencing design reviews, and raising the floor on how we think about recoverability.
  • Partner with the Site Reliability Engineering team on multi-region deployments and high-availability design, ensuring cyber resilience is embedded in architecture from the start.
  • Champion IaC and immutable infrastructure practices across teams, not just within your own workstream.
What You Bring
Required
  • 8+ years of experience in complex cloud environments (any of AWS/GCP/Azure), including at least 3 years in AWS. EKS/Kubernetes experience is a strong plus.
  • Strong Terraform skills. You should be able to modularize complex environments so they are environment-agnostic.
  • Hands-on familiarity with the Secure Vault pattern: protecting data in a separate, highly restricted AWS account with tight network controls.
  • Advanced shell scripting and proficiency in either Python or Go to automate restoration tasks that native AWS tooling doesn't cover.
  • Experience with CI/CD tooling (Scalr, GitHub Actions, or equivalent) to enable broad adoption of recovery pipelines across the organization.
  • Proven ability to engineer and automate end-to-end restoration workflows.

Preferred
  • Hands-on experience leading technical recovery efforts from an actual cyber attack or destructive incident.
  • Experience with chaos engineering tooling to stress-test recovery assumptions.
  • Familiarity with NIST SP 800-34 (Contingency Planning) or similar frameworks.
  • AWS Security Specialty certification or equivalent demonstrated expertise.

The estimated base salary range for new hires into this role is $205,000- $233,000 annually + annual bonus depending on factors such as job-related skills, relevant experience, and location. We also offer a competitive benefits package, including 401(k) match, medical, dental and vision insurance; life and disability insurance; generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave; EAP, other wellbeing resources; and much more.
#LI-Hybrid
Xometry is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.
For US based roles: Xometry participates in E-Verify and after a job offer is accepted, will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.

Xometry logo

About Xometry

Sourced by ZipRecruiter

Xometry (NASDAQ: XMTR) powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry's digital marketplace gives manufacturers the critical resources they need to grow their business while also making it easy for buyers at Fortune 1000 companies to tap into global manufacturing capacity.

Industry

Software development

Company size

501 - 1,000 Employees

Headquarters location

Gaithersburg, MD, US

Year founded

2013