Experience with red-teaming or adversarial testing of AI systems * Native mobile testing experience (iOS, Android) * Prior work with 21 CFR Part 11, GxP, or similar regulated-software validation ...
Quick apply
Experience with red-teaming or adversarial testing of AI systems * Native mobile testing experience (iOS, Android) * Prior work with 21 CFR Part 11, GxP, or similar regulated-software validation ...
Quick apply
Experience with red-teaming or adversarial testing of AI systems * Native mobile testing experience (iOS, Android) * Prior work with 21 CFR Part 11, GxP, or similar regulated-software validation ...
Experience with red-teaming or adversarial testing of AI systems * Native mobile testing experience (iOS, Android) * Prior work with 21 CFR Part 11, GxP, or similar regulated-software validation ...
Experience with red-teaming or adversarial testing of AI systems * Native mobile testing experience (iOS, Android) * Prior work with 21 CFR Part 11, GxP, or similar regulated-software validation ...
Define evaluation frameworks (quality, safety, latency, cost) and success metrics; implement prompt and model testing, red-teaming approaches, and ongoing performance monitoring * Serve as a ...
Define evaluation frameworks (quality, safety, latency, cost) and success metrics; implement prompt and model testing, red-teaming approaches, and ongoing performance monitoring * Serve as a ...
Experience with Red teaming, incident management will be a plus * Indepth knowledge of Agentic AI, security frameworks, risk management, and incident response. * Excellent communication and ...
Experience with Red teaming, incident management will be a plus * Indepth knowledge of Agentic AI, security frameworks, risk management, and incident response. * Excellent communication and ...

Full-time
Posted 9 days ago
We're looking for a Senior SDET who thinks deeply about quality in systems that are inherently non-deterministic. Agentic AI doesn't fail the same way traditional software does — and testing it requires a new toolkit: eval frameworks, prompt regression, tool-call reliability, adversarial scenarios, and more.
You'll own the entire quality infrastructure across our product portfolio — from test data and CI pipelines to the standards and culture of how we ship. You'll work directly with product, devops, and AI engineering, with no layers between your decisions and their impact.
What You'll OwnStrong foundation: Series A, top-tier investors, and a data asset (200M+ patient records) that most companies spend years trying to build
Sourced by ZipRecruiter
It services
1 - 10 Employees
Palo Alto, CA, US
2021