Interpretability AI Jobs (NOW HIRING)

Research Scientist, AI Controls and Monitoring

Manhattan, NY · On-site

... interpretability, debate). • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. Company: Scale's mission is to develop reliable AI systems for the ...

UMD

Postdoctoral Associate - AI Security

College Park, MD · Hybrid

Apply interpretability techniques (e.g., circuit analysis, feature attribution, sparse autoencoders ... Adversarial AI & Red Teaming Adaptive, multi-turn attacks and reasoning-based adversarial ...

New York University

[EOI] Postdoctoral Researcher at Polymathic AI: Generalization and Transfer in Scientific AI

New York, NY · On-site

$62K - $125K/yr

... of the Polymathic AI initiative, focused on generalization and transfer across scientific ... interpretability of scientific foundation models. The rush to build foundation models has led to ...

FAR.AI

$170K - $270K/yr

... AI, from demonstrating superhuman systems can be vulnerable, to scaling laws for robustness and jailbreaking constitutional classifiers. Mechanistic Interpretability: finding issues with Sparse ...

University of Maryland

Postdoctoral Associate - AI Security

College Park, MD · On-site

... interpretability. Key Responsibilities • Conduct original research in AI security, including adversarial machine learning, model robustness, and secure AI system design. • Develop and evaluate ...

Ataraxis AI

Research Scientist

New York, NY · On-site

$120K - $210K/yr

About Ataraxis AI Ataraxis is a clinical AI research lab working at the intersection of multi-modal ... interpretability, computational pathology

Senior AI/ML Architect - AI Program

Rochester, MN · On-site +1

$100K - $128K/yr

Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems, and computer ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...

Senior AI/ML Architect - AI Program

Rochester, MN · On-site

... interpretability, and the integration of diverse healthcare data modalities. As part of our ... our AI/ML Architect job family, which includes AI/ML Architect, Senior AI/ML Architect, and ...

Senior AI/ML Architect - AI Program

Rochester, MN · On-site +1

$68.25 - $91.50/hr

Responsibilities Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...

$68.75 - $92/hr

Virtusa Corporation

AI Architect

Las Vegas, NV · On-site

$60.50 - $79.75/hr

Define and implement best practices for agentic AI development, including governance, security, safety, interpretability, and performance monitoring. * Research and evaluate emerging technologies in ...

Spear AI

Create governance processes for AI model lifecycle management * Implement model interpretability and explainability solutions * Establish metrics and monitoring systems for RAI compliance * Lead ...

Product Marketing Lead

San Francisco, CA · On-site

About Goodfire Goodfire is a research company using interpretability to understand, learn from, and design AI systems. Our mission is to build the next generation of safe and powerful AI, not by ...

Field Team - Member of Technical Staff

San Francisco, CA · On-site

... AI systems. They are seeking a Member of Technical Staff to join their Field Team, responsible for integrating their interpretability platform into partner environments and ensuring technical ...

Machine Learning Engineer

Manhattan, NY · On-site

Goodfire is an AI research lab using interpretability to turn AI into something that can be understood, debugged, and shaped like software. Founded in 2024, the company is headquartered in San ...