The hardest problems in both AI and biology are being solved here, and there is room for you to own ... You will work closely with the pretraining and generation teams, feeding interpretability findings ...
The hardest problems in both AI and biology are being solved here, and there is room for you to own ... You will work closely with the pretraining and generation teams, feeding interpretability findings ...
AI is an innovative company that empowers people to connect, learn, and tell stories through ... and interpretability. • Conduct adversarial testing to proactively uncover potential ...
AI is an innovative company that empowers people to connect, learn, and tell stories through ... and interpretability. • Conduct adversarial testing to proactively uncover potential ...
Research Engineer, AI Safety & Alignment
Redwood City, CA · On-site
$225K - $400K/yr
Experience with explainable AI (XAI) and interpretability techniques. * Have research in AI safety ... alignment, ethics , or a related area. * Knowledge of the broader societal and ethical implications ...
Research Engineer, AI Safety & Alignment
Redwood City, CA · On-site
$225K - $400K/yr
Experience with explainable AI (XAI) and interpretability techniques. * Have research in AI safety ... alignment, ethics , or a related area. * Knowledge of the broader societal and ethical implications ...
... interpretability, debate). • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. Company : Scale's mission is to develop reliable AI systems for the ...
... interpretability, debate). • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. Company : Scale's mission is to develop reliable AI systems for the ...
... interpretability, debate). • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. Company : Scale's mission is to develop reliable AI systems for the ...
... interpretability, debate). • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. Company : Scale's mission is to develop reliable AI systems for the ...
Apply interpretability techniques (e.g., circuit analysis, feature attribution, sparse autoencoders ... Adversarial AI & Red Teaming Adaptive, multi-turn attacks and reasoning-based adversarial ...
Apply interpretability techniques (e.g., circuit analysis, feature attribution, sparse autoencoders ... Adversarial AI & Red Teaming Adaptive, multi-turn attacks and reasoning-based adversarial ...
[EOI] Postdoctoral Researcher at Polymathic AI: Generalization and Transfer in Scientific AI
New York, NY · On-site
$62K - $125K/yr
... of the Polymathic AI initiative, focused on generalization and transfer across scientific ... interpretability of scientific foundation models. The rush to build foundation models has led to ...
[EOI] Postdoctoral Researcher at Polymathic AI: Generalization and Transfer in Scientific AI
New York, NY · On-site
$62K - $125K/yr
... of the Polymathic AI initiative, focused on generalization and transfer across scientific ... interpretability of scientific foundation models. The rush to build foundation models has led to ...
$170K - $270K/yr
... AI, from demonstrating superhuman systems can be vulnerable, to scaling laws for robustness and jailbreaking constitutional classifiers. Mechanistic Interpretability: finding issues with Sparse ...
$170K - $270K/yr
... AI, from demonstrating superhuman systems can be vulnerable, to scaling laws for robustness and jailbreaking constitutional classifiers. Mechanistic Interpretability: finding issues with Sparse ...
... interpretability. Key Responsibilities • Conduct original research in AI security, including adversarial machine learning, model robustness, and secure AI system design. • Develop and evaluate ...
... interpretability. Key Responsibilities • Conduct original research in AI security, including adversarial machine learning, model robustness, and secure AI system design. • Develop and evaluate ...
Research Scientist
New York, NY · On-site
$120K - $210K/yr
About Ataraxis AI Ataraxis is a clinical AI research lab working at the intersection of multi-modal ... interpretability, computational pathology}
Research Scientist
New York, NY · On-site
$120K - $210K/yr
About Ataraxis AI Ataraxis is a clinical AI research lab working at the intersection of multi-modal ... interpretability, computational pathology}
Senior AI/ML Architect - AI Program
Rochester, MN · On-site +1
$100K - $128K/yr
Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems, and computer ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
Senior AI/ML Architect - AI Program
Rochester, MN · On-site +1
$100K - $128K/yr
Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems, and computer ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
... interpretability, and the integration of diverse healthcare data modalities. As part of our ... our AI/ML Architect job family, which includes AI/ML Architect, Senior AI/ML Architect, and ...
... interpretability, and the integration of diverse healthcare data modalities. As part of our ... our AI/ML Architect job family, which includes AI/ML Architect, Senior AI/ML Architect, and ...
Senior AI/ML Architect - AI Program
Rochester, MN · On-site +1
$68.25 - $91.50/hr
Responsibilities Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
Senior AI/ML Architect - AI Program
Rochester, MN · On-site +1
$68.25 - $91.50/hr
Responsibilities Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
$68.75 - $92/hr
Responsibilities Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
$68.75 - $92/hr
Responsibilities Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
AI Architect
Las Vegas, NV · On-site
$60.50 - $79.75/hr
Define and implement best practices for agentic AI development, including governance, security, safety, interpretability, and performance monitoring. * Research and evaluate emerging technologies in ...
AI Architect
Las Vegas, NV · On-site
$60.50 - $79.75/hr
Define and implement best practices for agentic AI development, including governance, security, safety, interpretability, and performance monitoring. * Research and evaluate emerging technologies in ...
Create governance processes for AI model lifecycle management * Implement model interpretability and explainability solutions * Establish metrics and monitoring systems for RAI compliance * Lead ...
Create governance processes for AI model lifecycle management * Implement model interpretability and explainability solutions * Establish metrics and monitoring systems for RAI compliance * Lead ...
Senior AI/ML Architect - AI Program
Rochester, MN · On-site +1
$100K - $128K/yr
Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems, and computer ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
Senior AI/ML Architect - AI Program
Rochester, MN · On-site +1
$100K - $128K/yr
Senior AI/ML Architects at Mayo Clinic serve at the leading edge of data, systems, and computer ... interpretability, and the integration of diverse healthcare data modalities. As part of our ...
Product Marketing Lead
San Francisco, CA · On-site
About Goodfire Goodfire is a research company using interpretability to understand, learn from, and design AI systems. Our mission is to build the next generation of safe and powerful AI-not by ...
Product Marketing Lead
San Francisco, CA · On-site
About Goodfire Goodfire is a research company using interpretability to understand, learn from, and design AI systems. Our mission is to build the next generation of safe and powerful AI-not by ...
... AI systems. They are seeking a Member of Technical Staff to join their Field Team, responsible for integrating their interpretability platform into partner environments and ensuring technical ...
... AI systems. They are seeking a Member of Technical Staff to join their Field Team, responsible for integrating their interpretability platform into partner environments and ensuring technical ...
Machine Learning Engineer
Manhattan, NY · On-site
Goodfire is an AI research lab using interpretability to turn AI into something that can be understood, debugged, and shaped like software Founded in 2024, the company is headquartered in San ...
Machine Learning Engineer
Manhattan, NY · On-site
Goodfire is an AI research lab using interpretability to turn AI into something that can be understood, debugged, and shaped like software Founded in 2024, the company is headquartered in San ...
Interpretability Ai information
See salary details
$44.5K - $56.6K
0% of jobs
$56.6K - $68.7K
1% of jobs
$68.7K - $80.8K
3% of jobs
$80.8K - $92.9K
4% of jobs
$92.9K - $105K
7% of jobs
$115.5K is the 25th percentile. Wages below this are outliers.
$105K - $117K
11% of jobs
$117K - $129.1K
12% of jobs
The median wage is $132.4K / yr.
$129.1K - $141.2K
45% of jobs
$141.2K - $153.3K
8% of jobs
$153.3K - $165.4K
5% of jobs
$165.4K - $177.5K
3% of jobs
$44.5K
$129.7K
$177.5K
How much do interpretability ai jobs pay per year?
What is Interpretability in AI?
What is the difference between Interpretability Ai vs Data Scientist?
| Aspect | Interpretability Ai | Data Scientist |
|---|---|---|
| Required Credentials | Typically a background in AI, machine learning, or data analysis; often a master's or PhD in related fields | Degree in computer science, statistics, or related fields; often a master's or PhD |
| Work Environment | Research labs, AI development teams, tech companies focusing on explainable AI | Data analysis, modeling, and insights generation across various industries |
| Employer & Industry Usage | Tech firms, AI startups, research institutions | Finance, healthcare, tech, consulting, and more |
Interpretability Ai specialists focus on making AI models transparent and understandable, often working on explainability tools. Data Scientists analyze data, build models, and generate insights. While both roles require strong analytical skills, Interpretability Ai emphasizes explainability techniques, whereas Data Scientists focus on data analysis and modeling across diverse industries.
What are the key skills and qualifications needed to thrive as an AI Interpretability Specialist, and why are they important?
What are the main challenges faced when working in Interpretability AI roles, and how can professionals address them?
Full-time
Medical, Dental, Vision
Posted 25 days ago
Job description
Output is currently in stealth, operated by a team of repeat founders and biotech veterans with multiple exits in AI x Bio, and backed by top-tier VCs including Y Combinator.
You will continue developing methods to understand what our foundation model learns about biology, and build the tools that make it a glass box model. We believe that in biology, a model's reasoning must be visible. And the features you find are not just explanations: they expand what the model can do.
- You will continue developing our methods for probing and reverse-engineering the model's learned representations, understanding how it encodes biological information across molecular scales
- You will design and run experiments to identify and characterize capabilities, mapping what the model has learned about molecular interactions and biological function
- You will build methods to extract the model's biological understanding as explicit, usable outputs that downstream systems and researchers can act on
- You will create tools that connect model internals to meaningful biological concepts, making the model's reasoning interpretable to scientists and useful in practice
- You will work closely with the pretraining and generation teams, feeding interpretability findings back into model development to strengthen the capabilities you uncover
- You will own the full pipeline from probing experiments to production-quality interpretability tools, building robust systems on distributed infrastructure
About You
- You have a PhD in computer science, machine learning, physics, mathematics, or a related field with 2+ years of post-doctoral or industry research experience, or a Bachelor's or Master's degree with 5+ years of hands-on research and engineering experience in model interpretability or representation analysis
- You have a strong publication record at top-tier venues (e.g., NeurIPS, ICML, ICLR) with contributions to mechanistic interpretability, representation analysis, probing methods, or model understanding
- You have hands-on experience analyzing the internal representations of large neural networks, with demonstrated ability to design experiments that reveal what models have learned
- You are proficient in Python and PyTorch, and have experience working with large models on GPU infrastructure
- You have demonstrated the ability to take interpretability research from experiments to usable tools: you do not just analyze models, you build systems others can use
- You write production-quality code that is well-tested and maintainable, and you are comfortable working in shared codebases with version control and code review
- You think carefully about what constitutes evidence that a model has learned a concept, and you design experiments that distinguish real capabilities from artifacts
Bonus Points
- You have a background in chemistry, biology, computational biology, biophysics, or a related natural science
- You have experience interpreting ML models trained on scientific or biological data
- You have experience building visualization or analysis tools for model internals
- You have experience with multimodal models or representations that span multiple data types
- You have contributed to open-source machine learning projects
Our Values
♥ Heart: We foster a culture of ownership. We are assembling a team of individuals who are passionate and take pride in their contributions.
Excellence: We have an unwavering commitment to excellence and continuously challenge ourselves to reach the highest standards.
Practicality: We value practicality and results-oriented thinking. We are committed to making a tangible impact on the lives of patients and the broader community.
Honesty: We place a high value on honesty and directness. We firmly believe in addressing issues as they arise, in an open and transparent manner.
Fun: We believe that life is too short to not have fun. Our goal is to create a workplace that is fun, engaging, rewarding and fulfilling.
What We Offer
- We encourage new and different ideas, creativity and contrarian thinking
- Healthy feedback focused environment to help you strive - leadership will have high expectations, regularly share constructive feedback, support you and help you grow, and welcome receiving feedback and ideas from you
- You own your day-to-day management. What we care about is that we all hit our milestones
- Competitive salary and equity in a growing, well-funded startup
- Excellent medical, dental, and vision coverage