The Role We're looking for a Member of Technical Staff - Diffusion Models to help design and train the next generation of multimodal generative systems powering Moonlake's interactive world platform.
The Role We're looking for a Member of Technical Staff - Diffusion Models to help design and train the next generation of multimodal generative systems powering Moonlake's interactive world platform.
You will work on cutting-edge diffusion and flow-based models for image, video, and multimodal generation, pushing model quality, efficiency, and scalability. This role combines deep research ...
You will work on cutting-edge diffusion and flow-based models for image, video, and multimodal generation, pushing model quality, efficiency, and scalability. This role combines deep research ...
You will work on cutting-edge diffusion and flow-based models for image, video, and multimodal generation, pushing model quality, efficiency, and scalability. This role combines deep research ...
You will work on cutting-edge diffusion and flow-based models for image, video, and multimodal generation, pushing model quality, efficiency, and scalability. This role combines deep research ...
Modeling & architecture * Build and iterate on 2D/3D/image/video/audio diffusion architectures * Work on conditioning: text/image/pose/layout/control signals, multi-modal encoders, guidance ...
Modeling & architecture * Build and iterate on 2D/3D/image/video/audio diffusion architectures * Work on conditioning: text/image/pose/layout/control signals, multi-modal encoders, guidance ...
Benchmark diffusion models, vision systems, and generative workflows. * Validate model checkpoints and detect regressions across versions. * Develop evaluation metrics for realism, consistency, and ...
Benchmark diffusion models, vision systems, and generative workflows. * Validate model checkpoints and detect regressions across versions. * Develop evaluation metrics for realism, consistency, and ...
Benchmark diffusion models, vision systems, and generative workflows. * Validate model checkpoints and detect regressions across versions. * Develop evaluation metrics for realism, consistency, and ...
Benchmark diffusion models, vision systems, and generative workflows. * Validate model checkpoints and detect regressions across versions. * Develop evaluation metrics for realism, consistency, and ...
We are directly responsible for the on-device optimization and deployment of the Apple Intelligence LLM and diffusion models. As a Machine Learning Engineer, you will have the opportunity to be at ...
We are directly responsible for the on-device optimization and deployment of the Apple Intelligence LLM and diffusion models. As a Machine Learning Engineer, you will have the opportunity to be at ...
About the Institute of Foundation Models We are a dedicated research lab for building ... The Role As a member of the Diffusion LLM Team at MBZUAI, you will play a central role in designing ...
About the Institute of Foundation Models We are a dedicated research lab for building ... The Role As a member of the Diffusion LLM Team at MBZUAI, you will play a central role in designing ...
About the Institute of Foundation Models We are a dedicated research lab for building ... The Role As a member of the Diffusion LLM Team at MBZUAI, you will play a central role in designing ...
Quick apply
About the Institute of Foundation Models We are a dedicated research lab for building ... The Role As a member of the Diffusion LLM Team at MBZUAI, you will play a central role in designing ...
Member of Technical Staff (Agents & Diffusion) ($200k-$320k + Equity) at Roam
San Jose, CA ยท On-site
$200K - $320K/yr
Member of Technical Staff (Agents & Diffusion) Salary: $200k-$320k + Equity Company Description ... Roam - Venture-backed Applied AI lab building World Models for interactive 3D environments Join a ...
Member of Technical Staff (Agents & Diffusion) ($200k-$320k + Equity) at Roam
San Jose, CA ยท On-site
$200K - $320K/yr
Member of Technical Staff (Agents & Diffusion) Salary: $200k-$320k + Equity Company Description ... Roam - Venture-backed Applied AI lab building World Models for interactive 3D environments Join a ...
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video ... Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on ...
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video ... Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on ...
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video ... Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on ...
Quick apply
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video ... Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on ...
Experience in large-scale model training (LLMs or Diffusion Models) on large clusters. * Hands-on experience with state-of-the-art video generative models (e.g., Sora, Veo2, MovieGen, CogVideoX, etc.
Quick apply
Experience in large-scale model training (LLMs or Diffusion Models) on large clusters. * Hands-on experience with state-of-the-art video generative models (e.g., Sora, Veo2, MovieGen, CogVideoX, etc.
Research Scientist - World Modeling
Sunnyvale, CA ยท On-site
$150K - $450K/yr
Experience in large-scale model training (LLMs or Diffusion Models) on large clusters. * Hands-on experience with state-of-the-art video generative models (e.g., Sora, Veo2, MovieGen, CogVideoX, etc.
Research Scientist - World Modeling
Sunnyvale, CA ยท On-site
$150K - $450K/yr
Experience in large-scale model training (LLMs or Diffusion Models) on large clusters. * Hands-on experience with state-of-the-art video generative models (e.g., Sora, Veo2, MovieGen, CogVideoX, etc.
Real-time Video Researcher
Palo Alto, CA ยท On-site
$185K - $400K/yr
Work on diffusion model distillation and develop diffusion-based world models for video applications * Train and finetune autoregressive models and diffusion models with a focus on real-time ...
Real-time Video Researcher
Palo Alto, CA ยท On-site
$185K - $400K/yr
Work on diffusion model distillation and develop diffusion-based world models for video applications * Train and finetune autoregressive models and diffusion models with a focus on real-time ...
Forward Deployed Machine Learning Engineer
San Francisco, CA ยท On-site
$180K - $300K/yr
About Black Forest Labs We're the team behind Latent Diffusion, Stable Diffusion, and FLUX - foundational technologies that changed how the world creates images and video. Our models power the tools ...
Forward Deployed Machine Learning Engineer
San Francisco, CA ยท On-site
$180K - $300K/yr
About Black Forest Labs We're the team behind Latent Diffusion, Stable Diffusion, and FLUX - foundational technologies that changed how the world creates images and video. Our models power the tools ...
Forward Deployed Machine Learning Engineer
San Francisco, CA ยท On-site
$180K - $300K/yr
About Black Forest Labs We're the team behind Latent Diffusion, Stable Diffusion, and FLUX - foundational technologies that changed how the world creates images and video. Our models power the tools ...
Forward Deployed Machine Learning Engineer
San Francisco, CA ยท On-site
$180K - $300K/yr
About Black Forest Labs We're the team behind Latent Diffusion, Stable Diffusion, and FLUX - foundational technologies that changed how the world creates images and video. Our models power the tools ...
The work spans modern architectures such as diffusion models, transformers, and learned visual representations, with emphasis on controllability, compute efficiency, and production readiness. This ...
The work spans modern architectures such as diffusion models, transformers, and learned visual representations, with emphasis on controllability, compute efficiency, and production readiness. This ...
Research Engineer
New York, NY ยท On-site
$200K - $300K/yr
To do this we're developing cutting-edge diffusion models and designing novel, personalized interfaces. We're a small team of creative builders in NYC with a rare combination of taste and deep AI ...
Research Engineer
New York, NY ยท On-site
$200K - $300K/yr
To do this we're developing cutting-edge diffusion models and designing novel, personalized interfaces. We're a small team of creative builders in NYC with a rare combination of taste and deep AI ...
Responsibilities include designing and iterating on diffusion models and collaborating with project ... Required qualifications include extensive experience in PyTorch, expertise in deep learning model ...
New
Responsibilities include designing and iterating on diffusion models and collaborating with project ... Required qualifications include extensive experience in PyTorch, expertise in deep learning model ...
New
Diffusion Model information
See salary details
$30.05 - $36.08
22% of jobs
$36.34 is the 25th percentile. Wages below this are outliers.
$36.08 - $42.11
60% of jobs
$42.11 - $48.14
0% of jobs
$48.14 - $54.17
0% of jobs
$54.17 - $60.21
0% of jobs
$60.21 - $66.24
0% of jobs
$66.24 - $72.27
0% of jobs
$72.27 - $78.30
0% of jobs
$78.30 - $84.33
0% of jobs
$84.33 - $90.36
0% of jobs
$90.36 - $96.39
17% of jobs
$30
$52
$96
How much do diffusion model jobs pay per hour?
What are the key skills and qualifications needed to thrive as a Diffusion Model Engineer, and why are they important?
What are some common challenges faced by professionals working with diffusion models, and how can these be addressed?
What are diffusion models in machine learning?
What is the difference between Diffusion Model vs Data Scientist?
| Aspect | Diffusion Model | Data Scientist |
|---|---|---|
| Required Credentials | Typically a background in machine learning, statistics, or computer science | Degree in data science, statistics, computer science, or related fields |
| Work Environment | Research labs, AI development teams, tech companies | Business, tech firms, consulting, research institutions |
| Industry Usage | Used in AI image generation, generative modeling | Analyzing data, building predictive models, data visualization |
While both roles involve data and algorithms, a Diffusion Model focuses on developing generative AI models, whereas a Data Scientist analyzes data to inform business decisions. Understanding these differences helps in choosing the right career path or job focus.

Full-time
Posted 28 days ago
Job description
About Moonlake
Moonlake is building the frontier of interactive world models: systems that generate, simulate, and reason over 3D environments for embodied AI, robotics and gaming. We develop the simulation infrastructure to build worlds (e.g., assets, scenes, digital twins) at scale.
Our team sits at the intersection of:
- Embodied AI
- Robotics simulation
- Interactive 3D worlds
- World models
- Real-time generation
- AI infrastructure
Moonlake is building the next generation of AI infrastructure for interactive digital worlds. Our mission is to enable anyone to create, simulate, and interact with rich environments using natural language and multimodal inputs, turning simple ideas into worlds with structure, logic, and agents that can perceive and act.
Our team has raised $28M in seed funding from NVIDIA Ventures, Threshold Ventures, AIX ventures and notable angels including Naval Ravikant and Jeff Dean to build the foundational layer for the future of AI - powering everything from creative tools and games to robotics training, simulations, and digital twins. Our goal is to make building and experimenting with these environments as accessible and scalable as publishing video on the internet.
We are looking for exceptional research engineers and applied researchers to help push the frontier of interactive AI.
The Role
We're looking for a Member of Technical Staff - Diffusion Models to help design and train the next generation of multimodal generative systems powering Moonlake's interactive world platform.
This is a research-heavy role focused on:
- Diffusion architectures
- Video generation
- Conditioning systems
- Multimodal generation
- Control and personalization
- Large-scale training
The ideal candidate combines:
- Strong ML research fundamentals
- Practical systems intuition
- Experience training generative models at scale
- Deep curiosity around interactive world generation
This role has a very high technical bar. Successful candidates typically have:
- Published research
- Strong generative modeling experience
- Video generation or graphics-related experience
- Prior work on frontier multimodal systems
- Build and iterate on diffusion architectures across:
- 2D
- 3D
- Image
- Video
- Audio
- Develop conditioning and control systems for multimodal generation
- Improve generation quality, controllability, consistency, and efficiency
- Train large-scale generative models
- Build systems for editing, personalization, and controllable generation
- Collaborate closely with infrastructure, world-modeling, and product teams
- Push generation systems toward real-time and interactive applications
Modeling & Architecture
- Build and improve diffusion architectures
- Video diffusion systems
- Multimodal generation pipelines
- Latent-space modeling
- Real-time generation architectures
- Interactive generation systems
Conditioning & Multi-Modal Learning
- Text conditioning
- Image conditioning
- Pose/layout/control signals
- Multi-modal encoders
- Guidance strategies
- Structured generation control
Training & Optimization
- Large-scale diffusion training
- Distributed training systems
- Sample quality vs. compute optimization
- Distillation techniques
- Consistency models
- One-step generation systems
- Efficient generation pipelines
Control & Alignment
- ControlNet
- LoRA
- IP-Adapters
- Style / identity / geometry conditioning
- Editing pipelines
- Inpainting systems
- Personalization systems
- DreamBooth and custom tuning workflows
- Strong ML research background
- Deep understanding of diffusion models and generative architectures
- Experience training large-scale generative systems
- Strong grasp of optimization, scaling, and multimodal learning
- Ability to work across both research and implementation
- Strong engineering fundamentals
- Ability to iterate quickly in a fast-moving research environment
- Experience with 3D generation or world models
- Robotics simulation or embodied AI familiarity
- Interactive generation systems
- Real-time inference optimization
- Graphics or game-engine experience
- Experience building production-grade generation pipelines
Moonlake is not building static image generators.
The company is building systems capable of generating:
- Interactive worlds
- Dynamic simulations
- Controllable environments
- Real-time multimodal experiences
The diffusion stack is foundational to making these systems coherent, controllable, scalable, and interactive.
You'll help define the generation systems behind the next generation of world-model AI.
We are committed to being an on-site, in-person team currently based in San Francisco.