Reinforcement Learning Engineer
New York, NY · On-site
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
The role involves building scalable systems for training large generative models, implementing reinforcement learning methods, and shipping deep learning solutions to enhance self-driving behaviors.
Research Engineer, Code RL (Reinforcement Learning)
San Francisco, CA · On-site +1
$241K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Software Developer
Colorado Springs, CO · On-site
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
AI Researcher
Manhattan, NY · On-site
... with deep reinforcement learning in any context (autonomous vehicles, robotics, or LLMs) • Experience working with data generated by human experts for model training • Financial services ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
$142K - $188K/yr
... deep expertise, complex infrastructure, and significant experimentation. We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers ...
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Machine Learning Engineer
Laurel, MD · On-site
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
Bellevue, WA · On-site
$142K - $188K/yr
... deep expertise, complex infrastructure, and significant experimentation. We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers ...
Research Engineer, Code RL (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA · On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
Develop and integrate AI agent and/or deep reinforcement learning (DRL) frameworks for decision-making, workflow automation, or adaptive optimization in video-based applications * Stay current with ...
We are building next-generation end-to-end autonomous driving systems powered by reinforcement ... Proficiency in deep learning frameworks such as PyTorch * Experience with distributed training ...
Applied Reinforcement Learning Engineer Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote ... This role requires deep expertise in both classical RL methodologies and modern LLM-based agent ...
Internship Deep Reinforcement Learning information

| Hourly wage | Share of jobs |
|---|---|
| $8.89 - $10.29 | 3% |
| $10.29 - $11.69 | 3% |
| $11.69 - $13.09 | 3% |
| $13.09 - $14.49 | 9% |
| $14.49 - $15.89 | 21% |
| $15.89 - $17.29 | 26% |
| $17.29 - $18.68 | 13% |
| $18.68 - $20.08 | 12% |
| $20.08 - $21.48 | 4% |
| $21.48 - $22.88 | 3% |
| $22.88 - $24.28 | 3% |

$14.94 is the 25th percentile. Wages below this are outliers.
The median wage is $16.47 / hr.
$18.39 is the 75th percentile. Wages above this are outliers.
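The quoted percentiles can be roughly cross-checked against the wage bins above. The sketch below is an illustration only, not the site's method. It assumes wages are spread evenly within each bin and interpolates linearly. `BINS` and `estimate_percentile` are hypothetical names introduced for this example, and the rounded percentages make the result approximate.

```python
# Hourly wage bins (low, high, percent of jobs), copied from the listing above.
BINS = [
    (8.89, 10.29, 3),
    (10.29, 11.69, 3),
    (11.69, 13.09, 3),
    (13.09, 14.49, 9),
    (14.49, 15.89, 21),
    (15.89, 17.29, 26),
    (17.29, 18.68, 13),
    (18.68, 20.08, 12),
    (20.08, 21.48, 4),
    (21.48, 22.88, 3),
    (22.88, 24.28, 3),
]


def estimate_percentile(bins, p):
    """Estimate the p-th percentile, assuming an even spread inside each bin."""
    total = sum(share for _, _, share in bins)
    target = total * p / 100
    cumulative = 0
    for low, high, share in bins:
        if cumulative + share >= target:
            # Fraction of this bin needed to reach the target share.
            fraction = (target - cumulative) / share
            return low + fraction * (high - low)
        cumulative += share
    return bins[-1][1]  # Fallback: upper edge of the last bin.


if __name__ == "__main__":
    for p in (25, 50, 75):
        print(f"{p}th percentile is roughly ${estimate_percentile(BINS, p):.2f}/hr")
```

Under these assumptions the estimates land within about five cents of the stated $14.94, $16.47, and $18.39 figures, so the bins and the quoted percentiles are mutually consistent.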
How much do internship deep reinforcement learning jobs pay per hour?
What types of projects or tasks can I expect to work on during a Deep Reinforcement Learning internship?
What is an internship in Deep Reinforcement Learning?
What are the key skills and qualifications needed to thrive as an Intern in Deep Reinforcement Learning, and why are they important?
What is the difference between Internship Deep Reinforcement Learning vs Data Science Intern?
| Aspect | Internship Deep Reinforcement Learning | Data Science Intern |
|---|---|---|
| Required Skills | Machine learning, programming (Python), reinforcement learning concepts | Statistics, data analysis, programming (Python/R), data visualization |
| Work Environment | Research labs, AI companies, tech startups | Business analytics, tech firms, consulting agencies |
| Industry Usage | AI research, robotics, autonomous systems | Business intelligence, marketing, finance |
A Deep Reinforcement Learning internship focuses on developing algorithms that enable systems to learn through trial and error, often in AI research or robotics. A Data Science internship involves analyzing data to extract insights and support decision-making. Both roles require programming skills, but reinforcement learning emphasizes AI-specific techniques, whereas data science centers on statistical analysis and data visualization.

Job description
Our Helix team is responsible for developing the core AI systems that power humanoid autonomy. We are looking for a Helix AI Engineer, Reinforcement Learning to develop learning systems that enable robots to acquire skills through interaction, feedback, and experience.
This role focuses on applying and advancing reinforcement learning across simulation and real-world environments, improving policy performance, robustness, and long-horizon decision-making in embodied systems.
Responsibilities
- Design and implement reinforcement learning algorithms for embodied agents operating in real-world and simulated environments
- Train policies that learn from interaction, feedback, and large-scale experience across diverse tasks
- Develop reward modeling, credit assignment, and exploration strategies for complex, long-horizon behaviors
- Improve policy robustness to real-world challenges such as noise, partial observability, and environment variability
- Work across online and offline RL settings, including learning from large-scale logged robot data
- Collaborate closely with pretraining, video, generative, agent, and robot learning teams to integrate RL into the full autonomy stack
- Build scalable training systems for RL, including distributed rollouts, simulation infrastructure, and experiment management
- Design evaluation frameworks to measure policy performance, stability, and generalization
Requirements
- Experience developing and applying reinforcement learning algorithms in complex environments
- Strong understanding of RL fundamentals (e.g., policy optimization, value methods, model-based RL)
- Experience training policies in simulation and/or real-world systems
- Proficiency in Python and deep learning frameworks such as PyTorch
- Experience with large-scale experimentation and distributed training systems
- Strong experimental rigor and ability to diagnose and improve learning systems
- Solid software engineering skills and ability to build scalable, reliable systems
- Ability to operate independently and drive ambiguous, high-impact technical problems
Bonus qualifications
- Experience applying RL to robotics, control systems, or embodied AI
- Experience with large-scale RL infrastructure (distributed rollouts, simulation at scale)
- Background in offline RL, imitation learning, or hybrid learning approaches
- Experience with reward modeling or human-in-the-loop learning
- Experience at leading AI labs such as OpenAI, Google DeepMind, Anthropic, or xAI
- Familiarity with robotics systems, simulation environments, or real-world deployment constraints
- Publication record in reinforcement learning, machine learning, or robotics
The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.