The role involves building scalable systems for training large generative models, implementing reinforcement learning methods, and shipping deep learning solutions to enhance self-driving behaviors.
The role involves building scalable systems for training large generative models, implementing reinforcement learning methods, and shipping deep learning solutions to enhance self-driving behaviors.
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
Research Engineer, Code RL (Reinforcement Learning)
San Francisco, CA ยท On-site +1
$241K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Research Engineer, Code RL (Reinforcement Learning)
San Francisco, CA ยท On-site +1
$241K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Software Developer
Colorado Springs, CO ยท On-site
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
Software Developer
Colorado Springs, CO ยท On-site
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
$142K - $188K/yr
... deep expertise, complex infrastructure, and significant experimentation. We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
$142K - $188K/yr
... deep expertise, complex infrastructure, and significant experimentation. We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
Bellevue, WA ยท On-site
$142K - $188K/yr
... deep expertise, complex infrastructure, and significant experimentation. We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
Bellevue, WA ยท On-site
$142K - $188K/yr
... deep expertise, complex infrastructure, and significant experimentation. We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers ...
Machine Learning Engineer
Laurel, MD ยท On-site
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Machine Learning Engineer
Laurel, MD ยท On-site
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA ยท On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA ยท On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
Software Developer
Colorado Springs, CO ยท On-site
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
Software Developer
Colorado Springs, CO ยท On-site
The selected candidate will drive the design, development, and integration of innovative Artificial Intelligence (AI) and Deep Reinforcement Learning (DRL) capabilities into defense and mission ...
Research Engineer, Code RL (Reinforcement Learning)
San Francisco, CA ยท On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Research Engineer, Code RL (Reinforcement Learning)
San Francisco, CA ยท On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Develop and integrate AI agent and/or deep reinforcement learning (DRL) frameworks for decision-making, workflow automation, or adaptive optimization in video-based applications * Stay current with ...
Quick apply
Develop and integrate AI agent and/or deep reinforcement learning (DRL) frameworks for decision-making, workflow automation, or adaptive optimization in video-based applications * Stay current with ...
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Work with technologies and concepts at the cutting edge of AI, including but not limited to: deep reinforcement learning, foundation models, large language models, convolutional/recurrent/graph ...
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA ยท On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA ยท On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
We are building next-generation end-to-end autonomous driving systems powered by reinforcement ... Proficiency in deep learning frameworks such as PyTorch * Experience with distributed training ...
We are building next-generation end-to-end autonomous driving systems powered by reinforcement ... Proficiency in deep learning frameworks such as PyTorch * Experience with distributed training ...
AI Research Scientist, Reinforcement Learning Responsibilities: * Explore and develop novel post ... Experience in C/C++ and Python and deep learning frameworks (e.g., PyTorch, TensorFlow) * Must ...
AI Research Scientist, Reinforcement Learning Responsibilities: * Explore and develop novel post ... Experience in C/C++ and Python and deep learning frameworks (e.g., PyTorch, TensorFlow) * Must ...
Applied Reinforcement Learning Engineer Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote ... This role requires deep expertise in both classical RL methodologies and modern LLM-based agent ...
Applied Reinforcement Learning Engineer Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote ... This role requires deep expertise in both classical RL methodologies and modern LLM-based agent ...
Deep reinforcement learning We are looking for self-motivated, top-notch researchers to join us! * Passion to conduct innovative and hands-on research on robotics; * A strong track record on ...
Deep reinforcement learning We are looking for self-motivated, top-notch researchers to join us! * Passion to conduct innovative and hands-on research on robotics; * A strong track record on ...
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA ยท On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
Quick apply
Senior Machine Learning Engineer, Reinforcement Learning - Egofold
Beverly Hills, CA ยท On-site +1
$150K - $185K/yr
T-shaped capability: deep machine learning expertise plus practical range across one or more ... Experience with reinforcement learning methods such as PPO, SAC, DQN, actor-critic, or related ...
Machine Learning Engineer: Imitation and Reinforcement Learning for Robotics
San Francisco, CA ยท On-site
... with Deep Learning frameworks, such as PyTorch/Tensorflow/JAX to solve real-world problems * 3+ ... Practical experience in behavior cloning and/or reinforcement learning * Bonus: Experience with ...
Machine Learning Engineer: Imitation and Reinforcement Learning for Robotics
San Francisco, CA ยท On-site
... with Deep Learning frameworks, such as PyTorch/Tensorflow/JAX to solve real-world problems * 3+ ... Practical experience in behavior cloning and/or reinforcement learning * Bonus: Experience with ...
Deep Reinforcement Learning information
See salary details
$28.5K - $33.2K
2% of jobs
$33.2K - $37.9K
3% of jobs
$37.9K - $42.5K
6% of jobs
$42.5K - $47.2K
3% of jobs
$51.4K is the 25th percentile. Wages below this are outliers.
$47.2K - $51.9K
12% of jobs
The median wage is $56.3K / yr.
$51.9K - $56.6K
25% of jobs
$56.6K - $61.3K
12% of jobs
$61.3K - $66K
9% of jobs
$66.9K is the 75th percentile. Wages above this are outliers.
$66K - $70.6K
12% of jobs
$70.6K - $75.3K
11% of jobs
$75.3K - $80K
5% of jobs
$28.5K
$58.3K
$80K
How much do deep reinforcement learning jobs pay per year?
What does a typical day look like for someone working in Deep Reinforcement Learning?
A typical day for a Deep Reinforcement Learning professional involves designing algorithms, running experiments, analyzing results, and optimizing models to improve performance. You may collaborate regularly with data scientists, software engineers, and domain experts to integrate RL solutions into larger systems or products. Tasks often include reading the latest research, contributing to code reviews, and documenting findings while troubleshooting technical challenges. This dynamic environment encourages continuous learning and teamwork, ensuring you stay at the forefront of AI innovation.
What is a Deep Reinforcement Learning job?
A Deep Reinforcement Learning (DRL) job involves researching, developing, and applying AI models that use reinforcement learning techniques combined with deep learning. Professionals in this role design algorithms that enable agents to learn optimal decision-making policies through trial and error. Common applications include robotics, game AI, autonomous systems, and financial modeling. This job typically requires expertise in machine learning, neural networks, and programming languages like Python, along with frameworks such as TensorFlow or PyTorch.
What are the key skills and qualifications needed to thrive in the Deep Reinforcement Learning position, and why are they important?
To thrive in Deep Reinforcement Learning, you need expertise in machine learning, programming (Python, TensorFlow, or PyTorch), and applied mathematics, often supported by an advanced degree in computer science or a related field. Familiarity with version control systems, cloud computing platforms, and relevant certifications in AI or data science are valuable assets. Strong problem-solving abilities, collaboration, and effective communication are important soft skills in this position. These skills are essential for developing, implementing, and iterating cutting-edge algorithms that solve complex real-world problems in dynamic environments.
- Junior Machine Learning
- Graduate Machine Learning
- Machine Learning Intern Remote
- Artificial Intelligence Machine Learning Physics
- Learning Specialist Athletics
- Freelance Artificial Intelligence Machine Learning
- Math Learning Specialist
- Machine Learning Internship Microsoft
- Remote Machine Learning
- Physics Informed Machine Learning

Full-time
Posted 6 days ago
Job description
Pony.ai is a global leader in autonomous mobility, recognized for its innovative technologies and services in the field. The role involves building scalable systems for training large generative models, implementing reinforcement learning methods, and shipping deep learning solutions to enhance self-driving behaviors.
Responsibilities:
โข Build scalable systems for training and fine-tuning large generative models that produce realistic, informative driving behaviors for evaluation and scenario coverage.
โข Implement and iterate on RL-style methods: algorithms, reward / preference objectives, and training setups suited to high-fidelity, insightful behaviors in simulation-aligned workflows (closed-loop evaluation mindset).
โข Ship deep learning solutions (including LLM / VLM where appropriate) that improve human-led triaging, automate high-volume workflows, and support nuanced analysis of self-driving behavior to surface critical anomalies.
โข Own production-oriented ML for fleet-scale assessment: training, optimization, monitoring, and iteration of models used to judge performance across large real-world exposure.
โข Design and evolve data + evaluation systems inspired by RL from human preferences (RLHF) and related paradigmsโturning preference/judgment signals into repeatable, scalable training and evaluation loops.
โข Partner broadly with teams such as Prediction, Planning, Research, and platform/engineering leads to land cross-cutting improvements with clear metrics.
Qualifications:
Required:
โข M.S. or Ph.D. in Computer Science, Machine Learning, AI, or a related fieldโor equivalent practical experience.
โข Hands-on experience building and applying ML in production-grade settings, with a strong RL component (policy learning, preference/feedback optimization, or offline/online RL pipelines).
โข Depth in deep learning, sequence modeling, and generative models.
โข Demonstrated impact via strong publications or a clear history of shipping impactful ML systems end-to-end.
โข Experience with large-scale distributed training and large-scale data processing.
โข Ability to lead ambiguous technical work from problem framing through reliable delivery.
Preferred:
โข Background in autonomous vehicles, robotics, or complex simulation environments.
โข Strong grasp of modern RL and post-training techniques in LLM, dLLM, VLA and video generations.
โข Hands-on integration of simulation platforms with ML training and evaluation workflows.
โข Python fluency and frameworks such as PyTorch.
โข Experience defining and operating metrics for complex, safety-critical AI systems.
โข Technical leadership: influencing stakeholders, aligning teams, and raising the bar for evaluation rigor.
โข Excellent communicationโsimple explanations of complex trade-offs.
Company:
Pony.ai develops autonomous driving technology for vehicles that operates using artificial intelligence and machine learning. Founded in 2016, the company is headquartered in Fremont, USA, with a team of 1001-5000 employees. The company is currently Late Stage.
About pony.ai
Sourced by ZipRecruiter
Industry
It services
Company size
51 - 200 Employees
Headquarters location
Fremont, CA, US
Year founded
2016