The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical planning, learning policies that generalize across novel driving scenarios, handle ...
The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical planning, learning policies that generalize across novel driving scenarios, handle ...
Machine Learning Engineer, Reinforcement Learning
San Mateo, CA · On-site
$100K - $300K/yr
Deep understanding and practical experience with various reinforcement learning algorithms and techniques (model-free, model-based, multi-task, hierarchical, multi-agent, etc.). * Strong background ...
Machine Learning Engineer, Reinforcement Learning
San Mateo, CA · On-site
$100K - $300K/yr
Deep understanding and practical experience with various reinforcement learning algorithms and techniques (model-free, model-based, multi-task, hierarchical, multi-agent, etc.). * Strong background ...
Deep understanding and practical experience with various reinforcement learning algorithms and techniques (model-free, model-based, multi-task, hierarchical, multi-agent, etc.). * Strong background ...
Deep understanding and practical experience with various reinforcement learning algorithms and techniques (model-free, model-based, multi-task, hierarchical, multi-agent, etc.). * Strong background ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale.
The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale.
Reinforcement Learning Expert Dexmate is building the foundation for physical AI -- a unified ... Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, JAX) * Strong ...
Reinforcement Learning Expert Dexmate is building the foundation for physical AI -- a unified ... Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, JAX) * Strong ...
Intern, ML Algorithms
San Jose, CA · On-site
Knowledge of (deep) reinforcement learning, optimization, and search techniques. * Knowledge of statistical learning-e.g., deep neural networks, sequence processing, graph neural networks, etc.
Intern, ML Algorithms
San Jose, CA · On-site
Knowledge of (deep) reinforcement learning, optimization, and search techniques. * Knowledge of statistical learning-e.g., deep neural networks, sequence processing, graph neural networks, etc.
Reinforcement Learning Engineer
Fremont, CA · Remote
$100K - $150K/yr
The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale.
Reinforcement Learning Engineer
Fremont, CA · Remote
$100K - $150K/yr
The role requires deep familiarity with modern reinforcement learning algorithms, simulation environments, reward modeling, and the engineering complexity of training and evaluating policies at scale.
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
$17.50 - $23.50/hr
We are looking for PhD research interns excited to advance the next generation of large language models through reinforcement learning. Our applied deep learning research team at NVIDIA has helped ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
$17.50 - $23.50/hr
We are looking for PhD research interns excited to advance the next generation of large language models through reinforcement learning. Our applied deep learning research team at NVIDIA has helped ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
Santa Clara, CA · On-site
$17.50 - $23.50/hr
We are looking for PhD research interns excited to advance the next generation of large language models through reinforcement learning. Our applied deep learning research team at NVIDIA has helped ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
Santa Clara, CA · On-site
$17.50 - $23.50/hr
We are looking for PhD research interns excited to advance the next generation of large language models through reinforcement learning. Our applied deep learning research team at NVIDIA has helped ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Quick apply
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
... deep learning, driving impactful discoveries that inspire the next generation of AI pioneers. Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Quick apply
... deep learning, driving impactful discoveries that inspire the next generation of AI pioneers. Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Research Scientist - Reinforcement Learning
Sunnyvale, CA · On-site
$150K - $450K/yr
... deep learning, driving impactful discoveries that inspire the next generation of AIpioneers ... Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Research Scientist - Reinforcement Learning
Sunnyvale, CA · On-site
$150K - $450K/yr
... deep learning, driving impactful discoveries that inspire the next generation of AIpioneers ... Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Helix AI Engineer, Reinforcement Learning Figure is an AI robotics company developing autonomous ... Proficiency in Python and deep learning frameworks such as PyTorch * Experience with large-scale ...
Helix AI Engineer, Reinforcement Learning Figure is an AI robotics company developing autonomous ... Proficiency in Python and deep learning frameworks such as PyTorch * Experience with large-scale ...
... in Python and deep learning frameworks such as PyTorch • Experience with large-scale ... reinforcement learning, machine learning, or robotics Company : Figure is an AI robotics company ...
... in Python and deep learning frameworks such as PyTorch • Experience with large-scale ... reinforcement learning, machine learning, or robotics Company : Figure is an AI robotics company ...
Research Intern - Reinforcement Learning (RL) - Onsite
$17.75 - $23.75/hr
Design and build reinforcement learning environments that model real-world customer interaction ... Build alongside a team with deep expertise from Amazon, Google, and Meta * Be part of a fast ...
Research Intern - Reinforcement Learning (RL) - Onsite
$17.75 - $23.75/hr
Design and build reinforcement learning environments that model real-world customer interaction ... Build alongside a team with deep expertise from Amazon, Google, and Meta * Be part of a fast ...
Research Engineer, Machine Learning (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Research Engineer, Machine Learning (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Deep Reinforcement Learning information
See California salary details
$28.1K - $32.7K
2% of jobs
$32.7K - $37.4K
3% of jobs
$37.4K - $42K
6% of jobs
$42K - $46.6K
3% of jobs
$50.7K is the 25th percentile. Wages below this are outliers.
$46.6K - $51.2K
12% of jobs
The median wage is $55.6K / yr.
$51.2K - $55.9K
25% of jobs
$55.9K - $60.5K
12% of jobs
$60.5K - $65.1K
9% of jobs
$66K is the 75th percentile. Wages above this are outliers.
$65.1K - $69.7K
12% of jobs
$69.7K - $74.3K
11% of jobs
$74.3K - $79K
5% of jobs
$28.1K
$57.6K
$79K
How much do deep reinforcement learning jobs pay per year?
What is a Deep Reinforcement Learning job?
What are the key skills and qualifications needed to thrive in the Deep Reinforcement Learning position, and why are they important?
What does a typical day look like for someone working in Deep Reinforcement Learning?
- Data Science Machine Learning
- Machine Engineer
- Flexible Machine Learning Engineer Biotech
- Deep Learning Engineer
- Volunteer Junior Machine Learning Engineer
- Machine Learning Specialist
- Machine Learning Engineer New Grad
- Temporary Machine Learning R
- Scientific Machine Learning
- Manager Remote Machine Learning Engineer
- Associate Machine Learning Chemistry
- Overnight Data Science Mentor
- Staff Machine Learning Engineer
- Research Machine Learning Federated Learning
- Helper Reinforcement Learning
- Meta Machine Learning
- Machine Learning Engineer Two
- Privacy Preserving Machine Learning
- Applied Mathematics Phd
- Senior Meta Machine Learning

Full-time
Medical, Dental, Vision, Life, Retirement, PTO
Posted yesterday
DoorDash rating
6.4
Based on 177 frontline employees who took The Breakroom Quiz
13th of 22 rated food delivery companies
Job description
Our DD Labs team builds real-time autonomous delivery systems. The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical planning, learning policies that generalize across novel driving scenarios, handle long-tail edge cases, and improve continuously from large-scale fleet data. Our models jointly handle prediction and planning in a single unified architecture. Our stack is pure JAX end-to-end: the same code you train with is the code that runs on the robot. No C++ rewrites, no TensorRT export. A new policy goes from training to on-vehicle deployment in minutes.
About the Role
As a Senior/Staff Deep RL Engineer, you will design, train, and deploy deep reinforcement learning policies that make real-time driving decisions for our autonomous vehicles. You will own the full lifecycle, from problem formulation and reward design through large-scale distributed training to on-vehicle inference. You'll help define how learned components compose with the rest of the autonomy stack to produce robust, shippable behavior.
You're excited about this opportunity because you will...
- Formulate complex driving tasks as RL problems with well-shaped reward functions and expressive state/action representations.
- Design and train model-based deep RL agents using GPU-accelerated simulation at massive scale, including improving the simulator itself.
- Build and maintain distributed training infrastructure in JAX across large compute clusters.
- Build agentic optimization systems that automatically improve code, run experiments, analyze metrics, and iterate on RL policies with minimal human intervention.
- BS/MS/PhD in CS, EE, Robotics, or a related field, with a strong foundation in reinforcement learning and deep learning.
- You have proficiency in using AI coding tools (e.g., Claude Code, Codex, Cursor) in the full software development lifecycle, including designing, generating code, testing, monitoring and releasing software
- Hands-on experience training RL agents at scale, ideally in robotics, autonomous driving, or other real-time decision-making domains.
- Proficiency in JAX or a similar functional ML framework; comfort with JIT compilation, vectorized environments, and GPU-accelerated simulation.
- Deep grasp of core RL concepts: policy gradients, value functions, exploration-exploitation, model-based RL, reward shaping, and sim-to-real transfer.
- Data-driven mindset: comfortable building experiment pipelines, analyzing training runs, and letting metrics guide architectural decisions.
- Publications at top venues (NeurIPS, ICML, ICLR, CoRL, RSS, ICRA) on RL or learned planning.
- Experience building or working with GPU-accelerated simulators for RL training.
- Track record of shipping a learned component in a production robotics or autonomous vehicle stack.
Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only
We use Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion process, we provide Covey with job requirements and candidate submitted applications. We began using Covey Scout for Inbound from August 21, 2023, through December 21, 2023, and resumed using Covey Scout for Inbound again on June 29, 2024.
The Covey tool has been reviewed by an independent auditor. Results of the audit may be viewed here: Covey
Compensation
The successful candidate's starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee's work location. Ranges are market-dependent and may be modified in the future.
In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information.
DoorDash cares about you and your overall well-being. That's why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, paid time off and paid sick leave in compliance with applicable laws (e.g. Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others.
To learn more about our benefits, visit our careers page here.
See below for paid time off details:
- For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year.
- For hourly roles: vacation accrued at about 1 hour for every 25.97 hours worked (e.g. about 6.7 hours/month if working 40 hours/week; about 3.4 hours/month if working 20 hours/week), and paid sick time accrued at 1 hour for every 30 hours worked (e.g. about 5.8 hours/month if working 40 hours/week; about 2.9 hours/month if working 20 hours/week).
The national base pay range for this position within the United States, including Illinois and Colorado.
$168,000-$247,000 USD
About DoorDash
At DoorDash, our mission to empower local economies shapes how our team members move quickly, learn, and reiterate in order to make impactful decisions that display empathy for our range of users-from Dashers to merchant partners to consumers. We are a technology and logistics company that started by enabling door-to-door delivery, and we are looking for team members who can help us go from a company that is known as the place you order food to a company that people turn to for any and all goods.
DoorDash is growing rapidly and changing constantly, which gives our team members the opportunity to share their unique perspectives, solve new challenges, and own their careers. We're committed to supporting employees' happiness, healthiness, and overall well-being by providing comprehensive benefits and perks including premium healthcare, wellness expense reimbursement, paid parental leave and more.
Our Commitment to Diversity and Inclusion
We're committed to growing and empowering a more inclusive community within our company, industry, and cities. That's why we hire and cultivate diverse teams of people from all backgrounds, experiences, and perspectives. We believe that true innovation happens when everyone has room at the table and the tools, resources, and opportunity to excel.
Statement of Non-Discrimination: In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on: race, color, ancestry, national origin, religion, age, gender, marital/domestic partner status, sexual orientation, gender identity or expression, disability status, or veteran status. Above and beyond discrimination and harassment based on "protected categories," we also strive to prevent other subtler forms of inappropriate behavior (i.e., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place at DoorDash. We value a diverse workforce - people who identify as women, non-binary or gender non-conforming, LGBTQIA+, American Indian or Native Alaskan, Black or African American, Hispanic or Latinx, Native Hawaiian or Other Pacific Islander, differently-abled, caretakers and parents, and veterans are strongly encouraged to apply. Thank you to the Level Playing Field Institute for this statement of non-discrimination.
Pursuant to the San Francisco Fair Chance Ordinance, Los Angeles Fair Chance Initiative for Hiring Ordinance, and any other state or local hiring regulations, we will consider for employment any qualified applicant, including those with arrest and conviction records, in a manner consistent with the applicable regulation.
If you need any accommodations, please inform your recruiting contact upon initial connection.
Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only
We used Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion process, we provided Covey with job requirements and candidate submitted applications. We began using Covey Scout for Inbound from August 21, 2023, through December 21, 2023. We resumed using Covey Scout for Inbound again on June 29, 2024, and ceased using Covey Scout for Inbound on April 30, 2026.
The Covey tool has been reviewed by an independent auditor. Results of the audit may be viewed here: https://getcovey.com/nyc-local-law-144.
About DoorDash
Sourced by ZipRecruiter
At DoorDash, our mission to empower local economies shapes how our team members move quickly, learn, and reiterate in order to make impactful decisions that display empathy for our range of users--from Dashers to merchant partners to consumers. We are a technology and logistics company that started with door-to-door delivery, and we are looking for team members who can help us go from a company that is known for delivering food to a company that people turn to for any and all goods. DoorDash is growing rapidly and changing constantly, which gives our team members the opportunity to share their unique perspectives, solve new challenges, and own their careers. We're committed to supporting employees' happiness, healthiness, and overall well-being by providing comprehensive benefits and perks including premium healthcare, wellness expense reimbursement, paid parental leave and more.
Industry
Transportation equipment manufacturing
Company size
10,000+ Employees
Headquarters location
San Francisco, CA, US
Year founded
2013