The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move ... Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only We ...
The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move ... Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only We ...
Machine Learning Engineer, Depot Automation
Mountain View, CA · On-site
$175K - $215K/yr
Key work involves leveraging foundation models, reinforcement learning, simulation, and integrating ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Machine Learning Engineer, Depot Automation
Mountain View, CA · On-site
$175K - $215K/yr
Key work involves leveraging foundation models, reinforcement learning, simulation, and integrating ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Senior Staff Machine Learning Engineer, Depot Automation
Mountain View, CA · On-site
$298K - $368K/yr
Key work involves leveraging foundation models, reinforcement learning, simulation, and integrating ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Senior Staff Machine Learning Engineer, Depot Automation
Mountain View, CA · On-site
$298K - $368K/yr
Key work involves leveraging foundation models, reinforcement learning, simulation, and integrating ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Staff Machine Learning Engineer - VLM/LLM Evaluation
Mountain View, CA · On-site
$238K - $302K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Staff Machine Learning Engineer - VLM/LLM Evaluation
Mountain View, CA · On-site
$238K - $302K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Senior Machine Learning Engineer, Data Mining
San Francisco, CA · On-site +1
$144K - $190K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Quick apply
Senior Machine Learning Engineer, Data Mining
San Francisco, CA · On-site +1
$144K - $190K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Research Scientist, RL for Autonomous Planning & World Modeling
Mountain View, CA · On-site
$204K - $259K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... Please speak with your recruiter about your preferred location for remote work when you begin the ...
Research Scientist, RL for Autonomous Planning & World Modeling
Mountain View, CA · On-site
$204K - $259K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... Please speak with your recruiter about your preferred location for remote work when you begin the ...
Senior Machine Learning Engineer
San Francisco, CA · On-site +1
$186K - $300K/yr
We are building a self-healing ecosystem where Multi-Agent Systems and Reinforcement Learning (RL ... Employee divides their time between in-office and remote work. Access to an office location is ...
Senior Machine Learning Engineer
San Francisco, CA · On-site +1
$186K - $300K/yr
We are building a self-healing ecosystem where Multi-Agent Systems and Reinforcement Learning (RL ... Employee divides their time between in-office and remote work. Access to an office location is ...
Senior Research Scientist, Perception
Mountain View, CA · On-site
$213K - $263K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Senior Research Scientist, Perception
Mountain View, CA · On-site
$213K - $263K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Machine Learning Engineer - Expert
San Francisco, CA · On-site +1
$90/hr
Remote Role Responsibilities * Develop end-to-end machine learning solutions for challenging ... tuning, or reinforcement learning. * Publications, patents, or significant open-source ...
Quick apply
Machine Learning Engineer - Expert
San Francisco, CA · On-site +1
$90/hr
Remote Role Responsibilities * Develop end-to-end machine learning solutions for challenging ... tuning, or reinforcement learning. * Publications, patents, or significant open-source ...
Explore and apply advanced methods where beneficial-e.g., game-theoretic approaches, reinforcement ... REMOTE
Explore and apply advanced methods where beneficial-e.g., game-theoretic approaches, reinforcement ... REMOTE
Applied Research Intern, Proactive Intelligence & Customer World Models (PhD / Graduate Co-op)
Bodega Bay, CA · Remote
Remote (US / Canada) Duration: Fall/Winter 2026 co-op - 8 months, flexible start September 2026 ... You'll work at the intersection of representation learning, foundation models, reinforcement ...
Applied Research Intern, Proactive Intelligence & Customer World Models (PhD / Graduate Co-op)
Bodega Bay, CA · Remote
Remote (US / Canada) Duration: Fall/Winter 2026 co-op - 8 months, flexible start September 2026 ... You'll work at the intersection of representation learning, foundation models, reinforcement ...
Senior Machine Learning Engineer, Simulation
Mountain View, CA · On-site
$204K - $259K/yr
Experience in reinforcement learning, transfer learning, or learning. * Experience with large scale ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Senior Machine Learning Engineer, Simulation
Mountain View, CA · On-site
$204K - $259K/yr
Experience in reinforcement learning, transfer learning, or learning. * Experience with large scale ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Our team focuses on building advanced AI agents through reinforcement learning, game-solving, fine ... What We Offer- Remote-first environment - Opportunity to work on innovative AI projects ...
Our team focuses on building advanced AI agents through reinforcement learning, game-solving, fine ... What We Offer- Remote-first environment - Opportunity to work on innovative AI projects ...
Research Engineer
San Francisco, CA · Remote
On-site (some team members are remote, but this role is currently on-site) Industry: AI infrastructure / Reinforcement Learning (RL) training data & evaluations Compensation: Competitive (range not ...
Research Engineer
San Francisco, CA · Remote
On-site (some team members are remote, but this role is currently on-site) Industry: AI infrastructure / Reinforcement Learning (RL) training data & evaluations Compensation: Competitive (range not ...
Data Scientist
San Francisco, CA · On-site +1
$160K - $200K/yr
... Reinforcement Learning, Statistics, and Optimization. The role will report directly to the CTO. ... This is a remote position, but we do have an office in San Fransisco. You will be the first data ...
Data Scientist
San Francisco, CA · On-site +1
$160K - $200K/yr
... Reinforcement Learning, Statistics, and Optimization. The role will report directly to the CTO. ... This is a remote position, but we do have an office in San Fransisco. You will be the first data ...
Senior Research Scientist, World Action Modeling
Mountain View, CA · On-site
$213K - $263K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Senior Research Scientist, World Action Modeling
Mountain View, CA · On-site
$213K - $263K/yr
AI Foundations areas that we are currently focusing on include reinforcement learning, learning ... remote, the specific salary range for your preferred location, during the hiring process. Waymo ...
Data Scientist
San Francisco, CA · Remote
$160K - $200K/yr
... Reinforcement Learning, Statistics, and Optimization. The role will report directly to the CTO. ... This is a remote position, but we do have an office in San Fransisco. You will be the first data ...
Data Scientist
San Francisco, CA · Remote
$160K - $200K/yr
... Reinforcement Learning, Statistics, and Optimization. The role will report directly to the CTO. ... This is a remote position, but we do have an office in San Fransisco. You will be the first data ...
PhD Fall Machine Learning Intern (ATG - Visual, Multimodal, and Recommender Systems)
San Francisco, CA · On-site
Depending on the team, our fall internships will be located either remote or hybrid in San ... Reinforcement Learning, ML efficiency optimization, Generative AI, and LLMs. * Ability to legally ...
PhD Fall Machine Learning Intern (ATG - Visual, Multimodal, and Recommender Systems)
San Francisco, CA · On-site
Depending on the team, our fall internships will be located either remote or hybrid in San ... Reinforcement Learning, ML efficiency optimization, Generative AI, and LLMs. * Ability to legally ...
Director, Prediction and ML Planning
San Francisco, CA · On-site +1
$288K - $396K/yr
Advanced expertise in Imitation Learning and Reinforcement Learning for decision-making. Strong ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Quick apply
Director, Prediction and ML Planning
San Francisco, CA · On-site +1
$288K - $396K/yr
Advanced expertise in Imitation Learning and Reinforcement Learning for decision-making. Strong ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Remote Reinforcement Learning information
What is a Remote Reinforcement Learning job?
What are the key skills and qualifications needed to thrive as a Remote Reinforcement Learning Engineer, and why are they important?
What is the difference between Remote Reinforcement Learning vs Remote Machine Learning Engineer?
| Aspect | Remote Reinforcement Learning |
|---|---|
| Required Credentials | Master's or PhD in Computer Science, AI, or related fields; knowledge of RL algorithms |
| Work Environment | Research-focused, experimental, often involves simulation and algorithm development |
| Employer & Industry Usage | Tech companies, research labs, AI startups focusing on autonomous systems |
| Common Search & Comparison Intent | Understanding specialized AI roles, research focus, and technical skills |
Remote Reinforcement Learning specialists focus on developing algorithms that enable machines to learn through trial and error in simulated or real environments. In contrast, Remote Machine Learning Engineers typically work on deploying and optimizing various machine learning models across applications. While both roles require strong programming skills and knowledge of AI, reinforcement learning emphasizes decision-making processes, whereas machine learning engineering covers a broader range of models and deployment strategies.
What are common challenges faced when working remotely in a Reinforcement Learning role and how can they be addressed?
- Remote Instructional Designer
- Evening Remote Instructional Designer
- Chief Online Learning Officer
- Overnight Epic Instructional Designer
- Work From Home Remote Contract Specialist
- Vp Learning Development
- E Learning Development
- Assistant Docebo Lms
- Virtual Instructional Designer
- Learning & Development Facilitator
Full-time
Medical, Dental, Vision, Life, Retirement, PTO
Posted 15 hours ago
DoorDash rating
6.4
Based on 179 frontline employees who took The Breakroom Quiz
11th of 22 rated food delivery companies
Job description
Our DD Labs team builds real-time autonomous delivery systems. The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical planning, learning policies that generalize across novel driving scenarios, handle long-tail edge cases, and improve continuously from large-scale fleet data. Our models jointly handle prediction and planning in a single unified architecture. Our stack is pure JAX end-to-end: the same code you train with is the code that runs on the robot. No C++ rewrites, no TensorRT export. A new policy goes from training to on-vehicle deployment in minutes.
About the Role
As a Senior/Staff Deep RL Engineer, you will design, train, and deploy deep reinforcement learning policies that make real-time driving decisions for our autonomous vehicles. You will own the full lifecycle, from problem formulation and reward design through large-scale distributed training to on-vehicle inference. You'll help define how learned components compose with the rest of the autonomy stack to produce robust, shippable behavior.
You're excited about this opportunity because you will...
- Formulate complex driving tasks as RL problems with well-shaped reward functions and expressive state/action representations.
- Design and train model-based deep RL agents using GPU-accelerated simulation at massive scale, including improving the simulator itself.
- Build and maintain distributed training infrastructure in JAX across large compute clusters.
- Build agentic optimization systems that automatically improve code, run experiments, analyze metrics, and iterate on RL policies with minimal human intervention.
- BS/MS/PhD in CS, EE, Robotics, or a related field, with a strong foundation in reinforcement learning and deep learning.
- You have proficiency in using AI coding tools (e.g., Claude Code, Codex, Cursor) in the full software development lifecycle, including designing, generating code, testing, monitoring and releasing software
- Hands-on experience training RL agents at scale, ideally in robotics, autonomous driving, or other real-time decision-making domains.
- Proficiency in JAX or a similar functional ML framework; comfort with JIT compilation, vectorized environments, and GPU-accelerated simulation.
- Deep grasp of core RL concepts: policy gradients, value functions, exploration-exploitation, model-based RL, reward shaping, and sim-to-real transfer.
- Data-driven mindset: comfortable building experiment pipelines, analyzing training runs, and letting metrics guide architectural decisions.
- Publications at top venues (NeurIPS, ICML, ICLR, CoRL, RSS, ICRA) on RL or learned planning.
- Experience building or working with GPU-accelerated simulators for RL training.
- Track record of shipping a learned component in a production robotics or autonomous vehicle stack.
The Storage teams build and operate online stateful systems and abstractions that are reliable, efficient, secure and easy to use for DoorDash Engineering. The teams are responsible for understanding Product Engineering's evolving needs and developing platform and infrastructure capabilities to serve them. The team currently supports CockroachDB, Cassandra, Kafka and Redis as well as data abstraction services to reduce the complexity of interacting with storage systems for Product Engineers.
About the Role
We're hiring a Data Solutions Engineer with deep expertise in distributed databases, particularly Apache Cassandra, Redis, Kafka, and database agnostic abstractions. In this role, you will design, optimize, and scale distributed data access layers that power DoorDash's most critical systems, ensuring high availability, low latency, and fault tolerance.
You'll serve as a hands-on architect and technical partner to product engineering and infrastructure teams, helping translate complex business requirements into resilient and scalable data models. Your work will directly influence the evolution of Taulu, DoorDash's unified storage abstraction layer, by shaping best practices and identifying platform gaps through real world engagements.
This is a high-impact, cross functional role that combines deep technical expertise with a customer centric approach. You'll lead solutioning engagements from design through production, drive the adoption of Taulu modeling best practices, and ensure that our systems meet goals around reliability, cost efficiency, and velocity. You must be located in San Francisco, Sunnyvale, Seattle or New York for this hybrid opportunity.
You're excited about this opportunity because you will...
- Design and implement highly scalable, fault tolerant distributed database solutions using Taulu, Apache Cassandra, Redis, Kafka, and other paved path storage solutions.
- Architect and optimize multi-region, globally distributed systems to meet our high standards for availability, latency, and throughput.
- Lead data modeling, performance tuning, and capacity planning for large-scale, mission-critical storage workloads.
- Partner with product engineering and infrastructure teams to deeply understand domain specific data needs and guide them in adopting paved path storage solutions.
- Serve as the DRI for solutioning engagements, owning modeling in Taulu from experimentation through launch and scale.
- Shape the evolution of Taulu by identifying abstraction gaps and converting customer feedback into platform improvements.
- Apply workload-aware design patterns, including caching strategies, partitioning, and consistency tuning to improve performance and efficiency.
- Drive adoption of operational best practices across observability, schema design, capacity planning, and cost optimization across storage systems.
- Promote clarity and continuity by contributing to solutioning playbooks, decision logs, and architectural documentation.
- You have 10+ years of experience designing and scaling distributed data systems, with deep expertise in NoSQL technologies like Apache Cassandra, DynamoDB, or ScyllaDB.
- You have a strong command of distributed system concepts such as replication, partitioning, tunable consistency, and failure recovery.
- You've led data modeling efforts for high-throughput, low-latency workloads and understand the real-world trade-offs involved in NoSQL schema design.
- You are experienced with caching technologies like Redis or Memcached and know how to layer them effectively over storage systems to optimize for performance and cost.
- You have a customer-first mindset, and thrive when working closely with product and platform teams to translate complex requirements into clean, scalable data models.
- You are skilled at communicating complex architecture decisions and building alignment across infrastructure and product engineering organizations.
- You have a track record of mentoring engineers, influencing data architecture at scale, and fostering best practices in reliability, observability, and data access patterns.
- You document decisions, share learnings, and take pride in contributing to reusable playbooks and durable frameworks for others to build upon.
- Bonus: You've worked on or contributed to open-source distributed databases.
Notice Regarding Use of AI and Automated Tools: To streamline our hiring process, DoorDash utilizes an automated recruitment tool called Gem.
How it works: Gem assists our recruiting team by evaluating job related qualifications and characteristics in connection with hiring. The tool is designed and used to support - rather than replace - human decision-making; trained personnel make final decisions with meaningful human review and oversight, and DoorDash does not use Gem or other AI-enabled tool in a manner that has the effect of subjecting applicants or employees to discrimination based on any protected characteristic or proxy or for engaging in any protected activity under applicable law.
Data Retention, Privacy & Bias Audit: Data collected during this process is retained in accordance with our Candidate Privacy Policy and applicable state laws. In compliance with New York City Local Law 144, the independent bias audit summary for Gem is publicly available for review at our Careers Page.
Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only
We use Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion process, we provide Covey with job requirements and candidate submitted applications. We began using Covey Scout for Inbound from August 21, 2023, through December 21, 2023, and resumed using Covey Scout for Inbound again on June 29, 2024.
The Covey tool has been reviewed by an independent auditor. Results of the audit may be viewed here: Covey
Compensation
The successful candidate's starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee's work location. Ranges are market-dependent and may be modified in the future.
In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information.
DoorDash cares about you and your overall well-being. That's why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, paid time off and paid sick leave in compliance with applicable laws (e.g. Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others.
To learn more about our benefits, visit our careers page here.
See below for paid time off details:
- For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year.
- For hourly roles: vacation accrued at about 1 hour for every 25.97 hours worked (e.g. about 6.7 hours/month if working 40 hours/week; about 3.4 hours/month if working 20 hours/week), and paid sick time accrued at 1 hour for every 30 hours worked (e.g. about 5.8 hours/month if working 40 hours/week; about 2.9 hours/month if working 20 hours/week).
The national base pay range for this position within the United States, including Illinois and Colorado.
$168,000-$247,000 USD
About DoorDash
At DoorDash, our mission to empower local economies shapes how our team members move quickly, learn, and reiterate in order to make impactful decisions that display empathy for our range of users-from Dashers to merchant partners to consumers. We are a technology and logistics company that started by enabling door-to-door delivery, and we are looking for team members who can help us go from a company that is known as the place you order food to a company that people turn to for any and all goods.
DoorDash is growing rapidly and changing constantly, which gives our team members the opportunity to share their unique perspectives, solve new challenges, and own their careers. We're committed to supporting employees' happiness, healthiness, and overall well-being by providing comprehensive benefits and perks including premium healthcare, wellness expense reimbursement, paid parental leave and more.
Our Commitment to Diversity and Inclusion
We're committed to growing and empowering a more inclusive community within our company, industry, and cities. That's why we hire and cultivate diverse teams of people from all backgrounds, experiences, and perspectives. We believe that true innovation happens when everyone has room at the table and the tools, resources, and opportunity to excel.
Statement of Non-Discrimination: In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on: race, color, ancestry, national origin, religion, age, gender, marital/domestic partner status, sexual orientation, gender identity or expression, disability status, or veteran status. Above and beyond discrimination and harassment based on "protected categories," we also strive to prevent other subtler forms of inappropriate behavior (i.e., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place at DoorDash. We value a diverse workforce - people who identify as women, non-binary or gender non-conforming, LGBTQIA+, American Indian or Native Alaskan, Black or African American, Hispanic or Latinx, Native Hawaiian or Other Pacific Islander, differently-abled, caretakers and parents, and veterans are strongly encouraged to apply. Thank you to the Level Playing Field Institute for this statement of non-discrimination.
Pursuant to the San Francisco Fair Chance Ordinance, Los Angeles Fair Chance Initiative for Hiring Ordinance, and any other state or local hiring regulations, we will consider for employment any qualified applicant, including those with arrest and conviction records, in a manner consistent with the applicable regulation.
If you need any accommodations, please inform your recruiting contact upon initial connection.
Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only
We used Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion proce
About DoorDash
Sourced by ZipRecruiter
At DoorDash, our mission to empower local economies shapes how our team members move quickly, learn, and reiterate in order to make impactful decisions that display empathy for our range of users--from Dashers to merchant partners to consumers. We are a technology and logistics company that started with door-to-door delivery, and we are looking for team members who can help us go from a company that is known for delivering food to a company that people turn to for any and all goods. DoorDash is growing rapidly and changing constantly, which gives our team members the opportunity to share their unique perspectives, solve new challenges, and own their careers. We're committed to supporting employees' happiness, healthiness, and overall well-being by providing comprehensive benefits and perks including premium healthcare, wellness expense reimbursement, paid parental leave and more.
Industry
Transportation equipment manufacturing
Company size
10,000+ Employees
Headquarters location
San Francisco, CA, US
Year founded
2013