Remote Reinforcement Learning Intern Jobs (NOW HIRING)

Senior Machine Learning Engineer, Data Mining

$118K - $156K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Truveta

ML PhD Intern - LLMs & Generative AI

Seattle, WA · Remote

... remote Who We Need We are seeking a highly motivated and talented Machine Learning PhD Intern to ... Familiarity with parameter-efficient tuning techniques, Reinforcement Learning from Human Feedback ...

Truveta

ML PhD Intern - LLMs & Generative AI

Seattle, WA · Remote

Motional

Senior Machine Learning Engineer, Data Mining

San Francisco, CA · On-site +1

$144K - $190K/yr

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

San Francisco, CA · On-site +1

$144K - $190K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

$125K - $165K/yr

Motional

Senior Machine Learning Engineer, Data Mining

$125K - $165K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Keebo

Senior Machine Learning Engineer

$125K - $165K/yr

Experience with Reinforcement Learning * Experience with Google Cloud and BigQuery Our Environment Keebo is a fully remote, global team with team members currently in the US, EU, and Canada. What we ...

Keebo

Senior Machine Learning Engineer

$125K - $165K/yr

TikTok

$42.75/hr

The team is made up of machine learning researchers and engineers, who support and innovate on ... Interns who are not working 100% remote may also be eligible for housing allowance. The Company ...

TikTok

$42.75/hr

talentpluto

Go-to-Market Engineer

$130K - $160K/yr

Fully remote Industry: Reinforcement learning environments (AI) Compensation: $130K-$160K base, plus signing bonus for the right candidate About the Company Our partner is a YC-backed company helping ...

talentpluto

Go-to-Market Engineer

$130K - $160K/yr

Mercor

Machine Learning Expert - Fully Remote | Upto $90/hr

San Francisco, CA · Remote

$90/hr

Remote Commitment: 20+ hours/week Role Responsibilities * Attempt open-ended machine learning ... Practical experience in Pretraining , Reinforcement learning , Post-training , Dataset curation ...

Quick apply

Mercor

Machine Learning Expert - Fully Remote | Upto $90/hr

San Francisco, CA · Remote

$90/hr

talentpluto

Go-to-Market Engineer

$130K - $160K/yr

talentpluto

Go-to-Market Engineer

$130K - $160K/yr

talentpluto

Go-to-Market Engineer

$130K - $160K/yr

talentpluto

Go-to-Market Engineer

$130K - $160K/yr

talentpluto

Founding Go-To-Market Engineer

$130K - $160K/yr

talentpluto

Founding Go-To-Market Engineer

$130K - $160K/yr

talentpluto

Founding Go-To-Market Engineer

$130K - $160K/yr

talentpluto

Founding Go-To-Market Engineer

$130K - $160K/yr

Lockheed Martin Corporation

AI / Machine Learning Engineer Sr Staff (Signal Processing)

Fort Worth, TX · On-site +1

... and reinforcement learning, to solve complex signal processing problems and field innovative ... Remote work arrangements may also be considered for qualified candidates. #LMLAIC

Lockheed Martin Corporation

AI / Machine Learning Engineer Sr Staff (Signal Processing)

Fort Worth, TX · On-site +1

... and reinforcement learning, to solve complex signal processing problems and field innovative ... Remote work arrangements may also be considered for qualified candidates. #LMLAIC

SW5 Consulting

Senior Machine Learning Engineer

New York, NY · Remote

$180K - $250K/yr

Remote (U.S.) or New York City Compensation: $180K - $250K + Equity Employment Type: Full-time ... Background in marketing tech or ad tech . * Experience with LLMs, reinforcement learning, or bandit ...

Quick apply

SW5 Consulting

Senior Machine Learning Engineer

New York, NY · Remote

$180K - $250K/yr

Synnex

Physical AI Engineer - Simulation & Synthetic Data

Clearwater, FL · On-site +1

Design and run reinforcement learning and imitation learning pipelines using simulationgenerated ... Remote / hybrid (USbased) * Occasional domestic and global travel * Flexible working hours aligned ...

Synnex

Physical AI Engineer - Simulation & Synthetic Data

Clearwater, FL · On-site +1

Nrel

Graduate (3-12 month) Intern - Artificial Intelligence for Power System Operations

Golden, CO · On-site +1

$51K - $81K/yr

Experience with other ML/AI techniques, including reinforcement learning and graph neural networks ... Intern assignments extending beyond six months will be subject to this requirement. Drug Free ...

Nrel

Graduate (3-12 month) Intern - Artificial Intelligence for Power System Operations

Golden, CO · On-site +1

$51K - $81K/yr

Showing results 1-20

Remote Reinforcement Learning Intern Jobs

Remote Reinforcement Learning Intern information

See salary details

$17

$24

How much do remote reinforcement learning intern jobs pay per hour?

As of Jul 22, 2026, the average hourly pay for remote reinforcement learning intern in the United States is $17.04, according to ZipRecruiter salary data. Most workers in this role earn between $14.42 and $19.23 per hour, depending on experience, location, and employer.

What does a Remote Reinforcement Learning Intern do?

A Remote Reinforcement Learning Intern assists with research and development projects that focus on reinforcement learning, a type of machine learning where agents learn to make decisions by trial and error. Their tasks often include implementing algorithms, running experiments, analyzing results, and contributing to academic papers or practical applications. Working remotely, they collaborate with teams using online tools and communicate progress regularly. The role is ideal for students or recent graduates who want to gain hands-on experience in artificial intelligence and machine learning.

What are some common challenges faced by remote reinforcement learning interns, and how can they be overcome?

Remote reinforcement learning interns often encounter challenges related to communication and collaboration, especially when working with distributed teams. It can also be difficult to access computational resources or receive timely feedback on experiments. To overcome these challenges, it's important to proactively schedule regular check-ins with mentors, utilize collaborative tools (such as Slack or GitHub), and ensure a reliable internet connection. Additionally, keeping detailed documentation and being transparent about progress can help facilitate smoother teamwork and problem-solving.

What are the key skills and qualifications needed to thrive as a Remote Reinforcement Learning Intern, and why are they important?

To thrive as a Remote Reinforcement Learning Intern, you need a strong background in mathematics, programming (especially Python), and foundational knowledge of machine learning concepts, typically demonstrated through coursework or relevant projects. Familiarity with reinforcement learning libraries (such as TensorFlow, PyTorch, or OpenAI Gym), version control systems like Git, and possibly cloud computing platforms is highly valuable. Excellent problem-solving abilities, self-motivation, and effective remote communication skills help interns excel in independent and collaborative tasks. These skills are essential for contributing to innovative research and development projects while working efficiently in a distributed team environment.

More about Remote Reinforcement Learning Intern jobs

The 10 Top Types Of Remote Reinforcement Learning Intern Jobs

What cities are hiring for Remote Reinforcement Learning Intern jobs? Cities with the most Remote Reinforcement Learning Intern job openings:

What states have the most Remote Reinforcement Learning Intern jobs? States with the most job openings for Remote Reinforcement Learning Intern jobs include:

What job categories do people searching Remote Reinforcement Learning Intern jobs look for? The top searched job categories for Remote Reinforcement Learning Intern jobs are:

Remote Reinforcement Learning Intern jobs near you

Infographic showing various Remote Reinforcement Learning Intern job openings in the United States as of July 2026, with employment types broken down into 12% Internship, 1% As Needed, 49% Full Time, 34% Part Time, 2% Temporary, and 2% Contract. Highlights an 93% Physical, 2% Hybrid, and 5% Remote job distribution, with an average salary of $35,436 per year, or $17 per hour.

Senior Machine Learning Engineer, Data Mining

Motional

Pittsburgh, PA • On-site, Remote

$118K - $156K/yr

Full-time

Medical, Dental, Vision, Life, Retirement

Posted 11 days ago

Job description

Mission Summary:

At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. Omnitag, our ML-powered multimodal data mining framework, is the engine that powers this discovery.

As a Senior Machine Learning Engineer on the Data Mining team, your mission is to build the "Brain" of this engine: designing massive multimodal Teacher models that understand the world, and distilling them into hyper-efficient Student models that can scour exabytes of data in near real-time. You will work at the intersection of large-scale representation learning, retrieval optimization, and reasoning systems. Your work will directly influence how we compress knowledge into efficient encoders for fast search, and how we apply reinforcement learning to optimize data discovery workflows and intelligent querying. By building smarter mining tools, you will accelerate the entire model improvement lifecycle for teams working on post-training analysis, error diagnosis, and dataset curation.

What You'll Do:

Architect and Train Distilled Models: Design and implement teacher-student model frameworks for multimodal sensor data. Develop training pipelines for knowledge distillation. Ensure student models maintain high accuracy while drastically reducing inference latency and memory footprint.
Reinforcement Learning for Data Discover: Build RL-based policy learning and reasoning systems for autonomous driving applications. Implement and scale RL training workflows (e.g., PPO, DQN, actor-critic methods) for simulation and real-world interaction. Explore reward shaping, environment modeling, and multi-agent RL where applicable.
Optimize Model Deployment for Real-Time Inference: Collaborate with backend engineers to deploy distilled and RL models into production. Optimize for latency, throughput, and hardware efficiency across GPU/CPU clusters. Implement model versioning, A/B testing, and monitoring for performance regressions.
Research and Integrate Agentic Systems: Explore and prototype agentic workflows for autonomous reasoning, chain-of-thought prompting, and goal-directed behavior. Integrate such systems into our broader autonomy stack as experimental or production components.
Drive Production Reliability: Establish patterns for graceful degradation, fault tolerance, and cost optimization. Operate Omnitag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence.
Mentor and Collaborate: Work closely with ML scientists, data engineers, and autonomy teams to translate research advances into scalable engineering solutions. Guide junior engineers in best practices for model training, evaluation, and deployment.

What We're Looking For:

BS in Computer Science, Machine Learning, or related field, or equivalent professional experience.
6+ years of hands-on experience in machine learning engineering, with a focus on model post training, optimization, and deployment.
Strong experience with model distillation or teacher-student training - practical knowledge of loss functions, training strategies, and evaluation of compressed models.
Proven experience with reinforcement learning in production or research settings: policy optimization, reward design, simulation environments, and RL-based reasoning.
Expert-level proficiency in Python and ML frameworks (PyTorch, TensorFlow, or JAX).
Strong software engineering fundamentals: testing, CI/CD, containerization, and system design.
Experience deploying ML models in cloud environments (AWS, GCP, or Azure) and optimizing for inference.
Demonstrated ability to ship production-grade ML systems and mentor team members.
Demonstrated track record of shipping robust, well-tested, production-grade systems and mentoring junior engineers

Bonus Points (Nice-to-Haves):

MS/PhD in Computer Science, Machine Learning, or related field.
Experience with agentic systems, autonomous reasoning, chain-of-thought models, or LLM-based planning.
Background in autonomous driving, robotics, or real-time decision-making systems.
Familiarity with multimodal learning, sensor fusion, or embodied AI.
Experience building active learning loops, using the model to find the data that breaks the model.
Experience with ML-based data mining, active learning, or contrastive learning.
Knowledge of model serving tools (TF Serving, Triton, TorchServe) and MLOps platforms.
Publications or open-source contributions in RL, distillation, or efficient ML.

We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.

The salary range for this role is an estimate based on a wide range of compensation factors including but not limited to specific skills, experience and expertise, role location, certifications, licenses, and business needs. The estimated compensation range listed in this job posting reflects base salary only. This role may include additional forms of compensation such as a bonus or company equity. The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process.

Candidates for certain positions are eligible to participate in Motional's benefits program. Motional's benefits include but are not limited to medical, dental, vision, 401k with a company match, health saving accounts, life insurance, pet insurance, and more.

Salary Range

$172,000—$229,000 USD

Motional is a driverless technology company making autonomous vehicles a safe, reliable, and accessible reality. We're driven by something more.

Our journey is always people first.

We aren't just developing driverless cars; we're creating safer roadways, more equitable transportation options, and making our communities better places to live, work, and connect. Our team is made up of engineers, researchers, innovators, dreamers and doers, who are creating a technology with the potential to transform the way we move.

Higher purpose, greater impact.

We're creating first-of-its-kind technology that will transform transportation. To do so successfully, we must design for everyone in our cities and on our roads. We believe in building a great place to work through a progressive, global culture that is diverse, inclusive, and ensures people feel valued at every level of the organization. Diversity helps us to see the world differently; it's not only good for our business, it's the right thing to do.

Scale up, not starting up.

Our team is behind some of the industry's largest leaps forward, including the first fully-autonomous cross-country drive in the U.S, the launch of the world's first robotaxi pilot, and operation of the world's longest-standing public robotaxi fleet. We're driven to scale; we're moving towards commercialization of our technology, and we need team members who are ready to embrace change and challenges.

Formed as a joint venture between Hyundai Motor Group and Aptiv, Motional is fundamentally changing how people move through their lives. Headquartered in Boston, Motional has operations in the U.S and Asia. For more information, visit www.Motional.com and follow us on Twitter, LinkedIn, Instagram and YouTube.

Motional AD Inc. is an EOE. We celebrate diversity and are committed to creating an inclusive environment for all employees. To comply with Federal Law, we participate in E-Verify. All newly-hired employees are queried through this electronic system established by the DHS and the SSA to verify their identity and employment eligibility.

About Motional

Sourced by ZipRecruiter

Industry

Motor vehicle manufacturing

Company size

501 - 1,000 Employees

Headquarters location

Boston, MA, US

Year founded

2020

Website

motional.com

Social media

View All Motional Jobs

Remote Reinforcement Learning Intern Jobs (NOW HIRING)

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

ML PhD Intern - LLMs & Generative AI

ML PhD Intern - LLMs & Generative AI

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer

Senior Machine Learning Engineer

Machine Learning Engineer Intern (TikTok-Recommendation)

Machine Learning Engineer Intern (TikTok-Recommendation)

Go-to-Market Engineer

Go-to-Market Engineer

Machine Learning Expert - Fully Remote | Upto $90/hr

Machine Learning Expert - Fully Remote | Upto $90/hr

Go-to-Market Engineer

Go-to-Market Engineer

Go-to-Market Engineer

Go-to-Market Engineer

Founding Go-To-Market Engineer

Founding Go-To-Market Engineer

Founding Go-To-Market Engineer

Founding Go-To-Market Engineer

AI / Machine Learning Engineer Sr Staff (Signal Processing)

AI / Machine Learning Engineer Sr Staff (Signal Processing)

Senior Machine Learning Engineer

Senior Machine Learning Engineer

Physical AI Engineer - Simulation & Synthetic Data

Physical AI Engineer - Simulation & Synthetic Data

Graduate (3-12 month) Intern - Artificial Intelligence for Power System Operations

Graduate (3-12 month) Intern - Artificial Intelligence for Power System Operations

Remote Reinforcement Learning Intern information

See salary details

How much do remote reinforcement learning intern jobs pay per hour?

What does a Remote Reinforcement Learning Intern do?

What are some common challenges faced by remote reinforcement learning interns, and how can they be overcome?

What are the key skills and qualifications needed to thrive as a Remote Reinforcement Learning Intern, and why are they important?

Senior Machine Learning Engineer, Data Mining

Share this job

Job description

About Motional

Industry

Company size

Headquarters location

Year founded

Website

Social media

Share this job