Remote Reinforcement Learning Intern Jobs (NOW HIRING)

Reinforcement Learning for Data Discovery: Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...

Senior Machine Learning Engineer

$125K - $165K/yr

Experience with Reinforcement Learning * Experience with Google Cloud and BigQuery Our Environment Keebo is a fully remote, global team with team members currently in the US, EU, and Canada. What we ...

We are building a self-healing ecosystem where Multi-Agent Systems and Reinforcement Learning (RL ... Employee divides their time between in-office and remote work. Access to an office location is ...

Senior Machine Learning Engineer

Seattle, WA · On-site +1

$186K - $300K/yr

We are building a self-healing ecosystem where Multi-Agent Systems and Reinforcement Learning (RL ... Employee divides their time between in-office and remote work. Access to an office location is ...

They'll be developing perception and language understanding, deep reasoning, and reinforcement ... This role has been categorized as a Remote position. "Remote" employees do not have a permanent ...

... optimization, reinforcement learning, and heuristic approaches • Map system complexity and ... enable remote mission operations and dramatically reduce planning cycle times Qualifications

Remote Reinforcement Learning Intern information

Hourly pay scale: $8 · $17 · $24

How much do remote reinforcement learning intern jobs pay per hour?

As of Jun 29, 2026, the average hourly pay for a remote reinforcement learning intern in the United States is $17.04, according to ZipRecruiter salary data. Most workers in this role earn between $14.42 and $19.23 per hour, depending on experience, location, and employer.

What does a Remote Reinforcement Learning Intern do?

A Remote Reinforcement Learning Intern assists with research and development projects that focus on reinforcement learning, a type of machine learning where agents learn to make decisions by trial and error. Their tasks often include implementing algorithms, running experiments, analyzing results, and contributing to academic papers or practical applications. Working remotely, they collaborate with teams using online tools and communicate progress regularly. The role is ideal for students or recent graduates who want to gain hands-on experience in artificial intelligence and machine learning.

What are some common challenges faced by remote reinforcement learning interns, and how can they be overcome?

Remote reinforcement learning interns often encounter challenges related to communication and collaboration, especially when working with distributed teams. It can also be difficult to access computational resources or receive timely feedback on experiments. To overcome these challenges, it's important to proactively schedule regular check-ins with mentors, utilize collaborative tools (such as Slack or GitHub), and ensure a reliable internet connection. Additionally, keeping detailed documentation and being transparent about progress can help facilitate smoother teamwork and problem-solving.

What are the key skills and qualifications needed to thrive as a Remote Reinforcement Learning Intern, and why are they important?

To thrive as a Remote Reinforcement Learning Intern, you need a strong background in mathematics, programming (especially Python), and foundational knowledge of machine learning concepts, typically demonstrated through coursework or relevant projects. Familiarity with machine learning frameworks and reinforcement learning toolkits (such as TensorFlow, PyTorch, or OpenAI Gym), version control systems like Git, and possibly cloud computing platforms is highly valuable. Excellent problem-solving abilities, self-motivation, and effective remote communication skills help interns excel in independent and collaborative tasks. These skills are essential for contributing to innovative research and development projects while working efficiently in a distributed team environment.
More about Remote Reinforcement Learning Intern jobs
Infographic showing various Remote Reinforcement Learning Intern job openings in the United States as of June 2026, with employment types broken down into 25% Full Time, 25% Temporary, and 50% Contract. Highlights a 92% Physical, 1% Hybrid, and 7% Remote job distribution, with an average salary of $35,436 per year, or $17 per hour.
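The annual and hourly figures quoted on this page can be related with simple arithmetic. The sketch below assumes a full-time year of 2,080 paid hours (40 hours × 52 weeks). That convention is an assumption here, since the page does not state how the conversion was made, and the function names are illustrative only.

```python
# Assumption: a full-time year is 40 hours/week * 52 weeks = 2,080 paid hours.
# The page does not state the convention behind its own figures.
HOURS_PER_YEAR = 40 * 52


def annual_to_hourly(annual_salary, hours_per_year=HOURS_PER_YEAR):
    """Convert an annual salary to an hourly rate."""
    return annual_salary / hours_per_year


def hourly_to_annual(hourly_rate, hours_per_year=HOURS_PER_YEAR):
    """Convert an hourly rate to an annual salary."""
    return hourly_rate * hours_per_year


# Figures quoted on this page: $35,436 per year and $17.04 per hour.
print(round(annual_to_hourly(35436), 2))
print(round(hourly_to_annual(17.04), 2))
```

Under this assumption, the quoted annual salary of $35,436 works out to roughly the $17.04 hourly average cited above; an internship with fewer paid hours would yield a lower annual total at the same hourly rate.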
Senior Machine Learning Engineer, Data Mining

Motional

Boston, MA • On-site, Remote

$133K - $175K/yr

Full-time

Medical, Dental, Vision, Life, Retirement

Posted 18 days ago


Job description

Mission Summary:

At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. Omnitag, our ML-powered multimodal data mining framework, is the engine that powers this discovery.

As a Senior Machine Learning Engineer on the Data Mining team, your mission is to build the "Brain" of this engine: designing massive multimodal Teacher models that understand the world, and distilling them into hyper-efficient Student models that can scour exabytes of data in near real-time. You will work at the intersection of large-scale representation learning, retrieval optimization, and reasoning systems. Your work will directly influence how we compress knowledge into efficient encoders for fast search, and how we apply reinforcement learning to optimize data discovery workflows and intelligent querying. By building smarter mining tools, you will accelerate the entire model improvement lifecycle for teams working on post-training analysis, error diagnosis, and dataset curation.

What You'll Do:

  • Architect and Train Distilled Models: Design and implement teacher-student model frameworks for multimodal sensor data. Develop training pipelines for knowledge distillation. Ensure student models maintain high accuracy while drastically reducing inference latency and memory footprint.
  • Reinforcement Learning for Data Discovery: Build RL-based policy learning and reasoning systems for autonomous driving applications. Implement and scale RL training workflows (e.g., PPO, DQN, actor-critic methods) for simulation and real-world interaction. Explore reward shaping, environment modeling, and multi-agent RL where applicable.
  • Optimize Model Deployment for Real-Time Inference: Collaborate with backend engineers to deploy distilled and RL models into production. Optimize for latency, throughput, and hardware efficiency across GPU/CPU clusters. Implement model versioning, A/B testing, and monitoring for performance regressions.
  • Research and Integrate Agentic Systems: Explore and prototype agentic workflows for autonomous reasoning, chain-of-thought prompting, and goal-directed behavior. Integrate such systems into our broader autonomy stack as experimental or production components.
  • Drive Production Reliability: Establish patterns for graceful degradation, fault tolerance, and cost optimization. Operate Omnitag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence.
  • Mentor and Collaborate: Work closely with ML scientists, data engineers, and autonomy teams to translate research advances into scalable engineering solutions. Guide junior engineers in best practices for model training, evaluation, and deployment.
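For readers unfamiliar with the teacher-student training mentioned in the first responsibility above, here is a minimal sketch of one common distillation objective: the KL divergence between temperature-softened teacher and student output distributions, scaled by the squared temperature. It is a generic textbook formulation written in plain Python, not Motional's or Omnitag's implementation, and the function names and default temperature are illustrative assumptions.

```python
import math


def log_softmax(logits, temperature):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_z = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_z for s in scaled]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Hypothetical helper for illustration; a real pipeline would usually
    combine this term with a standard loss on ground-truth labels.
    """
    log_p = log_softmax(teacher_logits, temperature)  # teacher
    log_q = log_softmax(student_logits, temperature)  # student
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return temperature ** 2 * kl


# A student that matches the teacher exactly incurs zero loss;
# a student with uninformative (uniform) logits incurs a positive loss.
print(distillation_loss([2.0, 1.0, 0.1], [2.0, 1.0, 0.1]))
print(distillation_loss([0.5, 0.5, 0.5], [2.0, 1.0, 0.1]))
```

A higher temperature flattens both distributions, so the student also learns from the teacher's relative ranking of unlikely classes rather than only its top prediction.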

What We're Looking For:

  • BS in Computer Science, Machine Learning, or related field, or equivalent professional experience.
  • 6+ years of hands-on experience in machine learning engineering, with a focus on model post-training, optimization, and deployment.
  • Strong experience with model distillation or teacher-student training - practical knowledge of loss functions, training strategies, and evaluation of compressed models.
  • Proven experience with reinforcement learning in production or research settings: policy optimization, reward design, simulation environments, and RL-based reasoning.
  • Expert-level proficiency in Python and ML frameworks (PyTorch, TensorFlow, or JAX).
  • Strong software engineering fundamentals: testing, CI/CD, containerization, and system design.
  • Experience deploying ML models in cloud environments (AWS, GCP, or Azure) and optimizing for inference.
  • Demonstrated track record of shipping robust, well-tested, production-grade ML systems and mentoring junior engineers.

Bonus Points (Nice-to-Haves):

  • MS/PhD in Computer Science, Machine Learning, or related field.
  • Experience with agentic systems, autonomous reasoning, chain-of-thought models, or LLM-based planning.
  • Background in autonomous driving, robotics, or real-time decision-making systems.
  • Familiarity with multimodal learning, sensor fusion, or embodied AI.
  • Experience building active learning loops, using the model to find the data that breaks the model.
  • Experience with ML-based data mining, active learning, or contrastive learning.
  • Knowledge of model serving tools (TF Serving, Triton, TorchServe) and MLOps platforms.
  • Publications or open-source contributions in RL, distillation, or efficient ML.

We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.

The salary range for this role is an estimate based on a wide range of compensation factors including but not limited to specific skills, experience and expertise, role location, certifications, licenses, and business needs. The estimated compensation range listed in this job posting reflects base salary only. This role may include additional forms of compensation such as a bonus or company equity. The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process.

Candidates for certain positions are eligible to participate in Motional's benefits program. Motional's benefits include but are not limited to medical, dental, vision, 401k with a company match, health savings accounts, life insurance, pet insurance, and more.

Salary Range
$172,000—$229,000 USD

Motional is a driverless technology company making autonomous vehicles a safe, reliable, and accessible reality. We're driven by something more.

Our journey is always people first.

We aren't just developing driverless cars; we're creating safer roadways, more equitable transportation options, and making our communities better places to live, work, and connect. Our team is made up of engineers, researchers, innovators, dreamers and doers, who are creating a technology with the potential to transform the way we move.

Higher purpose, greater impact.

We're creating first-of-its-kind technology that will transform transportation. To do so successfully, we must design for everyone in our cities and on our roads. We believe in building a great place to work through a progressive, global culture that is diverse, inclusive, and ensures people feel valued at every level of the organization. Diversity helps us to see the world differently; it's not only good for our business, it's the right thing to do.

Scale up, not starting up.

Our team is behind some of the industry's largest leaps forward, including the first fully autonomous cross-country drive in the U.S., the launch of the world's first robotaxi pilot, and operation of the world's longest-standing public robotaxi fleet. We're driven to scale; we're moving towards commercialization of our technology, and we need team members who are ready to embrace change and challenges.

Formed as a joint venture between Hyundai Motor Group and Aptiv, Motional is fundamentally changing how people move through their lives. Headquartered in Boston, Motional has operations in the U.S. and Asia. For more information, visit www.Motional.com and follow us on Twitter, LinkedIn, Instagram and YouTube.

Motional AD Inc. is an EOE. We celebrate diversity and are committed to creating an inclusive environment for all employees. To comply with Federal Law, we participate in E-Verify. All newly-hired employees are queried through this electronic system established by the DHS and the SSA to verify their identity and employment eligibility.