2

Full Time Reinforcement Learning Jobs (NOW HIRING)

Reinforcement Learning Engineer

New York, NY · On-site

$87K - $118K/yr

Reinforcement Learning (RL) Engineer Location ... New York (Office) On-site | Full-time Compensation: Competitive Our client is an elite development ...

Apply Early

Senior Machine Learning Engineer

$125K - $165K/yr

Familiarity with reinforcement learning or bandit models Nice to Have * Experience with Java and ... For full-time positions: * Competitive salary packages * Equity * Home office stipend

next page

Showing results 1-20

Full Time Reinforcement Learning information

See salary details

$21K

$61.7K

$114.5K

How much do full time reinforcement learning jobs pay per year?

As of Jul 3, 2026, the average yearly pay for full time reinforcement learning in the United States is $61,692.00, according to ZipRecruiter salary data. Most workers in this role earn between $41,000.00 and $72,000.00 per year, depending on experience, location, and employer.

What is the difference between Full Time Reinforcement Learning vs Data Scientist?

AspectFull Time Reinforcement LearningData Scientist
Required CredentialsAdvanced degree in CS, ML, or related field; experience with RL frameworksDegree in CS, Statistics, or related; strong programming and analytical skills
Work EnvironmentResearch labs, AI companies, tech firms focusing on RL projectsBusiness, tech companies, consulting firms analyzing data for insights
Industry UsageAI research, robotics, gaming, autonomous systemsFinance, healthcare, marketing, e-commerce, and more

Full Time Reinforcement Learning specialists focus on developing RL algorithms and models, often in research or AI product development. Data Scientists analyze data to extract insights and support decision-making across various industries. While both roles require strong technical skills, RL roles are more specialized in AI research and development, whereas Data Scientists have broader applications in data analysis and business strategy.

More about Full Time Reinforcement Learning jobs
What cities are hiring for Full Time Reinforcement Learning jobs? Cities with the most Full Time Reinforcement Learning job openings:
What are the most commonly searched types of Reinforcement Learning jobs? The most popular types of Reinforcement Learning jobs are:
Infographic showing various Full Time Reinforcement Learning job openings in the United States as of June 2026, with employment types broken down into 96% Full Time, and 4% Temporary. Highlights an 92% Physical, 1% Hybrid, and 7% Remote job distribution, with an average salary of $61,692 per year, or $29.7 per hour.

Reinforcement Learning Engineering Intern

Persona AI

Pensacola, FL • On-site

$14.25 - $19/hr

Full-time, Internship

Posted 13 days ago


Job description

Reinforcement Learning Engineering Intern
Location: Downtown Pensacola, FL
Type: Full-time Internship, 40 hours/week
About the Internship
The Reinforcement Learning Engineering Internship is an opportunity for Bachelors and Masters candidate students to join and contribute to the Persona team as we develop our industrial humanoids. Our objective is to provide each intern with a positive learning environment, hands-on experience with humanoids, and ownership over their own project direction. We are looking for students with an excitement for learning, technical excellence, and creative problem-solving skills.
Each intern will have a designated mentor to provide guidance and assistance in developing and making progress towards a target goal. We have a strong bias for projects that lead to software, controls, or policies deployed on our hardware and extending the capabilities of our systems. Projects will be jointly planned by the intern and their mentor to build on the intern's background, extend their experience to new areas of interest, and fit into the broader goals of the Persona reinforcement learning team.
Role Description
For this role, the specific tasks will be defined prior to the start date by the mentor and the intern based on their experience, proficiency, and personal interests. The scope may also be adjusted to fit the project within the intern's time-frame. We encourage interns to share their interests even if they may be entirely different from their technical background. Some example general tasks that may be a part of any project are described below:
  • Develop new simulation training environments
  • Design new behaviors or extend capabilities for the Persona robots
  • Deploy to hardware, log data, and analyze results
  • Create or implement new algorithms for modeling, training, sensing, or deployment
  • Characterize hardware sensors, actuators, and general robot parameters
Qualifications
  • Current Undergraduate or Masters student
  • Software proficiency in Python, C/C++, Java, or Rust
  • Experience with basic machine learning concepts
Bonus Experience
  • Worked with Pytorch or similar
  • Physics simulator experience such as IsaacLab/IsaacSim, Mujoco, or similar
  • Deployed controls software to robot hardware
  • Trained policies with reinforcement learning
  • Worked with motion diffusion models or VLAs
  • Experience with character animation
  • Worked on vision or localization
Open Technical Areas
  • Perception
  • Locomotion
  • Manipulation
  • Motion Planning
  • Imitation Learning
  • Motion Retargeting
  • Sim-to-Real Modeling
Application Timeline
We are accepting applications on a rolling basis. We will interview and make offers for upcoming intern cohorts until we fill all openings. We will close the application process for an upcoming cohort approximately 3 months before the start of the cohort and recommend applying approximately 6 months in advance.
Note: we are no longer accepting applications for Fall 2026.
The interview process we are currently following involves two interviews. First, a phone pre-screen with a member of our staff. Second, a presentation and discussion interview with one to two of our engineers. The presentation is meant to be informal and give an opportunity for you to share your background, experiences, and interests. We like the chance to see pictures and videos of your projects and hear what part of robotics excites you most! We will also give an overview of the work we are doing here at Persona AI and leave time for you to ask us questions.
We aim to get back to you as soon as we can but it may take a few weeks, especially in between cohorts. Please know we are working on it and will get back to every application!