Reinforcement Learning With Human Feedback Jobs (NOW HIRING)

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Reinforcement Learning Engineer

New York, NY · On-site

$87K - $118K/yr

Recruiter / HR Call: Initial screening to discuss professional background, risk management ... A strategic discussion with leadership focusing on mission alignment, role expectations, and ...

Quick apply

Reinforcement Learning Engineer

New York, NY · On-site

$87K - $118K/yr

Recruiter / HR Call: Initial screening to discuss professional background, risk management ... A strategic discussion with leadership focusing on mission alignment, role expectations, and ...

Persona AI

Reinforcement Learning Engineering Intern

Pensacola, FL · On-site

$14.25 - $19/hr

Reinforcement Learning Engineering Intern Location: Downtown Pensacola, FL Type: Full-time ... We are looking for students with an excitement for learning, technical excellence, and creative ...

Persona AI

Reinforcement Learning Engineering Intern

Pensacola, FL · On-site

$14.25 - $19/hr

Member of Technical Staff - Mechanistic Interpretability

San Francisco, CA · On-site

$300K - $500K/yr

... for reinforcement learning. * Compare interpretability-derived rewards against human feedback ... Design metrics and baselines for reward quality, including alignment with intended behavior ...

Member of Technical Staff - Mechanistic Interpretability

San Francisco, CA · On-site

$300K - $500K/yr

Member of Technical Staff - Mechanistic Interpretability

$300K - $500K/yr

Member of Technical Staff - Mechanistic Interpretability

$300K - $500K/yr

Apptronik

Senior Reinforcement Learning Engineer

Austin, TX · On-site

$103K - $142K/yr

Apptronik is a human-centered robotics company developing AI-powered robots to support humanity in ... Our flagship humanoid robot, Apollo, is built to collaborate thoughtfully with people, starting ...

Apptronik

Senior Reinforcement Learning Engineer

Austin, TX · On-site

$103K - $142K/yr

Machine Learning Engineer, Next-Generation Recommendation Systems

Bellevue, WA · On-site +1

$127K - $191K/yr

The frontier has shifted - large language models, reinforcement learning from human feedback, and ... Partner with engineering to bring research ideas into production, working across the full pipeline ...

Machine Learning Engineer, Next-Generation Recommendation Systems

Bellevue, WA · On-site +1

$127K - $191K/yr

Machine Learning Engineer, Next-Generation Recommendation Systems

New York, NY · On-site +1

$127K - $191K/yr

Machine Learning Engineer, Next-Generation Recommendation Systems

New York, NY · On-site +1

$127K - $191K/yr

Machine Learning Engineer, Next-Generation Recommendation Systems

Mountain View, CA · On-site +1

$127K - $191K/yr

Machine Learning Engineer, Next-Generation Recommendation Systems

Mountain View, CA · On-site +1

$127K - $191K/yr

Reinforcement Learning Engineer

New York, NY · On-site

$87K - $118K/yr

Recruiter / HR Call: Initial screening to discuss professional background, risk management ... A strategic discussion with leadership focusing on mission alignment, role expectations, and ...

Reinforcement Learning Engineer

New York, NY · On-site

$87K - $118K/yr

Recruiter / HR Call: Initial screening to discuss professional background, risk management ... A strategic discussion with leadership focusing on mission alignment, role expectations, and ...

Human-Robot Interaction Applied Scientist , Fauna

New York, NY · On-site

... reinforcement learning from human feedback, or other advanced techniques to achieve fluid, engaging ... with a user - Integrate perceptual sensor streams including gaze, facial expression, gesture ...

Human-Robot Interaction Applied Scientist , Fauna

New York, NY · On-site

Human-Robot Interaction Applied Scientist , Fauna

Human-Robot Interaction Applied Scientist , Fauna

Hammerhead AI

Reinforcement Learning Engineer

Redwood City, CA · On-site

About Hammerhead We're unleashing AI with intelligent orchestration while addressing one of the ... Reporting to the Head of AI / Reinforcement Learning Engineering, you will design, train, and ...

Quick apply

Hammerhead AI

Reinforcement Learning Engineer

Redwood City, CA · On-site

Fireworks AI

Applied Machine Learning Engineer

Collaborate directly with the GTM team (Account Executives and Solutions Architects) to ensure ... fine-tuning (SFT) and reinforcement learning from human feedback (RLHF or RFT). * Solid ...

Fireworks AI

Applied Machine Learning Engineer

ExxonMobil

Data Scientist, Reinforcement Learning

Spring, TX · On-site

Collaborate with engineers, scientists, and business stakeholders to turn complex operational and ... Advance the organization's capabilities in reinforcement learning, decision optimization, and ...

ExxonMobil

Data Scientist, Reinforcement Learning

Spring, TX · On-site

Boston Dynamics

Research Scientist, Reinforcement Learning - Atlas

Waltham, MA · On-site

... with a world-class team of roboticists. Responsibilities : • Design, implement, and train ... for human simulation. It is a sub-organization of Hyundai Motor Company. Founded in 1992, the ...

Boston Dynamics

Research Scientist, Reinforcement Learning - Atlas

Waltham, MA · On-site

Anthropic

Research Engineer, Machine Learning (Reinforcement Learning)

$241K/yr

About the teams Our Reinforcement Learning teams lead Anthropic's reinforcement learning research ... We've contributed to all Claude models, with significant impacts on the autonomy and coding ...

Anthropic

Research Engineer, Machine Learning (Reinforcement Learning)

$241K/yr

Helix AI Engineer, Robot Learning

San Jose, CA · On-site

$200K - $400K/yr

The goal of the company is to ship humanoid robots with human level intelligence. Its robots are ... Apply and extend techniques including behavior cloning, reinforcement learning, and VLA reasoning

Helix AI Engineer, Robot Learning

San Jose, CA · On-site

$200K - $400K/yr

Helix AI Engineer, Robot Learning

San Jose, CA · On-site

$200K - $400K/yr

Reinforcement Learning With Human Feedback Jobs

Helix AI Engineer, Robot Learning

San Jose, CA · On-site

$200K - $400K/yr

Showing results 1-20

Reinforcement Learning With Human Feedback information

See salary details

$26

$40

$69

How much do reinforcement learning with human feedback jobs pay per hour?

As of Jul 15, 2026, the average hourly pay for reinforcement learning with human feedback in the United States is $40.70, according to ZipRecruiter salary data. Most workers in this role earn between $29.57 and $52.88 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Reinforcement Learning with Human Feedback (RLHF) Engineer, and why are they important?

To excel as a Reinforcement Learning with Human Feedback (RLHF) Engineer, you need a strong background in machine learning, reinforcement learning theory, statistics, and typically an advanced degree in computer science or a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), RL libraries (like Ray RLlib), and experience with data collection and annotation systems are essential. Excellent problem-solving abilities, communication skills, and teamwork help you collaborate with researchers, data annotators, and other engineers. These skills enable you to design and implement RLHF systems that are robust, scalable, and aligned with human values.

What is the difference between Reinforcement Learning With Human Feedback vs Reinforcement Learning Engineer?

Aspect	Reinforcement Learning With Human Feedback	Reinforcement Learning Engineer
Credentials	Typically requires knowledge of machine learning, AI, and data analysis	Requires similar credentials in machine learning, programming, and AI
Work Environment	Research labs, AI development teams, tech companies	Development teams, research labs, tech firms
Industry Usage	Used in AI training, human-in-the-loop systems, and model refinement	Designing, implementing, and optimizing reinforcement learning algorithms

Reinforcement Learning With Human Feedback focuses on improving AI models through human input, while Reinforcement Learning Engineers develop and deploy these algorithms. Both roles require strong machine learning skills and often work in similar environments, but their core responsibilities differ in application and focus.

What is Reinforcement Learning with Human Feedback?

Reinforcement Learning with Human Feedback (RLHF) is a machine learning technique where AI agents are trained not only through automated reward signals but also by incorporating feedback from humans. This approach helps align the agent’s behavior with human preferences, values, or safety requirements by allowing humans to guide or correct the learning process. RLHF is commonly used in developing advanced AI systems, such as language models, to ensure their outputs are helpful, safe, and aligned with user expectations. The process often involves human evaluators ranking or scoring the AI's responses, which are then used to fine-tune the model’s behavior.

What are the typical collaborations involved for a Reinforcement Learning with Human Feedback (RLHF) specialist within a machine learning team?

As an RLHF specialist, you often work closely with data scientists, machine learning engineers, and domain experts to design effective feedback mechanisms and reward models. Collaboration with annotation teams or subject matter experts is common, as high-quality human feedback is crucial for training robust RLHF models. You may also partner with product managers and UX researchers to ensure that the models align with user needs and ethical considerations. Regular cross-functional meetings and code reviews help maintain alignment and foster innovation across teams.

More about Reinforcement Learning With Human Feedback jobs

The 10 Top Types Of Reinforcement Learning With Human Feedback Jobs

What cities are hiring for Reinforcement Learning With Human Feedback jobs? Cities with the most Reinforcement Learning With Human Feedback job openings:

What states have the most Reinforcement Learning With Human Feedback jobs? States with the most job openings for Reinforcement Learning With Human Feedback jobs include:

What job categories do people searching Reinforcement Learning With Human Feedback jobs look for? The top searched job categories for Reinforcement Learning With Human Feedback jobs are:

Reinforcement Learning With Human Feedback jobs near you

Infographic showing various Reinforcement Learning With Human Feedback job openings in the United States as of July 2026, with employment types broken down into 82% Full Time, 17% Part Time, and 1% Contract. Highlights an 90% Physical, 1% Hybrid, and 9% Remote job distribution, with an average salary of $84,648 per year, or $40.7 per hour.

Senior Machine Learning Engineer, Reinforcement Learning - Egofold