Helper Reinforcement Learning Jobs (NOW HIRING)

Research Engineer, Machine Learning (Reinforcement Learning)

$241K/yr

Help scale our systems to handle increasingly complex research workflows. * Design, implement, and test novel training environments, evaluations, and methodologies for reinforcement learning agents ...

Anthropic

Research Engineer, Machine Learning (Reinforcement Learning)

San Francisco, CA · On-site

$241K/yr

Anthropic

Research Engineer, Machine Learning (Reinforcement Learning)

San Francisco, CA · On-site

$500K - $850K/yr

Anthropic

Research Engineer, Machine Learning (Reinforcement Learning)

San Francisco, CA · On-site

$500K - $850K/yr

Profluent

Machine Learning Scientist, Reinforcement Learning

Emeryville, CA · On-site +1

$200K - $330K/yr

... to help shape the scientific and strategic vision of the company Qualifications * PhD (or ... reinforcement learning techniques * Publications at major machine learning conferences (NeurIPS ...

Profluent

Machine Learning Scientist, Reinforcement Learning

Emeryville, CA · On-site +1

$200K - $330K/yr

Bugcrowd

Reinforcement Learning Engineer (Cybersecurity)

$176K - $242K/yr

You will help create the training environments that teach AI systems how to hack and defend ... Reinforcement learning workflows * Building clean, reproducible Linux ML environments (containers ...

Bugcrowd

Reinforcement Learning Engineer (Cybersecurity)

$176K - $242K/yr

Nvidia

Developer Advocate - Reinforcement Learning

Santa Clara, CA · On-site

... helping developers understand our demos and encouraging them to build their own applications ... Ability to design and implement Reinforcement learning and post-training pipelines for LLM to ...

Nvidia

Developer Advocate - Reinforcement Learning

Santa Clara, CA · On-site

Anthropic

Research Engineer, Chip Design RL (Reinforcement Learning)

San Francisco, CA · On-site

$500K - $850K/yr

About the RL teams Our Reinforcement Learning teams lead Anthropic's reinforcement learning ... help with this. We encourage you to apply even if you do not believe you meet every single ...

New

Anthropic

Research Engineer, Chip Design RL (Reinforcement Learning)

San Francisco, CA · On-site

$500K - $850K/yr

New

Amazon

Senior PMT ES - Reinforcement Learning, SageMaker AI

Bellevue, WA · On-site

$142K - $188K/yr

We need a product leader who can make reinforcement learning dramatically more accessible, helping a broad range of customers use RL to build better models and get to production faster. Amazon ...

Amazon

Senior PMT ES - Reinforcement Learning, SageMaker AI

Bellevue, WA · On-site

$142K - $188K/yr

Nvidia

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Santa Clara, CA · On-site

$17.50 - $23.50/hr

... reinforcement learning ... Our applied deep learning research team at NVIDIA has helped pioneer projects such as Megatron, MT ...

Nvidia

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Santa Clara, CA · On-site

$17.50 - $23.50/hr

... reinforcement learning ... Our applied deep learning research team at NVIDIA has helped pioneer projects such as Megatron, MT ...

Nvidia Corporation

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Santa Clara, CA · On-site

$17.50 - $23.50/hr

... reinforcement learning ... Our applied deep learning research team at NVIDIA has helped pioneer projects such as Megatron, MT ...

Nvidia Corporation

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Santa Clara, CA · On-site

$17.50 - $23.50/hr

... reinforcement learning ... Our applied deep learning research team at NVIDIA has helped pioneer projects such as Megatron, MT ...

Futran Tech Solutions Pvt. Ltd.

Sr. ML Scientist (Pricing & Reinforcement Learning)

Plano, TX · On-site

... help drive superior competitive differentiation, customer experiences, and business outcomes in a ... Senior ML Scientist (Pricing Reinforcement Learning) Work Location: Plano TX Work Mode: Remote Role ...

Futran Tech Solutions Pvt. Ltd.

Sr. ML Scientist (Pricing & Reinforcement Learning)

Plano, TX · On-site

Centific

Research Intern - Applied Reinforcement Learning

$35 - $45/hr

We aim to help these organizations unlock significant business value by deploying GenAI at scale ... About Job PhD Research Intern - Applied Reinforcement Learning Centific AI Research Role Summary ...

Centific

Research Intern - Applied Reinforcement Learning

$35 - $45/hr

Amazon

Senior PMT ES - Reinforcement Learning, SageMaker AI

Bellevue, WA · On-site

$142K - $188K/yr

Amazon

Senior PMT ES - Reinforcement Learning, SageMaker AI

Bellevue, WA · On-site

$142K - $188K/yr

Meta

AI Research Scientist, Reinforcement Learning

New York, NY · On-site

$122K - $181K/yr

... reinforcement learning • Explore and develop novel LLM post-training recipes using 3D data • ... People who choose to build their careers by building with us at Meta help shape a future that will ...

Meta

AI Research Scientist, Reinforcement Learning

New York, NY · On-site

$122K - $181K/yr

Snail Games USA

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Senior Machine Learning Engineer, Reinforcement Learning - Egofold About Snail Games USA Snail ... Work closely with engineers and other partners to help integrate successful ML work into usable ...

Snail Games USA

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Meta

AI Research Scientist, Reinforcement Learning

New York, NY

$122K/yr

AI Research Scientist, Reinforcement Learning Responsibilities: * Explore and develop novel post ... Meta builds technologies that help people connect, find communities, and grow businesses. When ...

Meta

AI Research Scientist, Reinforcement Learning

New York, NY

$122K/yr

Snail Games USA

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Quick apply

Snail Games USA

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Anthropic

Research Engineer, Code RL (Reinforcement Learning)

San Francisco, CA · On-site

$500K - $850K/yr

About the RL Teams Our Reinforcement Learning teams play a critical role in advancing our AI ... help with this. We encourage you to apply even if you do not believe you meet every single ...

Anthropic

Research Engineer, Code RL (Reinforcement Learning)

San Francisco, CA · On-site

$500K - $850K/yr

About the RL Teams Our Reinforcement Learning teams play a critical role in advancing our AI ... help with this. We encourage you to apply even if you do not believe you meet every single ...

Snail Games USA

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Snail Games USA

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Beverly Hills, CA · On-site +1

$150K - $185K/yr

Siemens Healthineers

AI/ML Scientist - Reinforcement Learning, Simulation & Optimization

Princeton, NJ · On-site +1

... AI Scientist - Reinforcement Learning & Operational Twinning, where you will develop next ... In this role, you will help expand and operationalize Siemens Healthineers' Operational Twinning ...

Siemens Healthineers

AI/ML Scientist - Reinforcement Learning, Simulation & Optimization

Princeton, NJ · On-site +1

Centific

Applied Reinforcement Learning Engineer

We aim to help these organizations unlock significant business value by deploying GenAI at scale ... Applied Reinforcement Learning Engineer Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote ...

Centific

Applied Reinforcement Learning Engineer

Showing results 1-20

Helper Reinforcement Learning Jobs

Helper Reinforcement Learning information

See salary details

$10

$16

$22

How much do helper reinforcement learning jobs pay per hour?

As of Jul 15, 2026, the average hourly pay for helper reinforcement learning in the United States is $16.44, according to ZipRecruiter salary data. Most workers in this role earn between $14.42 and $17.55 per hour, depending on experience, location, and employer.

What is the difference between Helper Reinforcement Learning vs Data Scientist?

Aspect	Helper Reinforcement Learning	Data Scientist
Required Credentials	Degree in Computer Science, AI, or related fields; knowledge of reinforcement learning	Degree in Data Science, Statistics, Computer Science; proficiency in programming and analytics
Work Environment	Research labs, AI development teams, tech companies	Business analytics, research, consulting firms, tech companies
Industry Usage	AI development, machine learning projects	Data analysis, predictive modeling, business insights
Common Search/Comparison	Helper Reinforcement Learning vs Data Scientist

Helper Reinforcement Learning focuses on developing algorithms that enable machines to learn through interactions, often requiring knowledge of reinforcement learning techniques. Data Scientists analyze data to extract insights, build models, and support decision-making. While both roles involve programming and data handling, Helper Reinforcement Learning is more specialized in AI algorithm development, whereas Data Scientists work broadly across data analysis and modeling in various industries.

More about Helper Reinforcement Learning jobs

The 10 Top Types Of Helper Reinforcement Learning Jobs

What cities are hiring for Helper Reinforcement Learning jobs? Cities with the most Helper Reinforcement Learning job openings:

What are the most commonly searched types of Reinforcement Learning jobs? The most popular types of Reinforcement Learning jobs are:

What states have the most Helper Reinforcement Learning jobs? States with the most job openings for Helper Reinforcement Learning jobs include:

What job categories do people searching Helper Reinforcement Learning jobs look for? The top searched job categories for Helper Reinforcement Learning jobs are:

Helper Reinforcement Learning jobs near you

Infographic showing various Helper Reinforcement Learning job openings in the United States as of July 2026, with employment types broken down into 1% Locum Tenens, 78% Full Time, 15% Part Time, 1% Temporary, 4% Contract, and 1% Nights. Highlights an 99% Physical, and 1% Remote job distribution, with an average salary of $34,190 per year, or $16.4 per hour.

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic

San Francisco, CA • On-site

Apply

$241K/yr

Other

Posted 9 days ago

Job description

About the teams

Our Reinforcement Learning teams lead Anthropic's reinforcement learning research and development, playing a critical role in advancing our AI systems. We've contributed to all Claude models, with significant impacts on the autonomy and coding capabilities of Claude Sonnet 4.5 and Opus 4.5. Our work spans several key areas:

Developing systems that enable models to use computers effectively
Advancing code generation through reinforcement learning
Pioneering fundamental RL research for large language models
Building scalable RL infrastructure and training methodologies
Enhancing model reasoning capabilities

We collaborate closely with Anthropic's alignment and frontier red teams to ensure our systems are both capable and safe. We partner with the applied production training team to bring research innovations into deployed models, and are dedicated to implement our research at scale. Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the boundaries of what AI can accomplish.

About the Role

As a Research Engineer within Reinforcement Learning, you will collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models. This role blends research and engineering responsibilities, requiring you to both implement novel approaches and contribute to the research direction. You'll work on fundamental research in reinforcement learning, creating 'agentic' models via tool use for open-ended tasks such as computer use and autonomous software generation, improving reasoning abilities in areas such as mathematics, and developing prototypes for internal use, productivity, and evaluation.

Representative projects:

Architect and optimize core reinforcement learning infrastructure, from clean training abstractions to distributed experiment management across GPU clusters. Help scale our systems to handle increasingly complex research workflows.
Design, implement, and test novel training environments, evaluations, and methodologies for reinforcement learning agents which push the state of the art for the next generation of models.
Drive performance improvements across our stack through profiling, optimization, and benchmarking. Implement efficient caching solutions and debug distributed systems to accelerate both training and evaluation workflows.
Collaborate across research and engineering teams to develop automated testing frameworks, design clean APIs, and build scalable infrastructure that accelerates AI research.

You may be a good fit if you:

Are proficient in Python and async/concurrent programming with frameworks like Trio
Have experience with machine learning frameworks (PyTorch, TensorFlow, JAX)
Have industry experience in machine learning research
Can balance research exploration with engineering implementation
Enjoy pair programming (we love to pair!)
Care about code quality, testing, and performance
Have strong systems design and communication skills
Are passionate about the potential impact of AI and are committed to developing safe and beneficial systems

Strong candidates may have:

Familiarity with LLM architectures and training methodologies
Experience with reinforcement learning techniques and environments
Experience with virtualization and sandboxed code execution environments
Experience with Kubernetes
Experience with distributed systems or high-performance computing
Experience with Rust and/or C++

Strong candidates need not have:

Formal certifications or education credentials
Academic research experience or publication history

Deadline to apply: None. Applications will be reviewed on a rolling basis.

About Anthropic

Sourced by ZipRecruiter

Company size

11 - 50 Employees

Headquarters location

Daly City, CA, US

Year founded

2021

Website

anthropic.com

Social media

View All Anthropic Jobs

Apply

Helper Reinforcement Learning Jobs (NOW HIRING)

Research Engineer, Machine Learning (Reinforcement Learning)

Research Engineer, Machine Learning (Reinforcement Learning)

Research Engineer, Machine Learning (Reinforcement Learning)

Research Engineer, Machine Learning (Reinforcement Learning)

Machine Learning Scientist, Reinforcement Learning

Machine Learning Scientist, Reinforcement Learning

Reinforcement Learning Engineer (Cybersecurity)

Reinforcement Learning Engineer (Cybersecurity)

Developer Advocate - Reinforcement Learning

Developer Advocate - Reinforcement Learning

Research Engineer, Chip Design RL (Reinforcement Learning)

Research Engineer, Chip Design RL (Reinforcement Learning)

Senior PMT ES - Reinforcement Learning, SageMaker AI

Senior PMT ES - Reinforcement Learning, SageMaker AI

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026

Sr. ML Scientist (Pricing & Reinforcement Learning)

Sr. ML Scientist (Pricing & Reinforcement Learning)

Research Intern - Applied Reinforcement Learning

Research Intern - Applied Reinforcement Learning

Senior PMT ES - Reinforcement Learning, SageMaker AI

Senior PMT ES - Reinforcement Learning, SageMaker AI

AI Research Scientist, Reinforcement Learning

AI Research Scientist, Reinforcement Learning

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

AI Research Scientist, Reinforcement Learning

AI Research Scientist, Reinforcement Learning

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Research Engineer, Code RL (Reinforcement Learning)

Research Engineer, Code RL (Reinforcement Learning)

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

Senior Machine Learning Engineer, Reinforcement Learning - Egofold

AI/ML Scientist - Reinforcement Learning, Simulation & Optimization

AI/ML Scientist - Reinforcement Learning, Simulation & Optimization

Applied Reinforcement Learning Engineer

Applied Reinforcement Learning Engineer

Helper Reinforcement Learning information

See salary details

How much do helper reinforcement learning jobs pay per hour?

What is the difference between Helper Reinforcement Learning vs Data Scientist?

Research Engineer, Machine Learning (Reinforcement Learning)

Share this job

Job description

About Anthropic

Company size

Headquarters location

Year founded

Website

Social media

Share this job