Internship Rlhf Jobs (NOW HIRING)

Research Intern, Agent RL Training

$35 - $50/hr

... during your internship What We're Looking For Requirements * Highly motivated and committed ... RLHF, DPO, PPO, GRPO, etc.) * Excellent taste in model behavior: able to reason about what "good ...

NewsBreak

Research Intern, Agent RL Training

Mountain View, CA · On-site

$35 - $50/hr

NewsBreak

Research Intern, Agent RL Training

Mountain View, CA · On-site

$35 - $50/hr

Quick apply

NewsBreak

Research Intern, Agent RL Training

Mountain View, CA · On-site

$35 - $50/hr

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Gatos, CA · On-site

... interns, and fostering academic collaborations. We are seeking an early-career researcher who can ... RLVR, RLHF, offline or online, policy- or value-based), and possibly also including reasoning ...

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Gatos, CA · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Angeles, CA · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Angeles, CA · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

New York, NY

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

New York, NY

Amazon

Senior Software Development Engineer , Stores Foundational AI - Rufus

Palo Alto, CA · On-site

$144K - $189K/yr

Partner closely with applied scientists to translate frontier techniques (e.g., RLHF, agentic ... of non-internship professional software development experience - 5+ years of programming with at ...

Amazon

Senior Software Development Engineer , Stores Foundational AI - Rufus

Palo Alto, CA · On-site

$144K - $189K/yr

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Angeles, CA · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Angeles, CA · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Gatos, CA · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Los Gatos, CA · On-site

Fujitsu

Agentic AI Research Intern

Santa Clara, CA · On-site

$40 - $50/hr

The internship will focus on building intelligent agents, generating high-quality trajectories ... Familiarity with training or adapting LLMs using SFT, RL, DPO/RLHF methods, or trajectory data.

Fujitsu

Agentic AI Research Intern

Santa Clara, CA · On-site

$40 - $50/hr

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

New York, NY · On-site

Netflix

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

New York, NY · On-site

Sr. Machine Learning Engineer, Applied Science

San Francisco, CA · On-site +1

$161K - $332K/yr

... conduct RLHF, targeted fine-tuning, etc. * Publish and publicize your work via conferences, paper submissions, blog posts, etc. * Mentor more junior researchers or research interns within the ...

Sr. Machine Learning Engineer, Applied Science

San Francisco, CA · On-site +1

$161K - $332K/yr

Machine Learning Engineer II, Computer Vision Applied Science

San Francisco, CA · On-site +1

$138K - $285K/yr

Machine Learning Engineer II, Computer Vision Applied Science

San Francisco, CA · On-site +1

$138K - $285K/yr

Nuance Labs

Member of Technical Staff - RL Research (New PhD Grad)

Seattle, WA · On-site

Develop and scale post-training methods such as PPO, GRPO, DPO, rejection sampling, RLHF/RLAIF ... Exposure to RL/post-training pipelines through research, internships, or open-source - with ...

Nuance Labs

Member of Technical Staff - RL Research (New PhD Grad)

Seattle, WA · On-site

Showing results 21-33

Internship Rlhf Jobs

Internship Rlhf information

See salary details

$15

$21

How much do internship rlhf jobs pay per hour?

As of Jul 24, 2026, the average hourly pay for internship rlhf in the United States is $15.54, according to ZipRecruiter salary data. Most workers in this role earn between $12.50 and $17.55 per hour, depending on experience, location, and employer.

What are Internship RLHF positions?

Internship RLHF positions refer to internships focused on Reinforcement Learning from Human Feedback (RLHF), a cutting-edge area in artificial intelligence research. Interns in RLHF roles typically work on projects that involve training AI models to align with human preferences using feedback data, often in natural language processing or robotics. These internships are usually offered by tech companies or research labs and provide hands-on experience in machine learning, data analysis, and experimental design. RLHF interns often collaborate with experienced researchers and engineers to advance AI systems' safety, reliability, and alignment with human values.

What is the difference between Internship Rlhf vs Research Assistant?

Aspect	Internship Rlhf	Research Assistant
Required Credentials	Typically enrolled students or recent graduates	Usually requires a relevant degree or ongoing education in the field
Work Environment	Internship programs, often in academic or research institutions	Research labs, universities, or research-focused organizations
Employer & Industry Usage	Used by educational institutions and research organizations for training	Common in academia, government, and private research sectors
Search & Comparison Intent	People comparing internship opportunities or entry-level research roles	Individuals seeking research support or entry-level research positions

Internship Rlhf and Research Assistant roles both involve research activities, but internships are typically short-term training positions for students or recent graduates, while research assistants are more formal, often requiring relevant education and supporting ongoing research projects. Understanding these differences helps candidates choose the right opportunity based on their experience and career goals.

What types of projects and tasks can I expect to work on during an RLHF internship?

As an RLHF (Reinforcement Learning from Human Feedback) intern, you can expect to engage in a variety of projects that combine machine learning, data annotation, and model evaluation. Typical tasks include curating and labeling datasets, training and fine-tuning machine learning models using human feedback, and conducting experiments to evaluate model performance. You may also collaborate closely with engineers and researchers, participate in team meetings, and contribute to documentation or research publications. This hands-on experience will help you develop both technical and collaborative skills essential for a career in AI research.

What are the key skills and qualifications needed to thrive as an RLHF (Reinforcement Learning from Human Feedback) Intern, and why are they important?

To thrive as an RLHF Intern, you need a solid background in machine learning, statistics, and programming (especially Python), usually supported by ongoing or completed studies in computer science or a related field. Experience with deep learning frameworks (such as TensorFlow or PyTorch), version control systems (like Git), and familiarity with reinforcement learning libraries are typically required. Strong problem-solving abilities, curiosity, and effective teamwork and communication skills help interns contribute meaningfully and learn quickly. These skills and qualities are crucial for successfully developing, evaluating, and improving RLHF models in a collaborative research environment.

More about Internship Rlhf jobs

The 10 Top Types Of Internship Rlhf Jobs

What cities are hiring for Internship Rlhf jobs? Cities with the most Internship Rlhf job openings:

What are the most commonly searched types of Rlhf jobs? The most popular types of Rlhf jobs are:

What states have the most Internship Rlhf jobs? States with the most job openings for Internship Rlhf jobs include:

What job categories do people searching Internship Rlhf jobs look for? The top searched job categories for Internship Rlhf jobs are:

Internship Rlhf jobs near you

Infographic showing various Internship Rlhf job openings in the United States as of July 2026, with employment types broken down into 50% Internship, and 50% Full Time. Highlights an 100% In-person job distribution, with an average salary of $32,333 per year, or $15.5 per hour.

Research Intern, Agent RL Training

NewsBreak

Mountain View, CA • On-site

Apply

$35 - $50/hr

Full-time, Internship

Posted 2 hours ago

Job description

About NewsBreak
Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy. With over 40 million monthly active users, our flagship platform delivers highly personalized local news and information powered by advanced AI, recommendation systems, and adtech.
Recognized by Fast Company as #32 on the Top Workplaces for Innovators, we're proud to be Great Place to Work® certified and home to a dynamic team of technologists, product innovators, and business leaders who are passionate about solving meaningful challenges at scale.
Together, we reached unicorn status in 2021, and we remain committed to continuing this high-growth trajectory with the right team to fulfill our mission: building the infrastructure layer for content intelligence.
If you're inspired to dream big, innovate fast, and make a difference, we'd love to hear from you! For more information, visit www.newsbreak.com/about
About the Role
We are looking for a Research Intern to join our Agent RL Training team. You will be paired with a full-time employee as your mentor, working together to explore, from zero to one, how to apply large language models to NewsBreak's core business, including content understanding, recommendation, agentic web browsing, and autonomous multi-step task completion.
This is a hands-on research role. You are expected to independently drive experiments, propose novel ideas, and iterate quickly. We value self-starters with deep intellectual curiosity and the drive to push boundaries in LLM post-training and agent capabilities.
Location: Onsite in Mountain View, CA office
What You'll Work On

Collaborate with your full-time mentor to identify high-impact research directions for applying LLMs to NewsBreak's products
Independently run end-to-end SFT experiments on LLM-based agents, and assist with RL-related exploration such as reward design and training iteration
Curate and build high-quality training datasets: instruction-following, preference pairs, agent trajectories, and synthetic data
Contribute to public publications; we encourage and support top-venue submissions during your internship

What We're Looking For
Requirements

Highly motivated and committed: willing to put in extra hours when needed to push projects across the finish line
Genuine passion for research: you read papers for fun, tinker with models on weekends, and care deeply about advancing the field
Independently capable of end-to-end model SFT: with basic understanding of RL-based post-training methods (RLHF, DPO, PPO, GRPO, etc.)
Excellent taste in model behavior: able to reason about what "good" looks like across user-facing domains and articulate why
Strong Python and PyTorch skills

Preferred Qualifications

Publication at a top-tier venue (NeurIPS, ICML, ICLR, ACL, EMNLP, or equivalent)
Experience with multi-node distributed training (FSDP, DeepSpeed, Megatron-LM)
Proficiency in writing custom GPU kernels with Triton or CUDA
Experience building synthetic data pipelines for agent training
Familiarity with open-source RL frameworks: TRL, OpenRLHF, veRL/vLLM

Hourly Pay: $35- $50
The US base salary range for this full-time position is listed below. Pay may vary based on a number of factors including job-related skills, level, experience, geographic location and relevant education or training. At NewsBreak, we design our overall rewards package to attract top talents. Depending on the position, the role may also be eligible for discretionary bonus and options. Your recruiter can share more details during the hiring process.
Annual Base Pay Range
$35-$50 USD
CPRA Privacy Notice for California Candidates

Apply

Internship Rlhf Jobs (NOW HIRING)

Research Intern, Agent RL Training

Research Intern, Agent RL Training

Research Intern, Agent RL Training

Research Intern, Agent RL Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Senior Software Development Engineer , Stores Foundational AI - Rufus

Senior Software Development Engineer , Stores Foundational AI - Rufus

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Agentic AI Research Intern

Agentic AI Research Intern

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Research Scientist 4 - Machine Learning and Inference Research, LLM Post-Training

Sr. Machine Learning Engineer, Applied Science

Sr. Machine Learning Engineer, Applied Science

Machine Learning Engineer II, Computer Vision Applied Science

Machine Learning Engineer II, Computer Vision Applied Science

Member of Technical Staff - RL Research (New PhD Grad)

Member of Technical Staff - RL Research (New PhD Grad)

Internship Rlhf information

See salary details

How much do internship rlhf jobs pay per hour?

What are Internship RLHF positions?

What is the difference between Internship Rlhf vs Research Assistant?

What types of projects and tasks can I expect to work on during an RLHF internship?

What are the key skills and qualifications needed to thrive as an RLHF (Reinforcement Learning from Human Feedback) Intern, and why are they important?

Research Intern, Agent RL Training

Share this job

Job description

Share this job