RLHF Jobs (NOW HIRING)

XPENG

Senior Staff Research Engineer - Reinforcement Learning for AI Agents

Santa Clara, CA · On-site

$122K - $168K/yr

Preferred: • Experience with RLHF or preference learning. • Experience with LLM agents or tool-using AI systems. • Multi-agent systems or long-horizon planning. • Simulation environments for ...

DGN Technologies

Software Engineer - Machine Learning III

Mountain View, CA · On-site

$67.25 - $90.50/hr

RLHF, reward modeling, policy optimization) to optimize guardrail model performance, calibration, and robustness against adaptive adversaries. 1. Curate and generate adversarial training data: direct ...

LXT

Sales Senior Director

Manhattan, NY

This role owns the commercialization of complex, multi-year enterprise programs spanning data collection, annotation, evaluation, RLHF, multimodal datasets, and secure AI data operations. Reporting ...

OpenAI

Researcher, Robustness & Safety Training

San Francisco, CA · On-site

Responsibilities: • Conduct state-of-the-art research on AI safety topics such as RLHF, adversarial training, robustness, and more. • Implement new methods in OpenAI's core model training and ...

Scale AI

Tech Lead Manager- MLRE, ML Systems

San Francisco, CA · On-site

Preferred: • Demonstrated expertise in post-training methods and/or next-generation use cases for large language models including instruction tuning, RLHF, tool use, reasoning, agents, and ...

Nuance Labs

Member of Technical Staff -- RL Research (Experienced)

Seattle, WA · On-site

Required: • Significant hands-on experience with RL, RLHF, RLAIF, post-training, alignment, or large-scale fine-tuning for modern foundation models. • Deep understanding of RL/post-training ...

Embedding VC

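AI Data Platform Product Manager | Annotation / Evaluation Track

Palo Alto, CA · On-site

... experience with RLHF / model evaluation; experience with image, video, and multimodal data products; experience with automatic annotation, human-machine collaboration, expert annotation, and crowdsourced annotation; experience building a platform product from 0 to 1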

OSI Engineering, Inc.

ML Engineer, Prompt Safety & Agent Security

Mountain View, CA · On-site

$95 - $110/hr

RLHF, reward modeling, policy optimization) to optimize guardrail model performance, calibration, and robustness against adaptive adversaries. * Curate and generate adversarial training data: direct ...

xAI

Member of Technical Staff - Post-Training and RL

Palo Alto, CA · On-site

$180K - $600K/yr

You will work on the most critical post-training and reinforcement learning challenges at any given time -- including reward modeling, preference optimization (RLHF/DPO), and RL for improving ...

Tzafon

Member of Technical Staff - Applied AI

San Francisco, CA · On-site

$150K - $400K/yr

Work with post-training team on RLHF and behavioral optimization * Build tools for model introspection and capability mapping * Ship continuous evaluation systems that catch regressions before users ...

Two Sigma Investments, LP

Post-Training Research Scientist

New York, NY · On-site

$165K - $300K/yr

We are hiring a Post-Training Research Scientist to build RLHF, DPO, and reward modeling capabilities from the ground up. This is a greenfield role: you will define the infrastructure, research ...

New

Tiki AI

AI Data Software Engineer

San Francisco, CA · On-site

$134K - $162K/yr

Direct algorithms like SFT, RLHF, and Chain-of-Thought into measurable, automated data production standards and agentic workflows. • Contribute to data pipeline design and improvements to increase ...

Hyphen Connect Limited

AI Safety Specialist (AI Engineering)

San Francisco, CA · On-site

Develop constitutional AI principles and assist with RLHF alignment pipelines. Qualifications: * Background in cybersecurity, prompt engineering, or adversarial ML. * Experience with jailbreak ...

IT America Inc

Agentic AI Engineer Lead

Dallas, TX · On-site

$101K - $133K/yr

The ideal candidate will have deep expertise in LLM orchestration, knowledge graphs, reinforcement learning (RLHF/RLAIF), and real-world AI applications. As a leader in this space, they will be ...

Centific

Senior Staff Research Scientist, Reinforcement Learning

East Palo Alto, CA · On-site

Post-train LLM agents using RLHF, DPO, GRPO, PPO, and emerging methods * Build pipelines that convert human-labeled traces and verifiable signals into training data * Architect multi-turn, tool-using ...

CodeRabbit

Applied AI Engineer

San Francisco, CA

$175K - $275K/yr

Apply RLHF, ranking, and reward modeling techniques to improve response quality over time * Stay current with the latest generative AI developments and apply them to new use cases Qualifications

Cognition

Research, Post-Training

San Francisco, CA · On-site

Apply and advance techniques like RLHF, RLAIF, and constitutional approaches to shape how agents reason, act, and collaborate with humans in long-horizon tasks. • Scaling and Exploration: Measure ...

RLHF information

What are some common challenges faced by professionals working in Reinforcement Learning from Human Feedback (RLHF) roles?

Professionals in RLHF roles often encounter challenges related to data quality and alignment between human feedback and model behavior. Collecting consistent, unbiased feedback from human annotators can be complex, and ensuring that the reinforcement learning model interprets this feedback correctly requires careful design of reward functions and training protocols. Additionally, balancing the need for rapid experimentation with maintaining rigorous evaluation standards is crucial. Collaboration with interdisciplinary teams, including data scientists, ML engineers, and domain experts, is common to address these challenges and improve model alignment.

What are RLHF jobs?

RLHF stands for Reinforcement Learning from Human Feedback. RLHF jobs typically involve roles where professionals help train artificial intelligence (AI) systems, especially large language models, by providing feedback, curating datasets, designing reward models, or developing algorithms that enable AI to learn effectively from human input. These jobs may include positions such as machine learning engineers, data annotators, AI trainers, and research scientists. The goal of RLHF work is to improve the alignment of AI behavior with human values and expectations by incorporating direct human feedback into the training process.

What are the key skills and qualifications needed to thrive as a Reinforcement Learning from Human Feedback (RLHF) Engineer, and why are they important?

To thrive as an RLHF Engineer, you need a strong background in machine learning, reinforcement learning, and programming (often Python), typically supported by an advanced degree in computer science or a related field. Experience with ML frameworks (such as TensorFlow or PyTorch), data annotation tools, and familiarity with large language models are typically required. Strong analytical thinking, collaboration, and clear communication are essential soft skills to succeed in research-driven, interdisciplinary teams. These skills and qualities are crucial for developing safe, effective AI systems that integrate human feedback and adapt to complex real-world tasks.

What is the difference between RLHF and RN?

Aspect	RLHF	RN
Required Credentials	No professional license; typically a strong machine learning background, often supported by an advanced degree in computer science or a related field	Registered nurse licensure, possibly with additional certifications
Work Environment	AI research labs and technology companies	Hospitals, clinics, long-term care facilities, and community health settings
Employer & Industry Usage	AI research, model training, and data annotation	General healthcare and nursing services

The two terms are unrelated. RLHF (Reinforcement Learning from Human Feedback) is a technique for training AI models, not a healthcare credential, whereas RN (Registered Nurse) is a licensed nursing role that provides patient care across various medical settings.

What is an RLHF job?

An RLHF (Reinforcement Learning from Human Feedback) job involves training AI models using human feedback to improve their responses. Professionals in this role analyze model outputs, provide evaluations, and refine AI behavior through reinforcement learning techniques. These roles are common in AI research, content moderation, and chatbot development.

Senior Staff Research Engineer - Reinforcement Learning for AI Agents

XPENG

Santa Clara, CA • On-site

$122K - $168K/yr

Full-time

Re-posted 23 days ago

Job description

Job Summary:
XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles. They are looking for exceptional Research Engineers / Scientists to design learning systems that allow agents to plan over long horizons and improve through experience.
Responsibilities:
• Reinforcement learning methods for LLM-driven agents and decision systems.
• Policy optimization for long-horizon reasoning and planning.
• Learning from human or AI feedback (RLHF / RLAIF).
• Agent training pipelines built on top of our agent infrastructure platform.
• Evaluation and benchmarking systems for agent capabilities.
• Learning loops that integrate real-world and simulation data.
• Contribute to AI systems that continuously improve after deployment.
Qualifications:
Required:
• MS or PhD in Computer Science, AI, Machine Learning, Robotics, or a related field.
• Strong background in reinforcement learning or machine learning.
• Experience implementing RL algorithms such as PPO, Actor-Critic, or policy gradient methods.
• Strong programming skills in Python with PyTorch or JAX.
• Experience building ML training systems or infrastructure.
Preferred:
• Experience with RLHF or preference learning.
• Experience with LLM agents or tool-using AI systems.
• Multi-agent systems or long-horizon planning.
• Simulation environments for RL.
• Publications in NeurIPS, ICML, ICLR, ACL, or related venues.
Company:
XPENG is a leading Chinese smart EV company that designs, develops, manufactures, and markets smart EVs that appeal to the large and growing base of technology-savvy middle-class consumers. Founded in 2014, the company is headquartered in Guangzhou, China, and has more than 10,000 employees. It is currently a late-stage company.
