Remote RLHF Jobs (NOW HIRING)

Mercor

Bilingual Evaluator - Kannada Specialist

San Francisco, CA · Remote

$15 - $20/hr

Remote Role Responsibilities * Conduct fact-checking using trusted public sources and external ... Prior experience with RLHF, model evaluation, or data annotation work. * Experience writing or ...

Mercor

Bilingual Evaluator - Odia Specialist

New York, NY · Remote

$15 - $20/hr

Mercor

Bilingual Evaluator - Language Specialist

New York, NY · Remote

$15 - $20/hr

Mercor

Bilingual Language Model Evaluator

San Francisco, CA · Remote

$15 - $20/hr

Mercor

Bilingual Evaluator - Telugu Expert

New York, NY · Remote

$15 - $20/hr

Mercor

Bilingual Analyst - Punjabi Expert

New York, NY · Remote

$15 - $20/hr

Remote Role Responsibilities * Conduct fact-checking using trusted public sources and external ... Experience with RLHF, model evaluation, or data annotation work * Experience writing or editing ...

TMS

AI/ML Engineer

New York, NY · Remote

$60 - $62/hr

US/Canada - Remote. Minimum experience required: 8+ years. We are looking for a GenAI Engineer to design ... Experience with model fine-tuning (LoRA, PEFT, RLHF basics) * Knowledge of MLOps tools and CI/CD ...

Mercor

Gujarati Language Evaluation Specialist

San Francisco, CA · Remote

$15 - $20/hr

Mercor

Bilingual Evaluator - Marathi Expert

New York, NY · Remote

$15 - $20/hr

Mercor

AI Safety Expert - Red Teaming

San Francisco, CA · Remote

$20 - $22/hr

Remote Role Responsibilities * Red team conversational AI models and agents. Conduct jailbreaks ... Experience in Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model ...

Mercor

Bilingual Analyst - Bengali Expert

New York, NY · Remote

$15 - $20/hr

Mercor

Bilingual Evaluator - LLM Specialist

New York, NY · Remote

$15 - $20/hr

Mercor

AI Safety Expert - Red Team

San Francisco, CA · Remote

$20 - $22/hr

Contract Compensation: $20-$22/hour Location: Remote Role Responsibilities * Red team ... Experience with Adversarial ML: jailbreak datasets, prompt injection, RLHF/DPO attacks, model ...

Zillow

Principal Applied Scientist, Agentic AI

$181K - $290K/yr

We collaborate closely with platform, product, and operations partners in a fast-moving, remote ... RLHF, RLAIF, or DPO for multi-objective optimization. * Develop reward models and objective ...

HumanSignal

High Volume (TOFU) Recruiter

Columbus, OH · On-site +1

$55K - $100K/yr

San Francisco, CA preferred; open to other remote options About the Role HumanSignal Services runs ... Familiarity with AI data operations, annotation, or RLHF workforce programs * Experience with ATS ...

HumanSignal

High Volume (TOFU) Recruiter

San Francisco, CA · On-site +1

$55K - $100K/yr

HumanSignal

High Volume (TOFU) Recruiter

Austin, TX · On-site +1

$55K - $100K/yr

Mercor

ML Research Engineer - PhD - AI Trainer

Seattle, WA · Remote

$75 - $90/hr

Remote Commitment: 20+ hours/week Role Responsibilities * Attempt open-ended machine learning ... LoRA, RLHF, architecture design, contrastive training, generative modeling, multilingual ...

Remote RLHF information

How does a Remote RLHF (Reinforcement Learning from Human Feedback) specialist typically collaborate with other team members?

A Remote RLHF specialist often works closely with data scientists, machine learning engineers, and product managers to design and refine AI models using human feedback. Collaboration usually happens through regular virtual meetings, cloud-based code repositories, and shared annotation tools. The role requires clear communication to ensure that human feedback is accurately integrated into the learning process and that model improvements align with project goals. Being proactive in sharing findings and challenges is key, as team members may be distributed across different time zones.

What are the key skills and qualifications needed to thrive as a Remote RLHF (Reinforcement Learning from Human Feedback) Engineer, and why are they important?

To succeed as a Remote RLHF Engineer, you need expertise in machine learning, reinforcement learning, and programming languages like Python, often supported by an advanced degree in computer science or related fields. Familiarity with ML frameworks (such as TensorFlow or PyTorch), version control systems, and cloud computing platforms is typically required. Strong problem-solving, communication, and self-management skills are vital for remote collaboration and interpreting human feedback effectively. These skills enable the development of robust AI systems that can learn efficiently from human input while ensuring productive teamwork in a distributed environment.

What is a Remote RLHF job?

A Remote RLHF (Reinforcement Learning from Human Feedback) job involves working with artificial intelligence systems, particularly large language models, to improve their performance using feedback from humans. In this role, individuals may annotate data, provide quality evaluations, or help design feedback mechanisms while working from a remote location. These jobs are crucial for ensuring AI models align better with human values and expectations, and they are often offered by AI research companies or organizations focused on machine learning. The work can involve tasks such as ranking AI-generated responses, identifying errors, and suggesting improvements. Remote RLHF positions are popular due to their flexibility and the opportunity to contribute to cutting-edge AI technology.

Infographic showing Remote RLHF job openings in the United States as of July 2026, with employment types broken down into 54% full-time, 8% part-time, and 38% contract. All listings (100%) are remote.

Bilingual Evaluator - Kannada Specialist

Mercor

San Francisco, CA • Remote

$15 - $20/hr

Full-time

Re-posted 7 days ago

Job description

About the job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Generalist - English & Kannada
Type: Contract
Compensation: $15–$20/hour
Location: Remote

Role Responsibilities

Conduct fact-checking using trusted public sources and external tools.
Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies.
Assess reasoning quality, clarity, tone, and completeness of responses.
Ensure model responses align with expected conversational behavior and system guidelines.

Qualifications

Must-Have

Bachelor's degree.
Native speaker of Kannada.
Significant experience using large language models (LLMs).
Excellent writing skills in English.
Strong attention to detail.
Background or experience in domains requiring structured analytical thinking.

Preferred

Prior experience with RLHF, model evaluation, or data annotation work.
Experience writing or editing high-quality written content.
Experience comparing multiple outputs and making fine-grained qualitative judgments.

Application Process (Takes 20–30 mins to complete)

Upload resume
AI interview based on your resume
Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
