2

Remote Rlhf Jobs in California (NOW HIRING)

Senior AI Engineer

Los Altos, CA · On-site +1

$123.80K - $169.90K/yr

We offer a flexible, remote working environment. You can expect a warm welcome from a friendly and ... Build and improve the feedback loops - fine-tuning, reward models, RLHF, and RLAIF - that keep ...

Delivery Lead

San Francisco, CA · Remote

$110K - $140K/yr

... and remote workforce marketplaces can't. We own projects end-to-end, from scoping and protocol ... Our work spans RLHF, evals, red-teaming, and custom multimodal data creation, all powered by Label ...

... and remote workforce marketplaces can't. We own projects end-to-end, from scoping and protocol ... Our work spans RLHF, evals, red-teaming, and custom multimodal data creation, all powered by Label ...

Senior Software Engineer, Agent

Palo Alto, CA · Remote

$144.40K - $190.30K/yr

Palo Alto HQ | Type: Full-time, On-site/Remote About the Role We're looking for a Senior Agent ... fine-tuning, RLHF, or DPO • Familiarity with AI safety and alignment considerations • ...

next page

Showing results 1-20

Remote Rlhf information

What are the key skills and qualifications needed to thrive as a Remote RLHF (Reinforcement Learning from Human Feedback) Engineer, and why are they important?

To succeed as a Remote RLHF Engineer, you need expertise in machine learning, reinforcement learning, and programming languages like Python, often supported by an advanced degree in computer science or related fields. Familiarity with ML frameworks (such as TensorFlow or PyTorch), version control systems, and cloud computing platforms is typically required. Strong problem-solving, communication, and self-management skills are vital for remote collaboration and interpreting human feedback effectively. These skills enable the development of robust AI systems that can learn efficiently from human input while ensuring productive teamwork in a distributed environment.

How does a Remote RLHF (Reinforcement Learning from Human Feedback) specialist typically collaborate with other team members?

A Remote RLHF specialist often works closely with data scientists, machine learning engineers, and product managers to design and refine AI models using human feedback. Collaboration usually happens through regular virtual meetings, cloud-based code repositories, and shared annotation tools. The role requires clear communication to ensure that human feedback is accurately integrated into the learning process and that model improvements align with project goals. Being proactive in sharing findings and challenges is key, as team members may be distributed across different time zones.

What is a Remote RLHF job?

A Remote RLHF (Reinforcement Learning from Human Feedback) job involves working with artificial intelligence systems, particularly large language models, to improve their performance using feedback from humans. In this role, individuals may annotate data, provide quality evaluations, or help design feedback mechanisms while working from a remote location. These jobs are crucial for ensuring AI models align better with human values and expectations, and they are often offered by AI research companies or organizations focused on machine learning. The work can involve tasks such as ranking AI-generated responses, identifying errors, and suggesting improvements. Remote RLHF positions are popular due to their flexibility and the opportunity to contribute to cutting-edge AI technology.

What is the difference between Remote Rlhf vs Remote Rlhf?

AspectRemote RlhfRemote Rlhf
CredentialsTypically requires certification in mental health or counseling, such as LPC or LCSWSimilar credentials, often with additional training in specific therapy methods
Work EnvironmentRemote, client-facing sessions via telehealth platformsRemote, providing therapy or support services online
Industry UsageCommon in mental health, therapy, and counseling sectorsUsed in mental health and support services, often interchangeably with Rlhf

Remote Rlhf and Remote Rlhf are similar roles in mental health support, primarily differing in specific certifications or training focus. Both roles involve providing remote therapy or support services via telehealth platforms, making them highly comparable in work environment and industry usage.

What are the most commonly searched types of Rlhf jobs in California? The most popular types of Rlhf jobs in California are:
What are popular job titles related to Remote Rlhf jobs in California? For Remote Rlhf jobs in California, the most frequently searched job titles are:
What cities in California are hiring for Remote Rlhf jobs? Cities in California with the most Remote Rlhf job openings:
Infographic showing various Remote Rlhf job openings in California as of May 2026, with employment types broken down into 17% Internship, and 83% Full Time. Highlights an 100% Remote job distribution.

Senior AI Engineer

Saga

Los Altos, CA • On-site, Remote

$123.80K - $169.90K/yr

Full-time

Medical, Dental, Vision, Retirement, PTO

Posted 11 days ago


Job description

Saga is building the infrastructure and products for the next generation of AI agents. Our AI Character Agent Network allows studios, creators, and publishers to turn their most iconic IP into living, breathing digital companions that engage fans across social media, drive organic user acquisition, and power a new kind of agentic commerce.
Our agents run on a proprietary training architecture that combines LLMs and SLMs to produce fine-tuned character AI agents - autonomous enough to develop their own intelligence and personality, while guided by the creative vision of the original studios and creators. We've processed over 23 million transactions worth more than $2 billion since our early 2024 launch, and we're just getting started.
Our team is world-class - veteran web3 entrepreneurs and builders from Apple, X (formerly Twitter), Robinhood, Samsung, NVIDIA, Tendermint and Skuchain - and we're looking to grow as we scale our agent platform to more studios, more platforms, and more users.
We offer a flexible, remote working environment. You can expect a warm welcome from a friendly and international team that will support you in your personal and professional growth.
The Role
As a Senior AI Agent Engineer, you'll be at the center of how we build, train, deploy, and operate character AI agents at scale. You'll work across the full lifecycle - from fine-tuning models and orchestrating SLM swarms to deploying agents across social platforms and building the infrastructure that keeps them running reliably. This role blends ML engineering, backend development, and systems thinking.
The Work
  • Build and maintain the training and inference pipelines for character AI agents, including LLM/SLM orchestration and swarm-based architectures
  • >
  • Deploy and operate AI agents across social media platforms (Instagram, X, WhatsApp, TikTok) with consistent character behavior and personality coherence
  • >
  • Develop tooling for studios and creators to customize agent personality, lore, brand guidelines, and behavioral guardrails
  • >
  • Build and improve the feedback loops - fine-tuning, reward models, RLHF, and RLAIF - that keep agents improving over time
  • >
  • Architect and scale the infrastructure supporting agent deployments, including multi-modal capabilities (text, voice, video, livestreaming)
  • >
  • Contribute to the agentic commerce platform - enabling agents to drive transactions, recommend products, and interact with payment systems
  • >
  • Implement safety and content moderation systems to ensure agents behave appropriately across diverse user interactions
  • >
  • Collaborate with studio partners to translate creative direction into agent training parameters and behavioral specifications
  • >
  • Monitor agent performance, engagement metrics, and behavioral drift in production
  • >
Must Have
  • 5+ years of backend or ML engineering experience, with hands-on work deploying AI/ML models to production
  • >
  • Experience with LLM fine-tuning, prompt engineering, and inference optimization
  • >
  • Strong proficiency in Python; experience with Golang is a plus
  • >
  • Experience building and operating API services and data pipelines at scale
  • >
  • Familiarity with reinforcement learning techniques (RLHF, reward modeling) or agent framework development
  • >
  • Understanding of multi-model architectures - orchestrating multiple models or agents to collaborate on tasks
  • >
  • Experience deploying applications that integrate with third-party platforms and APIs (social media, messaging, commerce)
  • >
  • Strong systems thinking - comfortable reasoning about distributed systems, scaling, and reliability
  • >
  • Exceptional problem-solving and communication skills
  • >
Nice to Have
  • Experience with character AI, conversational AI, or NPC behavior systems
  • >
  • Background in gaming, entertainment, or media technology
  • >
  • Experience with multi-modal AI (voice, image, video generation or processing)
  • >
  • Familiarity with blockchain infrastructure, on-chain payments, or crypto-native commerce
  • >
  • Experience with MCP (Model Context Protocol) or agent-to-agent communication protocols
  • >
  • Understanding of content safety, trust & safety systems, or responsible AI practices
  • >
  • Contributions to open-source AI/ML projects
  • >

Benefits
  • Work remotely from anywhere in the world
  • >
  • Work at the intersection of AI, entertainment, and crypto
  • >
  • Flexible working hours
  • >
  • Flexible vacation policy
  • >
  • Competitive salary
  • >
  • Stock options
  • >
  • Full benefits*
  • >

*Medical, Dental, Vision, and 401k retirement plans for US employees only.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.