RLHF Jobs in Decatur, GA (NOW HIRING)

Advance Workday's proprietary capabilities in pre-training, post-training (RLHF, DPO), and domain-specific alignment for HR and Finance workflows. * Publish & Open Source: Lead Workday's contribution ...

AI/ML Engineer

Atlanta, GA · On-site

$140K - $160K/yr

... RLHF to improve model accuracy, robustness, and business relevance. • Develop and deploy AI agents and agentic workflows using frameworks such as LangChain, LangGraph, AgentSpace to automate multi ...

... RLHF, RAG and Knowledge graph etc. • Experience in designing and implementing Model Context Protocol (MCP) servers to enable seamless integration between AI agents, enterprise systems, and external ...

RLHF information

What is an RLHF job?

An RLHF (Reinforcement Learning from Human Feedback) job involves training AI models using human feedback to improve their responses. Professionals in this role analyze model outputs, provide evaluations, and refine AI behavior through reinforcement learning techniques. These roles are common in AI research, content moderation, and chatbot development.

What are the key skills and qualifications needed to thrive as a Reinforcement Learning from Human Feedback (RLHF) Engineer, and why are they important?

To thrive as an RLHF Engineer, you need a strong background in machine learning, reinforcement learning, and programming (often Python), typically supported by an advanced degree in computer science or a related field. Experience with ML frameworks (such as TensorFlow or PyTorch), data annotation tools, and large language models is typically required. Strong analytical thinking, collaboration, and clear communication are essential soft skills for research-driven, interdisciplinary teams. These skills and qualities are crucial for developing safe, effective AI systems that integrate human feedback and adapt to complex real-world tasks.

What are some common challenges faced by professionals working in Reinforcement Learning from Human Feedback (RLHF) roles?

Professionals in RLHF roles often encounter challenges related to data quality and alignment between human feedback and model behavior. Collecting consistent, unbiased feedback from human annotators can be complex, and ensuring that the reinforcement learning model interprets this feedback correctly requires careful design of reward functions and training protocols. Additionally, balancing the need for rapid experimentation with maintaining rigorous evaluation standards is crucial. Collaboration with interdisciplinary teams, including data scientists, ML engineers, and domain experts, is common to address these challenges and improve model alignment.
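One common way to check whether feedback from human annotators is consistent is to measure inter-annotator agreement. The sketch below computes Cohen's kappa for two annotators who labeled the same items. It is an illustration only: the function name `cohens_kappa` and the sample labels are hypothetical and do not come from any specific job posting or tool.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same items.

    Hypothetical helper for illustration; 1.0 means perfect agreement,
    0.0 means agreement no better than chance.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")

    n = len(labels_a)
    # Fraction of items on which the two annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Agreement expected by chance, from each annotator's label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label for every item.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Made-up preference labels: which of two responses each annotator preferred.
annotator_1 = ["A", "A", "B", "B"]
annotator_2 = ["A", "A", "B", "A"]
kappa = cohens_kappa(annotator_1, annotator_2)
```

A low kappa on a pilot batch usually suggests that the labeling guidelines need tightening before the feedback is used to train a reward model.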

What are RLHF jobs?

RLHF stands for Reinforcement Learning from Human Feedback. RLHF jobs typically involve roles where professionals help train artificial intelligence (AI) systems, especially large language models, by providing feedback, curating datasets, designing reward models, or developing algorithms that enable AI to learn effectively from human input. These jobs may include positions such as machine learning engineers, data annotators, AI trainers, and research scientists. The goal of RLHF work is to improve the alignment of AI behavior with human values and expectations by incorporating direct human feedback into the training process.
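To make "designing reward models" more concrete, here is a minimal sketch of the pairwise preference loss that reward models are often trained with (a Bradley-Terry style objective). It assumes the reward model has already produced scalar scores for a human-preferred and a rejected response; the function names and the example scores are hypothetical, and real training code would typically use a deep learning framework rather than plain Python.

```python
import math


def pairwise_preference_loss(reward_chosen, reward_rejected):
    """Bradley-Terry style loss: -log(sigmoid(reward_chosen - reward_rejected)).

    The loss is small when the human-preferred response scores higher
    than the rejected one, and large when the ordering is reversed.
    """
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals softplus(-m); this form avoids overflow.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


def mean_preference_loss(score_pairs):
    """Average the loss over (chosen_score, rejected_score) pairs."""
    if not score_pairs:
        raise ValueError("score_pairs must not be empty")
    return sum(pairwise_preference_loss(c, r) for c, r in score_pairs) / len(score_pairs)


# Made-up reward scores for three human-labeled comparisons.
example_pairs = [(2.0, 0.0), (0.5, 0.5), (-1.0, 1.0)]
batch_loss = mean_preference_loss(example_pairs)
```

Minimizing this loss pushes the reward model to score preferred responses above rejected ones; the trained reward model is then typically used as the training signal in the reinforcement learning stage.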

What is the difference between RLHF and RN?

Aspect | RLHF | RN
Required Credentials | No professional license; typically a background in machine learning and programming, often with an advanced degree in computer science or a related field | Registered nurse licensure, possibly with additional certifications
Work Environment | AI research, content moderation, and chatbot development teams | Hospitals, clinics, long-term care facilities, and community health settings
Employer & Industry Usage | AI and software development | General healthcare and nursing services

RLHF (Reinforcement Learning from Human Feedback) is a machine learning technique, not a healthcare credential, whereas RN (Registered Nurse) is a licensed nursing role that provides patient care across various medical settings. The two terms are unrelated: RLHF jobs involve training AI models with human feedback, while RN jobs require nursing licensure.

Principal AI Researcher

Workday

Atlanta, GA

Full-time

Posted 15 days ago


Workday rating

Company rating: 9.2 out of 10

Based on 7 frontline employees who took The Breakroom Quiz

12th of 184 rated software companies


Job description

Your work days are brighter here.

We're obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing people, money, and agents, we're shaping the future of work so teams can reach their potential and focus on what matters most. The minute you join, you'll feel it. Not just in the products we build, but in how we show up for each other. Our culture is rooted in integrity, empathy, and shared enthusiasm. We're in this together, tackling big challenges with bold ideas and genuine care. We look for curious minds and courageous collaborators who bring sun-drenched optimism and drive. Whether you're building smarter solutions, supporting customers, or creating a space where everyone belongs, you'll do meaningful work with Workmates who've got your back. In return, we'll give you the trust to take risks, the tools to grow, the skills to develop and the support of a company invested in you for the long haul. So, if you want to inspire a brighter work day for everyone, including yourself, you've found a match in Workday, and we hope to be a match for you too.

About the Team

Workday AI Research is a newly built, elite organization dedicated to advancing the state-of-the-art in AI to power the Agentic Enterprise. We sit at the heart of Agent Factory, Workday's innovation hub. While Agent Factory builds the production-grade agents used by 65M+ people, the AI Research team discovers the fundamental breakthroughs that make those agents possible.
We are a high-impact, "research-to-reality" team. We don't just conduct experiments in a vacuum; we solve the hardest enterprise-level problems in reasoning, planning, and multi-modal understanding. As a founding member of this research arm, you will help shape our research culture, publication strategy, and the fundamental intelligence layer of the Workday platform.

About the Role

As a Principal AI Researcher, you will be a technical founder and visionary for the Workday AI Research Team. This is a role for a scientist-leader who thrives on defining the frontier rather than just following a roadmap. You will move the needle on how LLMs and autonomous agents function within the complex, high-trust environment of global enterprise data.

You will bridge the gap between groundbreaking theory and massive-scale application. Your work will involve defining research roadmaps for agentic reasoning, long-horizon planning, and model alignment, ensuring that Workday remains the world leader in responsible, intelligent enterprise agents.

Key Responsibilities:

  • Define research direction in Agentic AI and LLMs.

  • Advance the state of the art in agentic systems, including retrieval, grounding, memory, context, personalization, etc.

  • Foundational Model Research: Advance Workday's proprietary capabilities in pre-training, post-training (RLHF, DPO), and domain-specific alignment for HR and Finance workflows.

  • Publish & Open Source: Lead Workday's contribution to the global AI community through publications at top-tier venues (NeurIPS, ICLR, ICML, ACL, etc.) and strategic open-source releases.

  • Research-to-Production Bridge: Partner with Agent Factory engineering pods to integrate SOTA research into production-ready services that scale to the world's largest organizations.

  • Mentorship & Culture: Act as a force multiplier for the newly formed research team, fostering a culture of scientific rigor, curiosity, and high-velocity experimentation.

About You

Basic Qualifications

  • 8+ years of experience in AI/ML research or applied science, with a proven history of taking products from applied research to global-scale production.

  • Ph.D. or Master's degree in Computer Science, Artificial Intelligence, Math, Physics, or equivalent technical field.

  • 4+ years of professional experience using deep learning frameworks such as PyTorch, JAX, or TensorFlow.

  • 2+ years of deep expertise in LLMs or VLMs, specifically in training, alignment (RLHF/DPO), or agentic reasoning frameworks.

  • SOTA Publication Record: A strong history of lead-researcher publications in top AI venues OR a proven track record of shipping industry-defining AI innovations to millions of users.

Other Qualifications

  • Agentic AI Pioneer: Strong background in reinforcement learning, tool-use, or multi-agent systems.

  • Systems Thinking: Ability to conduct research under real-world constraints, such as latency, cost-to-serve, and enterprise data privacy.

  • Leadership in Ambiguity: Proven experience in building a research agenda from the ground up and leading technical strike teams through open-ended problems.

  • Communication: Exceptional ability to translate complex research breakthroughs into strategic value for product leaders and executive stakeholders.

  • Ethics & Trust: A deep commitment to developing AI that is explainable, fair, and safe for the world's largest workforces.


Workday Pay Transparency Statement

The annualized base salary ranges for the primary location and any additional locations are listed below. Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process. Each candidate's compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things.

Primary Location: USA.CA.Pleasanton
Primary Location Base Pay Range: $228,000 USD - $342,000 USD
Additional US Location(s) Base Pay Range: $190,600 USD - $342,000 USD


Our Approach to Flexible Work

With Flex Work, we're combining the best of both worlds: in-person time and remote. Our approach enables our teams to deepen connections, maintain a strong community, and do their best work. We know that flexibility can take shape in many ways, so rather than a number of required days in-office each week, we simply spend at least half (50%) of our time each quarter in the office or in the field with our customers, prospects, and partners (depending on role). This means you'll have the freedom to create a flexible schedule that caters to your business, team, and personal needs, while being intentional to make the most of time spent together. Those in our remote "home office" roles also have the opportunity to come together in our offices for important moments that matter.

Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.

Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.


At Workday, we are committed to providing an accessible and inclusive hiring experience where all candidates can fully demonstrate their skills. If you require assistance or an accommodation at any point, please email accommodations@workday.com.

Are you being referred to one of our roles? If so, ask your connection at Workday about our Employee Referral process!

At Workday, we value our candidates' privacy and data security. Workday will never ask candidates to apply to jobs through websites that are not Workday Careers.

Please be aware of sites that may ask you to enter your data in connection with a job posting that appears to be from Workday but is not.

In addition, Workday will never ask candidates to pay a recruiting fee, or pay for consulting or coaching services, in order to apply for a job at Workday.


About Workday

Sourced by ZipRecruiter

Workday's journey began with a transformative idea generated during a breakfast conversation between its founders in sunny California. What set us apart from the start was our people-centric culture, driven by the core value of prioritizing our employees. At Workday, the happiness, growth, and contributions of every team member are at the heart of who we are. Our collaborative and employee-focused culture is the key ingredient for our business success. We not only care for our people but also for the communities and the environment, all while maintaining profitability. Embrace your uniqueness, as we encourage our Workmates to shine brightly in their authentic selves. Our passion and energy make us distinct, and we are inspired to create a brighter workday for everyone.

Industry

Software development

Company size

10,000+ Employees

Headquarters location

Pleasanton, CA, US

Year founded

2005