Support RLHF and Preference Data Workflows Design and implement tooling that supports RLHF-style ... Knowledge of designing and managing scalable database systems, including relational databases (e.g ...
Afterpay is transforming the way customers manage their spending over time. TIDAL is a music ... Drive deeper model optimization work (fine-tuning, distillation, RLHF) where it unlocks agent ...
Strategic Projects Lead
San Francisco, CA · Remote
$75K - $110K/yr
Experience managing distributed workforces or marketplace operations * Exposure to AI data, RLHF, or model evaluation workflows * Background in investment banking, private equity, or management ...
Research Engineer - Ads Integrity
San Jose, CA · On-site
$156K - $316K/yr
... SFT, RLHF, and AI safety. 2. Design and deploy AIGC solutions for content understanding and ... managing confidential information including proprietary and trade secret information and access to ...
High Volume (TOFU) Recruiter
San Francisco, CA · On-site +1
$55K - $100K/yr
We design and create datasets from scratch, recruit and manage the domain experts who evaluate ... Familiarity with AI data operations, annotation, or RLHF workforce programs * Experience with ATS ...
Staff Product Manager, AI Safety
San Francisco, CA · On-site +1
As a Staff Product Manager for the GenAI Safety team within Trust & Safety, you'll define and drive ... RLHF) * Experience with AI ethics frameworks, responsible AI principles, or relevant regulatory ...
Design interruption handling, turn-taking logic, and conversational state management for natural ... Experiment with prompt engineering, RLHF, distillation, and other techniques to optimize model ...
... RLHF, context engineering) to optimize models for autonomy tasks. * Internal Customer Focus: Drive ... People Management & Development: Proactively create development opportunities for team members ...
... RLHF and multilingual AI solutions for complex, high impact use cases. The work is global, fast ... Build and manage a healthy pipeline within your accounts, with accurate forecasting and clear ...
Machine Learning Engineer, Chakra
Santa Clara, CA · On-site +1
$120K - $235K/yr
Architect and develop Chakra end to end: the agent design, conversation management, real-time ... Build fine-tuning and RLHF workflows to push model judgment past what off-the-shelf models deliver ...
Strong understanding of LLM training pipelines - pretraining, supervised fine-tuning, RLHF/DPO, and ... You will work daily with field engineers, project managers, operations teams, and an independent ...
Own end-to-end LLM fine-tuning pipelines -- data curation, training, RLHF/DPO alignment, and ... Manager, and GenStudio enable people and businesses to turn ideas into impact, powered by AI and ...
Manager, Multi-Modal Language Action Models
Foster City, CA · On-site
$242K - $333K/yr
... RLHF, context engineering) to optimize models for autonomy tasks. * Internal Customer Focus: Drive ... People Management & Development: Proactively create development opportunities for team members ...
Familiarity with AI, machine learning, data pipelines, RLHF, or data-labeling/annotation operations. * Experience managing contractor or contingent workforces at scale. * Experience owning margin ...
Principal AI Researcher
Pleasanton, CA · On-site
As a Fortune 500 company and a leading AI platform for managing people, money, and agents, we're ... Advance Workday's proprietary capabilities in pre-training, post-training (RLHF, DPO), and domain ...
Manager RLHF information
Job description
We're looking for a Full-Stack AI Engineer to join our team, where you'll build the next generation of tools for developing, evaluating, and training state-of-the-art AI systems. You will own features end to end, from user-facing experiences and APIs to backend services, data models, and infrastructure.
You'll be at the heart of our applied AI efforts, with a particular focus on human-in-the-loop systems used to generate high-quality training data for Large Language Models (LLMs) and AI agents. This includes building a platform that enables us and our customers to create and evaluate data, as well as systems that leverage LLMs to assist with reviewing, scoring, and improving human submissions.
Your Impact
- Own End-to-End Product Features: Design, build, and ship complete workflows spanning frontend UI, APIs, backend services, databases, and production infrastructure.
- Enable Human-in-the-Loop AI Training: Build systems that allow humans to efficiently create, review, and curate high-quality training and evaluation data used in AI model development.
- Support RLHF and Preference Data Workflows: Design and implement tooling that supports RLHF-style pipelines, including task generation, human review, scoring, aggregation, and dataset versioning.
- Leverage LLMs in the Review Loop: Build systems that use LLMs to assist human reviewers (such as automated checks, critiques, ranking suggestions, or quality signals) while maintaining human oversight.
- Advance AI Evaluation: Design and implement evaluation frameworks and interactive tools for LLMs and AI agents across multiple data modalities (text, images, audio, video).
- Create Intuitive, Reviewer-Focused Interfaces: Build thoughtful, efficient user interfaces (e.g., in React) optimized for high-throughput human review, quality control, and operational workflows.
- Architect Scalable Data & Service Layers: Design APIs, backend services, and data schemas that support large-scale data creation, review, and iteration with strong guarantees around correctness and traceability.
- Solve Ambiguous, Real-World Problems: Translate loosely defined operational and research needs into practical, scalable, end-to-end systems.
- Ensure System Reliability: Participate in on-call rotations to monitor, troubleshoot, and resolve issues across the full stack.
- Elevate the Team: Improve engineering practices, development processes, and documentation. Share knowledge through technical writing and design discussions.
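For candidates who want a concrete picture of the "scoring, aggregation, and dataset versioning" steps mentioned above, here is a minimal Python sketch. It is an illustration only, not Labelbox's implementation: the `Judgment` record, the majority-vote rule, and the content-hash version string are all assumptions made for this example.

```python
import hashlib
import json
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Judgment:
    """One reviewer's pairwise preference (hypothetical schema)."""
    task_id: str      # one prompt with two candidate responses, "a" and "b"
    reviewer_id: str
    preferred: str    # "a" or "b"


def aggregate(judgments):
    """Majority vote per task; ties get winner=None so they can be re-reviewed."""
    votes = {}
    for j in judgments:
        votes.setdefault(j.task_id, Counter())[j.preferred] += 1
    result = {}
    for task_id, counts in votes.items():
        a, b = counts.get("a", 0), counts.get("b", 0)
        winner = "a" if a > b else "b" if b > a else None
        result[task_id] = {"winner": winner, "agreement": max(a, b) / (a + b)}
    return result


def dataset_version(aggregated):
    """Short content hash: identical labels always map to the same version."""
    canonical = json.dumps(aggregated, sort_keys=True).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()[:12]


judgments = [
    Judgment("t1", "r1", "a"),
    Judgment("t1", "r2", "a"),
    Judgment("t1", "r3", "b"),
    Judgment("t2", "r1", "b"),
    Judgment("t2", "r2", "a"),
]
labels = aggregate(judgments)
version = dataset_version(labels)
```

In this sketch, task `t1` resolves to response `a` with two of three reviewers agreeing, while `t2` is a tie and is flagged for another review pass. Hashing the canonical JSON means the version changes only when the labels themselves change, which is one simple way to make a dataset traceable.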
- Bachelor's degree in Computer Science, Data Engineering, or a related field.
- 2+ years of experience in a software or machine learning engineering role.
- A proactive, product-focused mindset and a high degree of ownership, with a passion for building solutions that empower users.
- Experience using frontend frameworks like React/Redux and backend systems and technologies like Python, Java, GraphQL; familiarity with NodeJS and NestJS is a plus.
- Knowledge of designing and managing scalable database systems, including relational databases (e.g., PostgreSQL, MySQL), NoSQL stores (e.g., MongoDB, Cassandra), and cloud-native solutions (e.g., Google Spanner, AWS DynamoDB).
- Familiarity with cloud infrastructure like GCP (GCS, PubSub) and containerization (Kubernetes) is a plus.
- Excellent communication and collaboration skills.
- High proficiency in leveraging AI tools for daily development (e.g., Cursor, GitHub Copilot).
- A focus on writing clean, well-tested code and delivering your work on time.
- Experience building tools for AI/ML applications, particularly for data annotation, monitoring, or agent evaluation.
- Familiarity with data infrastructure components such as data pipelines, streaming systems, and storage architectures (e.g., Cloud Buckets, Key-Value Stores).
- Previous experience with search engines (e.g., ElasticSearch).
- Experience in optimizing databases for performance (e.g., schema design, indexing, query tuning) and integrating them with broader data workflows.
At Labelbox Engineering, we're building a comprehensive platform that powers the future of AI development. Our team combines deep technical expertise with a passion for innovation, working at the intersection of AI infrastructure, data systems, and user experience. We believe in pushing technical boundaries while maintaining high standards of code quality and system reliability. Our engineering culture emphasizes autonomous decision-making, rapid iteration, and collaborative problem-solving. We've cultivated an environment where engineers can take ownership of significant challenges, experiment with cutting-edge technologies, and see their solutions directly impact how leading AI labs and enterprises build the next generation of AI systems.
Our Technology Stack
Our engineering team works with a modern tech stack designed for scalability, performance, and developer efficiency:
- Frontend: React.js with Redux, TypeScript
- Backend: Node.js, TypeScript, Python, some Java & Kotlin
- APIs: GraphQL
- Cloud & Infrastructure: Google Cloud Platform (GCP), Kubernetes
- Databases: MySQL, Spanner, PostgreSQL
- Queueing / Streaming: Kafka, PubSub
About Labelbox
Sourced by ZipRecruiter
Company size: 51 - 200 Employees
Headquarters location: San Francisco, CA, US
Year founded: 2018