They are seeking a Tech Lead Manager for their ML Systems team to build and optimize a training and ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
They are seeking a Tech Lead Manager for their ML Systems team to build and optimize a training and ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
The Tech Lead Manager will build and optimize the training and inference framework for large ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
The Tech Lead Manager will build and optimize the training and inference framework for large ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
This role involves managing the operational execution and delivery of technical work, ensuring ... RLHF systems. Company : Scale's mission is to develop reliable AI systems for the world's most ...
This role involves managing the operational execution and delivery of technical work, ensuring ... RLHF systems. Company : Scale's mission is to develop reliable AI systems for the world's most ...
Developer
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
Developer
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Strong stakeholder management and technical leadership skills. Preferred * Multi-agent orchestration experience. * AI evaluation and governance frameworks. * Fine-tuning, RLHF, or model optimization ...
Strong stakeholder management and technical leadership skills. Preferred * Multi-agent orchestration experience. * AI evaluation and governance frameworks. * Fine-tuning, RLHF, or model optimization ...
... RLHF systems. Compensation packages at Scale for eligible roles include base salary, equity, and ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
... RLHF systems. Compensation packages at Scale for eligible roles include base salary, equity, and ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Applied AI/ML Engineer
$180K - $230K/yr
Own the model serving layer -- deploying models, managing inference infrastructure (API providers ... Identify opportunities to fine-tune open-source models using techniques such as SFT, DPO, and RLHF ...
Applied AI/ML Engineer
$180K - $230K/yr
Own the model serving layer -- deploying models, managing inference infrastructure (API providers ... Identify opportunities to fine-tune open-source models using techniques such as SFT, DPO, and RLHF ...
Delivery Lead
New York, NY · Remote
$110K - $140K/yr
... RLHF, annotation, model evaluation) * STEM background or strong technical fluency * Python & REACT working knowledge * Experience managing distributed contributor workforces at scale * Background in ...
Quick apply
Delivery Lead
New York, NY · Remote
$110K - $140K/yr
... RLHF, annotation, model evaluation) * STEM background or strong technical fluency * Python & REACT working knowledge * Experience managing distributed contributor workforces at scale * Background in ...
Direct experience with AI/ML platform products - data labeling, RLHF, fine-tuning workflows, or ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Direct experience with AI/ML platform products - data labeling, RLHF, fine-tuning workflows, or ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Senior Research Engineer
New York, NY · On-site
$180K - $200K/yr
There is no middle management. There are no layers of approval. The company is designed, from the ... Spearhead pre-training and post-training efforts, including RLHF, DPO, RLAIF, and other alignment ...
Senior Research Engineer
New York, NY · On-site
$180K - $200K/yr
There is no middle management. There are no layers of approval. The company is designed, from the ... Spearhead pre-training and post-training efforts, including RLHF, DPO, RLAIF, and other alignment ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
AI Engineer
New York, NY · On-site
$200K - $300K/yr
Design strategies to manage latency, output variance, and graceful error handling at scale ... Research or applied experience with LLM agents, RL (offline/online, RLHF/RLAIF), constrained ...
AI Engineer
New York, NY · On-site
$200K - $300K/yr
Design strategies to manage latency, output variance, and graceful error handling at scale ... Research or applied experience with LLM agents, RL (offline/online, RLHF/RLAIF), constrained ...
Familiarity with SFT, DPO, RLHF, or similar techniques. • Understanding of evaluation methodology ... GPUs, compute management, debugging common training failures. You don't need to be an infra ...
Familiarity with SFT, DPO, RLHF, or similar techniques. • Understanding of evaluation methodology ... GPUs, compute management, debugging common training failures. You don't need to be an infra ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - LLM Customization Team Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - LLM Customization Team Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - LLM Customization Team Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - LLM Customization Team Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager Rlhf information
See Edison, NJ salary details
$25.4K - $34K
9% of jobs
$34K - $42.6K
15% of jobs
$43.3K is the 25th percentile. Wages below this are outliers.
$42.6K - $51.2K
17% of jobs
The median wage is $54.1K / yr.
$51.2K - $59.8K
27% of jobs
$65.1K is the 75th percentile. Wages above this are outliers.
$59.8K - $68.4K
12% of jobs
$68.4K - $77K
8% of jobs
$77K - $85.6K
4% of jobs
$85.6K - $94.3K
3% of jobs
$94.3K - $102.9K
2% of jobs
$102.9K - $111.5K
2% of jobs
$111.5K - $120.1K
1% of jobs
$25.4K
$61.6K
$120.1K
How much do manager rlhf jobs pay per year?
Job description
Scale AI is a company focused on developing reliable AI systems for critical decisions. They are seeking a Tech Lead Manager for their ML Systems team to build and optimize a training and inference framework that supports machine learning research and development.
Responsibilities:
• Build, profile and optimize our training and inference framework.
• Collaborate with ML and research teams to accelerate their research and development, and enable them to develop the next generation of models and data curation.
• Research and integrate state-of-the-art technologies to optimize our ML system.
Qualifications:
Required:
• Passionate about system optimization
• Experience with multi-node LLM training and inference
• Experience with developing large-scale distributed ML systems
• Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
• Strong software engineering skills, proficient in frameworks and tools such as CUDA, Pytorch, transformers, flash attention, etc.
• Strong written and verbal communication skills to operate in a cross functional team environment.
Preferred:
• Demonstrated expertise in post-training methods and/or next generation use cases for large language models including instruction tuning, RLHF, tool use, reasoning, agents, and multimodal, etc.
Company:
Scale’s mission is to develop reliable AI systems for the world’s most important decisions. Founded in 2016, the company is headquartered in San Francisco, USA, with a team of 501-1000 employees. The company is currently Late Stage.
About Scale AI
Sourced by ZipRecruiter
Industry
Software development
Company size
201 - 500 Employees
Headquarters location
San Francisco, CA, US
Year founded
2016