They are seeking a Tech Lead Manager for their ML Systems team to build and optimize a training and ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
They are seeking a Tech Lead Manager for their ML Systems team to build and optimize a training and ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
The Tech Lead Manager will build and optimize the training and inference framework for large ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
The Tech Lead Manager will build and optimize the training and inference framework for large ... RLHF/RLVR and related algorithms like PPO/GRPO etc. • Strong software engineering skills ...
Developer
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
Developer
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
Delivery Lead
New York, NY · Remote
$110K - $140K/yr
... RLHF, annotation, model evaluation) * STEM background or strong technical fluency * Python & REACT working knowledge * Experience managing distributed contributor workforces at scale * Background in ...
Quick apply
Delivery Lead
New York, NY · Remote
$110K - $140K/yr
... RLHF, annotation, model evaluation) * STEM background or strong technical fluency * Python & REACT working knowledge * Experience managing distributed contributor workforces at scale * Background in ...
Manage the project lifecycle from ideation and scoping to deployment and post-launch support ... RLHF, multi-task learning). * Model Optimization: Expertise in model compression and quantization ...
Manage the project lifecycle from ideation and scoping to deployment and post-launch support ... RLHF, multi-task learning). * Model Optimization: Expertise in model compression and quantization ...
... to manage their own context in long-horizon tasks. This is applied research with direct product ... RLHF, GRPO, PPO, RLVR, reward modeling, RL scaling laws Code generation and coding agents ...
... to manage their own context in long-horizon tasks. This is applied research with direct product ... RLHF, GRPO, PPO, RLVR, reward modeling, RL scaling laws Code generation and coding agents ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
AI Engineer
New York, NY · On-site
$200K - $300K/yr
Design strategies to manage latency, output variance, and graceful error handling at scale ... Research or applied experience with LLM agents, RL (offline/online, RLHF/RLAIF), constrained ...
AI Engineer
New York, NY · On-site
$200K - $300K/yr
Design strategies to manage latency, output variance, and graceful error handling at scale ... Research or applied experience with LLM agents, RL (offline/online, RLHF/RLAIF), constrained ...
Manager, Data Science - LLM Customization Team Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - LLM Customization Team Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Manager, Data Science - AI Foundations Data is at the center of everything we do. As a startup, we ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
Senior Manager, Data Science - AI Foundations Data is at the center of everything we do. As a ... RLHF. * You have an engineering mindset as shown by a track record of delivering models at scale ...
GenAI Product Engineering Lead
New York, NY · Remote
$104K - $138K/yr
Key Responsibilities Technical Leadership & Team Management * Lead, mentor, and grow a high ... RLHF) or synthetic data augmentation. • Establish model-drift detection and retraining triggers ...
Quick apply
GenAI Product Engineering Lead
New York, NY · Remote
$104K - $138K/yr
Key Responsibilities Technical Leadership & Team Management * Lead, mentor, and grow a high ... RLHF) or synthetic data augmentation. • Establish model-drift detection and retraining triggers ...
ML Researcher, Apple Foundation Models
$181K - $318K/yr
... to manage their own context in long-horizon tasks. This is applied research with direct product ... RLHF, GRPO, PPO, RLVR, reward modeling, RL scaling laws Code generation and coding agents ...
ML Researcher, Apple Foundation Models
$181K - $318K/yr
... to manage their own context in long-horizon tasks. This is applied research with direct product ... RLHF, GRPO, PPO, RLVR, reward modeling, RL scaling laws Code generation and coding agents ...
... Management Framework, ISO/IEC 42001:2023 (AI Management Systems), and the OWASP Generative AI ... RLHF, RLAIF, DPO). • LLM Ecosystems and Tooling: Proficiency with the HuggingFace ecosystem ...
... Management Framework, ISO/IEC 42001:2023 (AI Management Systems), and the OWASP Generative AI ... RLHF, RLAIF, DPO). • LLM Ecosystems and Tooling: Proficiency with the HuggingFace ecosystem ...
Senior Machine Learning Engineer
New York, NY · On-site
$185K - $280K/yr
Through unparalleled curriculum management functionality, Kiddom empowers schools and districts to ... Familiarity with foundation model adaptation techniques such as PEFT, LoRA, or RLHF. * Self ...
Senior Machine Learning Engineer
New York, NY · On-site
$185K - $280K/yr
Through unparalleled curriculum management functionality, Kiddom empowers schools and districts to ... Familiarity with foundation model adaptation techniques such as PEFT, LoRA, or RLHF. * Self ...
Senior Machine Learning Engineer
$185K - $280K/yr
Through unparalleled curriculum management functionality, Kiddom empowers schools and districts to ... Familiarity with foundation model adaptation techniques such as PEFT, LoRA, or RLHF. * Self ...
Senior Machine Learning Engineer
$185K - $280K/yr
Through unparalleled curriculum management functionality, Kiddom empowers schools and districts to ... Familiarity with foundation model adaptation techniques such as PEFT, LoRA, or RLHF. * Self ...
Manager Rlhf information
See Edison, NJ salary details
$25.4K - $34K
9% of jobs
$34K - $42.6K
15% of jobs
$43.3K is the 25th percentile. Wages below this are outliers.
$42.6K - $51.2K
17% of jobs
The median wage is $54.1K / yr.
$51.2K - $59.8K
27% of jobs
$65.1K is the 75th percentile. Wages above this are outliers.
$59.8K - $68.4K
12% of jobs
$68.4K - $77K
8% of jobs
$77K - $85.6K
4% of jobs
$85.6K - $94.3K
3% of jobs
$94.3K - $102.9K
2% of jobs
$102.9K - $111.5K
2% of jobs
$111.5K - $120.1K
1% of jobs
$25.4K
$61.6K
$120.1K
How much do manager rlhf jobs pay per year?

Job description
Scale AI is a company focused on developing reliable AI systems for critical decisions. They are seeking a Tech Lead Manager for their ML Systems team to build and optimize a training and inference framework that supports machine learning research and development.
Responsibilities:
• Build, profile and optimize our training and inference framework.
• Collaborate with ML and research teams to accelerate their research and development, and enable them to develop the next generation of models and data curation.
• Research and integrate state-of-the-art technologies to optimize our ML system.
Qualifications:
Required:
• Passionate about system optimization
• Experience with multi-node LLM training and inference
• Experience with developing large-scale distributed ML systems
• Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
• Strong software engineering skills, proficient in frameworks and tools such as CUDA, Pytorch, transformers, flash attention, etc.
• Strong written and verbal communication skills to operate in a cross functional team environment.
Preferred:
• Demonstrated expertise in post-training methods and/or next generation use cases for large language models including instruction tuning, RLHF, tool use, reasoning, agents, and multimodal, etc.
Company:
Scale’s mission is to develop reliable AI systems for the world’s most important decisions. Founded in 2016, the company is headquartered in San Francisco, USA, with a team of 501-1000 employees. The company is currently Late Stage.
About Scale AI
Sourced by ZipRecruiter
Industry
Software development
Company size
201 - 500 Employees
Headquarters location
San Francisco, CA, US
Year founded
2016