... RLHF), orchestration architectures, and data ingestion pipelines (ETL/ELT). • Experience ... or managing observability tools, tracing, or monitoring systems for distributed systems or ML ...
... RLHF), orchestration architectures, and data ingestion pipelines (ETL/ELT). • Experience ... or managing observability tools, tracing, or monitoring systems for distributed systems or ML ...
Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to ... manage applicants' needs, provide our services, and comply with applicable laws. Any information we ...
... RLHF), orchestration architectures, and data ingestion pipelines (ETL/ELT). • Experience ... or managing observability tools, tracing, or monitoring systems for distributed systems or ML ...
... RLHF), orchestration architectures, and data ingestion pipelines (ETL/ELT). • Experience ... or managing observability tools, tracing, or monitoring systems for distributed systems or ML ...
Strategic Projects Lead
$75K - $110K/yr
Experience managing distributed workforces or marketplace operations * Exposure to AI data, RLHF, or model evaluation workflows * Background in investment banking, private equity, or management ...
Strategic Projects Lead
$75K - $110K/yr
Experience managing distributed workforces or marketplace operations * Exposure to AI data, RLHF, or model evaluation workflows * Background in investment banking, private equity, or management ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
California, MD · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
California, MD · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Senior AI Model Fine-Tuning Engineer
Phoenix, AZ · On-site
$128K - $176K/yr
You will use advanced techniques like prompt engineering, RLHF, and instruction tuning to ensure ... you manage your health and achieve your goals across many areas of your life. This includes a ...
Senior AI Model Fine-Tuning Engineer
Phoenix, AZ · On-site
$128K - $176K/yr
You will use advanced techniques like prompt engineering, RLHF, and instruction tuning to ensure ... you manage your health and achieve your goals across many areas of your life. This includes a ...
Senior AI Model Fine-Tuning Engineer
Phoenix, AZ · On-site
$128K - $176K/yr
You will use advanced techniques like prompt engineering, RLHF, and instruction tuning to ensure ... you manage your health and achieve your goals across many areas of your life. This includes a ...
Senior AI Model Fine-Tuning Engineer
Phoenix, AZ · On-site
$128K - $176K/yr
You will use advanced techniques like prompt engineering, RLHF, and instruction tuning to ensure ... you manage your health and achieve your goals across many areas of your life. This includes a ...
Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning ...
San Francisco, CA · On-site +1
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning ...
San Francisco, CA · On-site +1
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
... Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning ...
... Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning ...
This role involves managing the operational execution and delivery of technical work, ensuring ... RLHF systems. Company : Scale's mission is to develop reliable AI systems for the world's most ...
This role involves managing the operational execution and delivery of technical work, ensuring ... RLHF systems. Company : Scale's mission is to develop reliable AI systems for the world's most ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
Boston, MA · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
Boston, MA · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
San Francisco, CA · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
San Francisco, CA · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
Portland, OR · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Au...
Portland, OR · On-site
Own post-training strategy for model development - from RLHF and preference optimization to agentic ... Manage, mentor, and grow a team of AI scientists * Set technical direction and research priorities ...
... Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning ...
... Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning ...
Experience designing RLHF or structured human feedback programs.Background in large language model ... Prior experience managing a data operations team.
Experience designing RLHF or structured human feedback programs.Background in large language model ... Prior experience managing a data operations team.
Training Specialist
$60K - $125K/yr
Our work spans RLHF, evals, red-teaming, and custom multimodal data creation, all powered by Label ... You'll own complex, high-stakes data programs end-to-end, managing expert workforces, navigating ...
Training Specialist
$60K - $125K/yr
Our work spans RLHF, evals, red-teaming, and custom multimodal data creation, all powered by Label ... You'll own complex, high-stakes data programs end-to-end, managing expert workforces, navigating ...
... Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning ...
... Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning ...
Developer
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
Developer
New York, NY · On-site
Deep understanding of Data preprocessing, Prompt management, Caching, Validation, Advanced RAG, RLHF, and success measurement. * Thorough understanding of LLMOps, data pipelines and other common ...
$30/hr
Define the data collection strategy to create a proprietary feedback loop (RLHF) that makes our ... You won't be managing a "feature list." You will be deciding how an entire industry evolves. If you ...
$30/hr
Define the data collection strategy to create a proprietary feedback loop (RLHF) that makes our ... You won't be managing a "feature list." You will be deciding how an entire industry evolves. If you ...
Technical Program Manager, Discovery
San Francisco, CA · On-site
$152K - $196K/yr
Have deep, hands-on experience with ML training pipelines, RLHF systems, and large-scale data ... Have excellent stakeholder management and communication skills, with the ability to influence ...
Technical Program Manager, Discovery
San Francisco, CA · On-site
$152K - $196K/yr
Have deep, hands-on experience with ML training pipelines, RLHF systems, and large-scale data ... Have excellent stakeholder management and communication skills, with the ability to influence ...
Manager Rlhf information
See salary details
$24.5K - $32.8K
9% of jobs
$32.8K - $41.1K
15% of jobs
$41.8K is the 25th percentile. Wages below this are outliers.
$41.1K - $49.5K
17% of jobs
The median wage is $52.3K / yr.
$49.5K - $57.8K
27% of jobs
$62.9K is the 75th percentile. Wages above this are outliers.
$57.8K - $66.1K
12% of jobs
$66.1K - $74.4K
8% of jobs
$74.4K - $82.7K
4% of jobs
$82.7K - $91K
3% of jobs
$91K - $99.4K
2% of jobs
$99.4K - $107.7K
2% of jobs
$107.7K - $116K
1% of jobs
$24.5K
$59.5K
$116K
How much do manager rlhf jobs pay per year?
$129K - $170K/yr
Full-time
Posted 29 days ago
Cisco Systems rating
8.6
Based on 39 frontline employees who took The Breakroom Quiz
15th of 139 rated electronics manufacturers
Job description
Cisco is revolutionizing how data and infrastructure connect and protect organizations in the AI era. They are seeking a Senior Staff Product Manager to join their AI Foundations Team, where the role involves defining the roadmap for AI and ensuring the development of enterprise-grade AI models while maintaining security and compliance.
Responsibilities:
• Define the strategic roadmap for end-to-end AI/ML lifecycles, ensuring our tools provide seamless, secure, and performant model development.
• Drive the integration of robust governance and guardrails to control model behavior and ensure compliance with security policies.
• Build and scale platform observability layers to allow users to trace performance and debug complex multi-step workflows.
Qualifications:
Required:
• Bachelor’s degree plus 12 years of related experience, or Master’s degree plus 8 years, or PhD plus 5 years in Product Management with a focus on AI/ML platform products.
• Demonstrated experience in the AI/ML lifecycle, including model fine-tuning (SFT/RLHF), orchestration architectures, and data ingestion pipelines (ETL/ELT).
• Experience implementing security protocols, compliance frameworks, and guardrails within AI or software development platforms.
• Proven experience in building or managing observability tools, tracing, or monitoring systems for distributed systems or ML models.
• Proficiency in SQL and Python for data analysis and managing large-scale data flows into AI-ready formats.
Preferred:
• Advanced degree (Master's or Ph.D.) in a quantitative field or an MBA.
• Hands-on experience with the model and agent development lifecycle (e.g., PyTorch, Hugging Face, LangChain, LlamaIndex).
• Knowledge of MLOps and LLMOps principles within networking or cybersecurity.
• Experience building ecosystems, APIs, or SDKs for enterprise-grade AI platforms.
• Proven ability to drive adoption for builder-centric products and scale AI solutions from concept to market.
Company:
Cisco develops, manufactures, and sells networking hardware, telecommunications equipment, and other technology services and products. It is a sub-organization of Cisco Press. Founded in 1984, the company is headquartered in San Jose, USA, with a team of 10001+ employees. The company is currently Late Stage.
What Cisco Systems employees say
Pay
Benefits
Hours and flexibility
Workplace
Get the full story on Breakroom
About Cisco Systems
Sourced by ZipRecruiter
Cisco Systems, a global tech titan based in San Jose, CA, US, operates in the information technology and services industry. Founded in 1984, the company was derived from a project between two computer scientists from Stanford University. They aimed to connect different networks of computer systems at the university, resulting in the first multi-protocol router, and subsequently, the birth of Cisco. As an industry-leading manufacturer of networking hardware and telecommunications equipment, Cisco's product and services range includes routers, switches, firewall devices, and telecommunication technology. The company's mission, "to shape the future of the Internet by creating unprecedented value and opportunity for our customers, employees, investors, and ecosystem partners," is a testament to its pursuit of technology-forward innovation and customer satisfaction.
Industry
Computer and computer peripheral equipment and software wholesalers
Company size
10,000+ Employees
Headquarters location
San Jose, CA, US
Year founded
1984