You will be the champion inside AWS for frontier model builders pushing the bounds of scale and ... You will develop deep knowledge of AI/ML training architectures, distributed training systems ...
You will be the champion inside AWS for frontier model builders pushing the bounds of scale and ... You will develop deep knowledge of AI/ML training architectures, distributed training systems ...
The role involves executing high-volume data tasks related to audio and language data, ensuring accuracy and consistency in the datasets used for training AI models. Responsibilities : • Execute ...
The role involves executing high-volume data tasks related to audio and language data, ensuring accuracy and consistency in the datasets used for training AI models. Responsibilities : • Execute ...
Sr. Manager Software Development, AI Models and Applications
San Jose, CA · On-site
$193K/yr
You will have the opportunity to shape the future of AI model training and inference optimizations across a variety of applications. * Talented Team: Join a team of highly skilled industry ...
Sr. Manager Software Development, AI Models and Applications
San Jose, CA · On-site
$193K/yr
You will have the opportunity to shape the future of AI model training and inference optimizations across a variety of applications. * Talented Team: Join a team of highly skilled industry ...
Develop new algorithms and methods for training AI models for enhancing the robot dexterity. * Conduct cutting edge research across multiple disciplines (Robotics, RL/IL, control, perception, LLM ...
Develop new algorithms and methods for training AI models for enhancing the robot dexterity. * Conduct cutting edge research across multiple disciplines (Robotics, RL/IL, control, perception, LLM ...
Speakers/Writers
Kansas City, MO · On-site
$15 - $60/hr
You will play a pivotal role in training AI models, ensuring the accuracy and relevance of Arabic content generated by AI. This position allows for flexible scheduling, and your contributions will ...
Speakers/Writers
Kansas City, MO · On-site
$15 - $60/hr
You will play a pivotal role in training AI models, ensuring the accuracy and relevance of Arabic content generated by AI. This position allows for flexible scheduling, and your contributions will ...
Speakers/Writers
Rockford, IL · On-site
$15 - $60/hr
You will play a pivotal role in training AI models,ensuring the accuracy and relevance of Arabic content generated byAI. This position allows for flexible scheduling, and yourcontributions will ...
Speakers/Writers
Rockford, IL · On-site
$15 - $60/hr
You will play a pivotal role in training AI models,ensuring the accuracy and relevance of Arabic content generated byAI. This position allows for flexible scheduling, and yourcontributions will ...
The role focuses on building datasets for AI systems by executing high-volume data labeling tasks, ensuring accuracy and consistency in the data used for training AI models. Responsibilities : • ...
The role focuses on building datasets for AI systems by executing high-volume data labeling tasks, ensuring accuracy and consistency in the data used for training AI models. Responsibilities : • ...
Postdoctoral researcher in AI for Climate - Surrogate Multiscale Modeling (NYU Courant Institute ...
New York, NY · On-site
$62K - $94K/yr
Expertise in developing and training AI models * Proficiency in Python * Experience with HPC (GPUs preferred) * Related Skills and Other Requirements * Ability to work at the interface of AI and ...
Postdoctoral researcher in AI for Climate - Surrogate Multiscale Modeling (NYU Courant Institute ...
New York, NY · On-site
$62K - $94K/yr
Expertise in developing and training AI models * Proficiency in Python * Experience with HPC (GPUs preferred) * Related Skills and Other Requirements * Ability to work at the interface of AI and ...
AI Systems, Training
Palo Alto, CA · On-site
$123K - $168K/yr
They are seeking a key contributor to build a next-generation ML model training platform and co-design training systems alongside novel AI models and hardware. Responsibilities : • Build and ...
AI Systems, Training
Palo Alto, CA · On-site
$123K - $168K/yr
They are seeking a key contributor to build a next-generation ML model training platform and co-design training systems alongside novel AI models and hardware. Responsibilities : • Build and ...
Speakers/Writers
$15 - $60/hr
You will play a pivotal role in training AI models, ensuring the accuracy and relevance of Arabic content generated by AI. This position allows for flexible scheduling, and your contributions will ...
Speakers/Writers
$15 - $60/hr
You will play a pivotal role in training AI models, ensuring the accuracy and relevance of Arabic content generated by AI. This position allows for flexible scheduling, and your contributions will ...
Senior AI & Data Engineering Lead - Senior Vice President
Jersey City, NJ · On-site
$110K - $150K/yr
Ensuring the architecture can scale horizontally to handle petabytes of data and a high volume of concurrent queries, which is critical for pre-training large language models (LLMs). 2. Advanced AI ...
Senior AI & Data Engineering Lead - Senior Vice President
Jersey City, NJ · On-site
$110K - $150K/yr
Ensuring the architecture can scale horizontally to handle petabytes of data and a high volume of concurrent queries, which is critical for pre-training large language models (LLMs). 2. Advanced AI ...
The Founding Member of Technical Staff will be responsible for post-training AI models for chip design tasks, co-designing RL environments, and ensuring production-ready implementations.
New
The Founding Member of Technical Staff will be responsible for post-training AI models for chip design tasks, co-designing RL environments, and ensuring production-ready implementations.
New
AI/ML Solutions Engineer
Chantilly, VA · On-site
$84K - $112K/yr
Hands-on experience in training AI models and integrating pre-trained models into dataflows and software architectures. Highly Desired Skills and Demonstrated Experience: * Experience implementing ...
Quick apply
AI/ML Solutions Engineer
Chantilly, VA · On-site
$84K - $112K/yr
Hands-on experience in training AI models and integrating pre-trained models into dataflows and software architectures. Highly Desired Skills and Demonstrated Experience: * Experience implementing ...
Staff Artificial Intelligence Research Engineer (SJ2026KP)
San Jose, CA · On-site
$237K - $249K/yr
Minimum Experience Requirement: 5 years of experience working in developing, evaluating, training AI models for audio or visual perception, language models, reinforcement learning, or related.
Staff Artificial Intelligence Research Engineer (SJ2026KP)
San Jose, CA · On-site
$237K - $249K/yr
Minimum Experience Requirement: 5 years of experience working in developing, evaluating, training AI models for audio or visual perception, language models, reinforcement learning, or related.
Applied Machine Learning Engineer
San Francisco, CA · On-site
$220K - $320K/yr
Deeply understand customer use cases to inform training strategies and surface edge cases Requirements * 2+ years of experience training AI models using PyTorch * Hands-on experience with post ...
Applied Machine Learning Engineer
San Francisco, CA · On-site
$220K - $320K/yr
Deeply understand customer use cases to inform training strategies and surface edge cases Requirements * 2+ years of experience training AI models using PyTorch * Hands-on experience with post ...
Physics AI Trainer (PhD)
$70 - $85/hr
We are currently helping hire for one of the leading AI labs (via one of our partners); helping them train their AI models. Have you been curious as to how LLMs generate expert answers and would you ...
Physics AI Trainer (PhD)
$70 - $85/hr
We are currently helping hire for one of the leading AI labs (via one of our partners); helping them train their AI models. Have you been curious as to how LLMs generate expert answers and would you ...
AI/ML Solutions Engineer
Chantilly, VA · On-site
$84K - $112K/yr
Hands-on experience in training AI models and integrating pre-trained models into dataflows and software architectures. Highly Desired Skills and Demonstrated Experience: * Experience implementing ...
AI/ML Solutions Engineer
Chantilly, VA · On-site
$84K - $112K/yr
Hands-on experience in training AI models and integrating pre-trained models into dataflows and software architectures. Highly Desired Skills and Demonstrated Experience: * Experience implementing ...
Computer Vision Engineer
OR · Remote
$114K - $134K/yr
Hands-on experience training AI models with custom datasets * Structure-from-Motion, 3D Reconstruction and Photogrammetry * Pointcloud and Mesh Processing * Familiarity with Geospatial Data and ...
Quick apply
Computer Vision Engineer
OR · Remote
$114K - $134K/yr
Hands-on experience training AI models with custom datasets * Structure-from-Motion, 3D Reconstruction and Photogrammetry * Pointcloud and Mesh Processing * Familiarity with Geospatial Data and ...
Computer Vision Engineer
$114K - $134K/yr
Hands-on experience training AI models with custom datasets * Structure-from-Motion, 3D Reconstruction and Photogrammetry * Pointcloud and Mesh Processing * Familiarity with Geospatial Data and ...
Computer Vision Engineer
$114K - $134K/yr
Hands-on experience training AI models with custom datasets * Structure-from-Motion, 3D Reconstruction and Photogrammetry * Pointcloud and Mesh Processing * Familiarity with Geospatial Data and ...
Economics AI Trainer (PhD)
$70 - $90/hr
We are currently helping hire for one of the leading AI labs (via one of our partners); helping them train their AI models. Have you been curious as to how LLMs generate expert answers and would you ...
Economics AI Trainer (PhD)
$70 - $90/hr
We are currently helping hire for one of the leading AI labs (via one of our partners); helping them train their AI models. Have you been curious as to how LLMs generate expert answers and would you ...
Training Ai Models information
See salary details
$15.14 - $20.80
5% of jobs
$20.80 - $26.46
16% of jobs
$28.10 is the 25th percentile. Wages below this are outliers.
$26.46 - $32.12
14% of jobs
The median wage is $35.85 / hr.
$32.12 - $37.78
23% of jobs
$37.78 - $43.44
12% of jobs
$48.40 is the 75th percentile. Wages above this are outliers.
$43.44 - $49.10
6% of jobs
$49.10 - $54.76
5% of jobs
$54.76 - $60.42
3% of jobs
$60.42 - $66.08
11% of jobs
$66.08 - $71.74
4% of jobs
$71.74 - $77.40
1% of jobs
$15
$42
$77
How much do training ai models jobs pay per hour?
What are some common challenges faced when training AI models, and how are they addressed on the job?
One of the most common challenges in training AI models is handling large, complex datasets that often contain errors or inconsistencies, which can impact model performance. Professionals in this role frequently collaborate with data engineers and subject matter experts to clean and properly label data, as well as implement quality assurance checks throughout the process. Additionally, tuning model parameters and addressing issues such as overfitting or underfitting often require experimentation and iterative testing. Most teams employ version control and hold regular review sessions to ensure best practices are followed, making collaboration and communication essential parts of overcoming these challenges.
What is a Training AI Models job?
A Training AI Models job involves developing, refining, and optimizing machine learning models by providing them with relevant data, adjusting parameters, and evaluating their performance. Professionals in this role clean and preprocess data, select appropriate algorithms, and fine-tune models for accuracy and efficiency. They may also work with engineers and researchers to ensure models generalize well to real-world applications. The goal is to create AI systems that perform specific tasks effectively, such as natural language processing, image recognition, or predictive analytics.
What are the key skills and qualifications needed to thrive in the Training Ai Models position, and why are they important?
To thrive in Training AI Models, you need strong programming skills in languages like Python, a solid understanding of machine learning concepts, and typically a degree in computer science, data science, or a related field. Experience with machine learning frameworks such as TensorFlow, PyTorch, and familiarity with data preprocessing and annotation tools are commonly required; certifications in AI or data science can be advantageous. Effective communication, keen attention to detail, and collaboration are vital soft skills for working with cross-functional teams and ensuring data quality. These abilities are crucial for developing accurate models, delivering impactful AI solutions, and maintaining high standards throughout the model development lifecycle.

Full-time
Medical, Dental, Vision, Life, Retirement, PTO
Posted 22 days ago
Amazon rating
7.4
Based on 6,820 frontline employees who took The Breakroom Quiz
7th of 39 rated national retailers
Job description
AWS Neuron is hiring a Principal Technical Product Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning frameworks, and training performance optimization. Your mission is to enable researchers and operators to train frontier models at scale on Trainium, from single-node experimentation to distributed training across thousands of nodes.
You will be the champion inside AWS for frontier model builders pushing the bounds of scale and resilience for current and emerging training paradigms. You will work with customers inside and outside the company to identify key improvements and stay ahead of the training landscape. You will define how Neuron supports the training AI/ML ecosystem and what tools customers will use for their training workflows on Trainium.
To be successful, you will partner with engineering teams building training libraries and distributed training infrastructure, applied scientists developing optimization techniques, and PMs responsible for compiler, runtime, NKI, and infrastructure. You will develop deep knowledge of AI/ML training architectures, distributed training systems, model parallelism strategies, and training performance optimization to effectively define product strategy and make informed technical decisions.
The Ideal Candidate
The ideal candidate will have solid understanding of large-scale model training, distributed training architectures, post-training workflows, and reinforcement learning. They should be able to assess technical implications of training software stack decisions, understand customer needs, and drive developer experience improvements. The ideal candidate can navigate ambiguity in a fast-moving, early-stage initiative, balance competing priorities across multiple workstreams, and drive alignment across engineering and science stakeholders with excellent written and verbal communication abilities
Key job responsibilities
Training Product Strategy & Roadmap
Define and execute training product strategy and roadmap working backwards from customer requirements in collaboration with engineering leadership. Define the vision for how customers train frontier models at scale on Trainium, balancing performance, developer experience, and AI/ML ecosystem compatibility. Produce PRFAQs and PRDs for training capabilities. Drive technical alignment across Neuron training libraries, distributed training infrastructure, and dependencies. Partner with PMs responsible for compiler, NKI, runtime, and infrastructure. Drive trade-offs between training performance, scalability, developer experience, and AI/ML ecosystem compatibility. Define requirements for reusable training building blocks that compose into end-to-end workflows.
Post-Training, RL & Emerging Workflows
Drive strategy for post-training workflows including RLHF, DPO, reward modeling, and fine-tuning at scale. Define requirements for how Neuron supports emerging training paradigms, model architectures, and RL-based optimization loops. Lead the product experience for RL research-to-production workflows on Trainium. Create and optimize RL libraries and frameworks to help researchers and production model builders.
Customer Engagement & Enablement
Work with BD, Solutions Architecture, and GTM teams to engage customers training frontier models on Trainium. Understand their distributed training challenges, RL needs, performance optimization requirements, and framework preferences. Translate customer pain points into product requirements. Define success metrics for training adoption and performance. Support customer enablement for training migration and optimization.
Training AI/ML Ecosystem & Delivery
Define how Neuron supports the training AI/ML ecosystem and what tools customers will use for their training workflows on Trainium. Own the technical depth on training-specific AI/ML ecosystem tools and define how Neuron's training libraries integrate with them. Track training-specific AI/ML ecosystem trends and feed them into product planning. Drive open source community engagement and upstream contributions for training-related tools. Coordinate with BD on partnership discussions where training-specific technical input is needed.
Launch & Go-to-Market
Lead end-to-end launches for training capabilities, coordinating documentation, field enablement, and customer communications. Partner with Marketing and Solutions Architecture to drive awareness and adoption. Define launch success criteria and track adoption metrics.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge sharing and mentorship. We operate with startup like velocity, prioritizing talent acquisition, hands on leadership, and flexible organization. Our senior members enjoy one on one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.
Inclusive Team Culture
Here at AWS, it's in our nature to learn and be curious. Our employee led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there's nothing we can't achieve in the cloud.
Mentorship & Career Growth
We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge sharing, mentorship and other career advancing resources here to help you develop into a better rounded professional.
About Amazon Annapurna Labs
Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world's most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. Because of our teams breadth of talent, we have been able to improve AWS cloud infrastructure in high performance machine learning with AWS Neuron, Inferentia and Trainium ML chips, in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), and in computing with AWS Graviton and F1 EC2 instances.
About AWS Utility Computing (UC)
AWS Utility Computing (UC) provides product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the AWS portfolio.
About AWS
Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating, that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
BASIC QUALIFICATIONS
- 7+ years of working as a Technical Product Manager experience
- Bachelor's degree in computer science, engineering, analytics, mathematics, statistics, IT or equivalent
- Experience with large-scale model training workflows, including solid knowledge of distributed training concepts
- Familiarity with major AI/ML training frameworks (JAX or PyTorch) and how training libraries interact with them
- Experience driving product strategy, long-term roadmap development, and cross-organizational alignment
- Excellent written and verbal communication abilities, including executive-level communication
PREFERRED QUALIFICATIONS
- Experience with PyTorch or JAX distributed training
- Track record of driving developer training libraries and tools
- Experience with design and scaling of training optimization software (e.g., NeMo, TorchTitan, TRL, VeRL, MaxText, AXLearn, or similar)
- Experience leading RL for research-to-production at scale
- Experience with post-training workflows including RLHF, DPO, reward modeling, and fine-tuning
- Experience with AI/ML training accelerators and hardware, including training performance optimization, profiling, and tooling
- Experience with distributed training of large-scale models including model parallel training techniques (tensor, pipeline, sequence, and expert parallelism)
- Experience working on open source and GitHub-first developer products with deep customer interactions
- Track record of driving open standards and AI/ML ecosystem integration for training workflows
- Experience operating in early-stage, ambiguous environments with startup-like velocity
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.
USA, CA, Cupertino - 208,300.00 - 281,800.00 USD annually
USA, WA, SEATTLE - 181,100.00 - 245,000.00 USD annually
USA, WA, Seattle - 181,100.00 - 245,000.00 USD annually
About Amazon
Sourced by ZipRecruiter
Amazon.com, Inc., commonly known as Amazon, is an American multinational technology company. It was founded by Jeff Bezos in 1994 and initially started as an online marketplace for books. Since then, Amazon has expanded its operations and become one of the largest e-commerce companies in the world. Amazon's primary business is its online retail platform, where customers can purchase a vast array of products, including electronics, clothing, books, home goods, and much more. The company offers a convenient and user-friendly shopping experience, with features such as fast shipping, customer reviews, and personalized recommendations. In addition to its e-commerce platform, Amazon has diversified its business into various other areas. One of its notable ventures is Amazon Web Services (AWS), a comprehensive cloud computing platform that provides services such as storage, compute power, and database management to individuals and businesses. AWS has become a leader in the cloud computing industry, powering many websites and applications worldwide. Amazon has also developed its own consumer electronics, including the popular Amazon Kindle e-reader, Fire tablets, Fire TV streaming devices, and the Alexa-powered Echo smart speakers. The Alexa voice assistant, integrated into these devices, allows users to interact with their devices using voice commands, perform tasks, and access information. Furthermore, Amazon has expanded into media and entertainment. It operates Prime Video, a streaming service that offers a wide range of movies, TV shows, and original content. Amazon Music provides a platform for streaming and purchasing digital music, while Audible offers audiobooks and other audio content. The company's commitment to customer satisfaction and convenience is demonstrated by its membership program, Amazon Prime. Prime members receive various benefits, including free two-day shipping, access to streaming services, exclusive deals, and more.
Industry
It services, book publishers, retail, real estate and computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Seattle, WA, US