
Software Engineer AI Model Training Jobs (NOW HIRING)

AI Software Engineer

El Segundo, CA · On-site

$80K - $210K/yr

About the Role: We're looking for an AI/ML Software Engineer to play a foundational role in ... Establish scalable MLOps pipelines and real-time inference services to streamline model training ...

Senior Software Engineer - AI / Java

Boston, MA · Hybrid

$135K - $178K/yr

Support AI model evaluation and benchmarking using metrics such as BLEU, ROUGE, cosine similarity, and custom domain metrics * Partner cross-functionally with product, frontend, and data teams to ...

Lead Software Engineer, AI

Are you ready to shape the future of AI-driven content technology while ... training, fine-tuning, and evaluating AI/ML models with focus on efficiency, scalability, and ...

Lead Software Engineer, AI

Are you passionate about building AI-driven software that transforms how ... Hybrid Work Model: We've adopted a flexible hybrid working environment (2-3 days a week in the ...


Software Engineer AI Model Training information

Salary range chart: $63.5K, $147.5K, $205.5K

How much do Software Engineer AI Model Training jobs pay per year?

As of Jun 18, 2026, the average yearly pay for a Software Engineer in AI Model Training in the United States is $147,524, according to ZipRecruiter salary data. Most workers in this role earn between $120,000 and $173,000 per year, depending on experience, location, and employer.

What are some common challenges faced by Software Engineers specializing in AI model training, and how can they be addressed?

Software Engineers focusing on AI model training often encounter challenges such as managing large datasets, ensuring data quality, and optimizing model performance. Addressing these issues typically involves close collaboration with data scientists, domain experts, and DevOps engineers to streamline the data pipeline and refine training processes. Staying up to date with the latest advancements in machine learning frameworks and tools can also help overcome technical hurdles. Regular code reviews and cross-functional meetings further support problem-solving and foster a productive work environment.

What are the key skills and qualifications needed to thrive as a Software Engineer in AI Model Training, and why are they important?

To excel as a Software Engineer in AI Model Training, you need strong programming skills (especially in Python), a solid grasp of machine learning fundamentals, and typically a degree in computer science or a related field. Experience with frameworks like TensorFlow or PyTorch, familiarity with data processing tools, and sometimes certifications in AI or ML are highly valuable. Analytical thinking, problem-solving, and effective collaboration enhance your ability to develop and refine complex AI models. These skills ensure that AI solutions are robust, scalable, and aligned with organizational goals in a rapidly evolving technological landscape.

What is the difference between a Software Engineer in AI Model Training and a Data Scientist?

Aspect | Software Engineer AI Model Training | Data Scientist
Required Credentials | Bachelor's in CS or related field; experience with ML frameworks | Bachelor's or higher in CS, statistics, or related field; strong analytical skills
Work Environment | Development teams, AI labs, cloud platforms | Data analysis, research environments, business units
Employer & Industry Usage | Tech companies, AI startups, research institutions | Tech firms, finance, healthcare, consulting

While both roles involve working with data and machine learning, a Software Engineer in AI Model Training focuses on developing and optimizing AI models through coding and engineering practices, whereas a Data Scientist analyzes data, builds models, and generates insights. The roles often collaborate but differ in their core responsibilities and skill sets.

What does a Software Engineer in AI Model Training do?

A Software Engineer specializing in AI Model Training is responsible for designing, developing, and optimizing machine learning models. Their work involves preparing and processing large datasets, selecting appropriate algorithms, implementing training pipelines, and evaluating model performance. They collaborate closely with data scientists and other engineers to ensure that AI models are accurate, efficient, and suitable for deployment in real-world applications. Additionally, they may help maintain infrastructure for model training and contribute to research and development of new AI techniques.
More about Software Engineer AI Model Training jobs
Infographic: Software Engineer AI Model Training job openings in the United States as of June 2026. Employment types: 1% As Needed, 93% Full Time, 1% Part Time, 3% Temporary, 2% Contract. Work arrangement: 87% Physical, 5% Hybrid, 8% Remote. Average salary: $147,524 per year, or $70.9 per hour.

Software Engineer, AI Compute Infrastructure

HeyGen

San Francisco, CA • On-site

$203K - $241K/yr

Other

This job post expired 1 day ago. Applications are no longer accepted.


Job description

Software Engineer, AI Compute Infrastructure

Los Angeles, Palo Alto, San Francisco, Toronto, Singapore

About HeyGen

At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences.

We are seeking a seasoned Software Engineer to build and scale the foundational compute infrastructure that powers our state-of-the-art AI models, from multimodal training data pipelines to high-throughput, low-latency video generation.

Responsibilities

You will be the core engineer responsible for building the robust, efficient, and scalable platform that enables our research and production teams to rapidly iterate on HeyGen's generative video models. Your contributions will directly impact model performance, developer productivity, and the final quality of every AI-generated video.

  • Optimize GPU Utilization: Design and implement mechanisms to aggressively optimize GPU and cluster utilization across thousands of devices for inference, training, data processing, and large-scale deployment of our state-of-the-art video generation models.

  • Develop Large-Scale AI Job Framework: Build highly scalable, reliable frameworks for launching and managing massive, heterogeneous compute jobs, including multi-modal high-volume data ingestion/processing, distributed model training, and continuous evaluation/benchmarking.

  • Enhance Observability: Develop world-class observability, tracing, and visualization tools for our compute cluster to ensure reliability and diagnose performance bottlenecks (e.g., memory, bandwidth, communication).

  • Accelerate Pipelines: Collaborate closely with AI researchers and AI engineers to integrate innovative acceleration techniques (e.g., custom CUDA kernels, distributed training libraries) into production-ready, scalable training and inference pipelines.

  • Infrastructure Management: Champion the adoption and optimization of modern cloud and container technologies (Kubernetes, Ray) for elastic, cost-efficient scaling of our distributed systems.

Minimum Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

  • 5+ years of full-time industry experience in large-scale MLOps, AI infrastructure, or HPC systems.

  • Experience with data frameworks and standards such as Ray, Apache Spark, and LanceDB.

  • Strong proficiency in Python and a high-performance language such as C++ for developing core infrastructure components.

  • Deep understanding and hands-on experience with modern orchestration and distributed computing frameworks such as Kubernetes and Ray.

  • Experience with core ML frameworks such as PyTorch, TensorFlow, or JAX.

Preferred Qualifications
  • Master's or PhD in Computer Science or a related technical field.

  • Demonstrated Tech Lead experience, driving projects from conceptual design through to production deployment across cross-functional teams.

  • Prior experience building infrastructure specifically for Generative AI models (e.g., diffusion models, GANs, or large language models) where cost and latency are critical.

  • Proven background in building and operating large-scale data infrastructure (e.g., Ray, Apache Spark) to manage petabytes of multi-modal data (video, audio, text).

  • Expertise in GPU acceleration and deep familiarity with low-level compute programming, including CUDA, NCCL, or similar technologies for efficient inter-GPU communication.

What HeyGen Offers
  • Competitive salary and benefits package.
  • Dynamic and inclusive work environment.
  • Opportunities for professional growth and advancement.
  • Collaborative culture that values innovation and creativity.
  • Access to the latest technologies and tools.

HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.


About HeyGen

Sourced by ZipRecruiter

Industry: Video and audio streaming services

Company size: 11 - 50 Employees

Headquarters location: Los Angeles, CA, US

Year founded: 2020
