1

Llm Trainer Jobs (NOW HIRING)

LLM Training Engineer

San Francisco, CA · On-site

$155K - $220K/yr

About the Role As an LLM Training Engineer , you'll work across the full foundation-model stack: pretraining and scaling , post-training and Reinforcement Learning , sandbox environments for ...

Familiarity with distributed computing, cloud infrastructure, and orchestration tools, such as Kubernetes, Apache Airflow (DAG), Docker, Conductor, Ray for LLM training and inference at scale.Hands ...

Familiarity with distributed computing, cloud infrastructure, and orchestration tools, such as Kubernetes, Apache Airflow (DAG), Docker, Conductor, Ray for LLM training and inference at scale. Hands ...

next page

Showing results 1-20

Llm Trainer information

See salary details

$15

$36

$92

How much do llm trainer jobs pay per hour?

As of Jun 9, 2026, the average hourly pay for llm trainer in the United States is $36.91, according to ZipRecruiter salary data. Most workers in this role earn between $19.23 and $52.88 per hour, depending on experience, location, and employer.

What are some typical responsibilities and challenges faced by an LLM Trainer on a day-to-day basis?

LLM Trainers are responsible for designing and refining training datasets, developing prompts, evaluating model outputs, and working closely with engineers and data scientists to optimize large language models. Common challenges include maintaining data quality, mitigating model biases, and staying up-to-date with rapidly evolving AI research and best practices. You’ll often collaborate with cross-functional teams, communicate findings clearly, and adapt to new tools or methodologies. This dynamic environment offers opportunities for innovation and skill development, making it an excellent fit for those passionate about advancing AI technology.

What are the key skills and qualifications needed to thrive in the Llm Trainer position, and why are they important?

To thrive as an LLM Trainer, you need a deep understanding of natural language processing (NLP), machine learning principles, and data annotation techniques, often supported by a background in computer science or related fields. Familiarity with tools like Python, PyTorch or TensorFlow, data labeling platforms, and version control systems is essential, along with knowledge of prompt engineering and model fine-tuning. Strong analytical thinking, attention to detail, and collaborative communication skills are crucial soft skills for working with cross-functional AI teams. These competencies are important for developing high-quality language models that meet user needs and industry standards.

What is an LLM Trainer job?

An LLM Trainer is responsible for training and fine-tuning large language models (LLMs) to improve their accuracy, efficiency, and relevance for specific applications. This role involves curating and preprocessing training data, designing training methodologies, and evaluating model performance. LLM Trainers work closely with data scientists, engineers, and researchers to optimize models for tasks such as natural language understanding, text generation, and conversational AI. They also ensure ethical AI practices by mitigating biases and refining model outputs.

What cities are hiring for Llm Trainer jobs? Cities with the most Llm Trainer job openings:
What are the most commonly searched types of Llm Trainer jobs? The most popular types of Llm Trainer jobs are:
What states have the most Llm Trainer jobs? States with the most job openings for Llm Trainer jobs include:
Infographic showing various Llm Trainer job openings in the United States as of May 2026, with employment types broken down into 10% Internship, 50% Full Time, and 40% Contract. Highlights an 60% In-person, 20% Hybrid, and 20% Remote job distribution, with an average salary of $76,772 per year, or $36.9 per hour.

Senior Software Engineer - LLM Trainer

Kake Group

San Francisco, CA • Remote

$125K - $165K/yr

Contractor

Posted 13 days ago


Job description

We are looking for a Senior Software Engineer to contribute to the development and evaluation of AI training data for a leading expert human data platform for AI agents and LLMs.

In this role, you will work at the intersection of software engineering and artificial intelligence, helping AI labs and companies build better, safer, and more capable models. You will leverage your deep technical expertise to write prompts, produce reference-quality code solutions, evaluate model outputs, and provide the structured human signal that makes AI systems smarter.

This is not a traditional engineering role - it is a unique opportunity for senior engineers who want to shape how the next generation of AI understands, generates, and reasons about code.

Key Responsibilities

  • Create and review coding tasks based on real-world software engineering scenarios, including debugging, refactoring, code generation, API usage, automated tests, performance, security, and edge cases.
  • Write high-quality reference solutions that are correct, clear, testable, and aligned with task requirements.
  • Evaluate AI-generated code and responses using structured rubrics, assessing correctness, clarity, security, performance, maintainability, and instruction-following.
  • Compare multiple model responses, select the strongest answer, and justify your decision with clear technical reasoning.
  • Identify bugs, hallucinated APIs, missing edge cases, weak explanations, and poor engineering decisions in AI-generated outputs.
  • Work with terminal-based development workflows when needed, including running tests, debugging issues, managing dependencies, and navigating repositories.
  • Follow detailed guidelines consistently and participate in calibration activities to ensure high-quality, reliable evaluations.

Core Requirements

  • 5+ years of professional software engineering experience in a backend, fullstack, or systems role.
  • Strong proficiency in at least one core programming language, ideally Python, JavaScript/TypeScript, Go, Java, C++, or SQL.
  • Hands-on experience with Terminal-Bench, with the ability to evaluate AI agent performance on terminal-based tasks including compiling code, running tests, managing environments, and completing multi-step software engineering workflows.
  • Comfortable working with Git, command line/terminal, and common development workflows.
  • Ability to evaluate code critically - not only whether it works, but whether it is well-designed, secure, and maintainable.
  • Prior experience in AI data production, RLHF, data annotation, or LLM evaluation projects.
  • Excellent written and verbal communication skills in English.
  • Ability to work independently in a remote, asynchronous, fast-paced environment.
  • High attention to detail and the ability to follow complex, rubric-based guidelines consistently

Nice-to-Have

  • Experience with Python-heavy workflows, automated testing frameworks, Docker, Linux, bash, or containerized environments.
  • Experience with repo-level code reasoning, large codebases, or open-source contributions.
  • Background in backend systems, data engineering, DevOps, infrastructure, security, or large codebase.

Additional

- US Timezone Overlap: PST (GMT -8)

Please Note: Due to the high volume of applications, only shortlisted candidates will be contacted.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.