Experience with foundation models for data annotation * Experience with MLOps tooling (Weights & Biases, MLflow, SageMaker, or equivalents) * Experience shipping LLM- or agent-powered features in a ...
Experience with foundation models for data annotation * Experience with MLOps tooling (Weights & Biases, MLflow, SageMaker, or equivalents) * Experience shipping LLM- or agent-powered features in a ...
... LLM-powered data synthesis and data annotation tasks, prompt engineering, localization and quality evaluations. Experience Job Responsibilities • Provide linguistic expertise in the areas of syntax ...
Quick apply
... LLM-powered data synthesis and data annotation tasks, prompt engineering, localization and quality evaluations. Experience Job Responsibilities • Provide linguistic expertise in the areas of syntax ...
Sr Machine Learning Engineer, Tech Lead - Autograder Systems, Evaluation
Cupertino, CA · On-site
$126.50K - $166.60K/yr
... LLM-as-judge, preference learning, and calibration techniques to measurably improve evaluation ... Partner with data annotation teams to define labeling guidelines that feed autograder training.
Sr Machine Learning Engineer, Tech Lead - Autograder Systems, Evaluation
Cupertino, CA · On-site
$126.50K - $166.60K/yr
... LLM-as-judge, preference learning, and calibration techniques to measurably improve evaluation ... Partner with data annotation teams to define labeling guidelines that feed autograder training.
Experience with foundation models for data annotation * Experience with MLOps tooling (Weights & Biases, MLflow, SageMaker, or equivalents) * Experience shipping LLM- or agent-powered features in a ...
Quick apply
Experience with foundation models for data annotation * Experience with MLOps tooling (Weights & Biases, MLflow, SageMaker, or equivalents) * Experience shipping LLM- or agent-powered features in a ...
Human Factors Research Engineer, AIML Data Operations
$171.60K - $258.10K/yr
... annotation workflows, or AI/LLM model development Pay & Benefits At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to ...
Human Factors Research Engineer, AIML Data Operations
$171.60K - $258.10K/yr
... annotation workflows, or AI/LLM model development Pay & Benefits At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to ...
ML Engineer
New York, NY · On-site
$132K - $165K/yr
Partner with Data Science on annotation workflows, PII scrubbing, and ground-truth pipelines ... Practical experience with evaluation for ML or LLM systems - golden datasets, model-as-a-judge, IAA ...
ML Engineer
New York, NY · On-site
$132K - $165K/yr
Partner with Data Science on annotation workflows, PII scrubbing, and ground-truth pipelines ... Practical experience with evaluation for ML or LLM systems - golden datasets, model-as-a-judge, IAA ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
$16 - $20.75/hr
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
$16 - $20.75/hr
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
Data Labeling Associate
$17.50 - $22.75/hr
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
Data Labeling Associate
$17.50 - $22.75/hr
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
Data Quality Analyst
San Francisco, CA · On-site
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
Data Quality Analyst
San Francisco, CA · On-site
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
The ideal candidate will have a foundational understanding of machine learning, data annotation ... Deliver detailed reports on findings, including aspects such as utterance quality, LLM evaluation ...
Tamil Translator (Remote) | Sigma AI
$45K - $58.90K/yr
Experience with one or more of the following: computational linguistics, corpus analysis, language data annotation, or LLM training * Strong attention to detail What will you do? Annotation - Audio ...
Tamil Translator (Remote) | Sigma AI
$45K - $58.90K/yr
Experience with one or more of the following: computational linguistics, corpus analysis, language data annotation, or LLM training * Strong attention to detail What will you do? Annotation - Audio ...
Senior Financial Analyst (CFA Level 3 Required)
New York, NY · On-site +1
$93.50K - $116.50K/yr
Familiarity with AI/LLM-related projects or data annotation * Experience working with both structured and unstructured datasets Additional Information * Short-term, project-based engagement (~2 weeks)
Quick apply
Senior Financial Analyst (CFA Level 3 Required)
New York, NY · On-site +1
$93.50K - $116.50K/yr
Familiarity with AI/LLM-related projects or data annotation * Experience working with both structured and unstructured datasets Additional Information * Short-term, project-based engagement (~2 weeks)
Senior Financial Analyst (CFA Level 3 Required)
New York, NY · On-site +1
$93.50K - $116.50K/yr
Familiarity with AI/LLM-related projects or data annotation * Experience working with both structured and unstructured datasets Additional Information * Short-term, project-based engagement (~2 weeks)
Senior Financial Analyst (CFA Level 3 Required)
New York, NY · On-site +1
$93.50K - $116.50K/yr
Familiarity with AI/LLM-related projects or data annotation * Experience working with both structured and unstructured datasets Additional Information * Short-term, project-based engagement (~2 weeks)
Senior Financial Analyst (CFA Level 3 Required)
New York, NY · On-site +1
$93.50K - $116.50K/yr
Familiarity with AI/LLM-related projects or data annotation * Experience working with both structured and unstructured datasets Additional Information * Short-term, project-based engagement (~2 weeks)
Senior Financial Analyst (CFA Level 3 Required)
New York, NY · On-site +1
$93.50K - $116.50K/yr
Familiarity with AI/LLM-related projects or data annotation * Experience working with both structured and unstructured datasets Additional Information * Short-term, project-based engagement (~2 weeks)
... data annotation) for firm wide ML practitioners. Job responsibilities * Works on several new ... Build LLM/SLM - powered applications including RAG-based systems, summarization/extraction ...
... data annotation) for firm wide ML practitioners. Job responsibilities * Works on several new ... Build LLM/SLM - powered applications including RAG-based systems, summarization/extraction ...
... reliable LLM applications. Required Skills & Qualifications Strong proficiency in Python is ... Create APIs and internal web tools for data annotation, curation, and model interaction
Quick apply
... reliable LLM applications. Required Skills & Qualifications Strong proficiency in Python is ... Create APIs and internal web tools for data annotation, curation, and model interaction
Llm Annotation information
See salary details
$11K - $13.8K
0% of jobs
$13.8K - $16.5K
0% of jobs
$16.5K - $19.3K
0% of jobs
$19.3K - $22.1K
0% of jobs
$22.1K - $24.9K
0% of jobs
$24.9K - $27.6K
0% of jobs
$27.6K - $30.4K
0% of jobs
$30.4K - $33.2K
0% of jobs
$33.2K - $36K
0% of jobs
$36K - $38.7K
0% of jobs
$39.4K is the 25th percentile. Wages below this are outliers.
$38.7K - $41.5K
100% of jobs
$11K
$41.5K
How much do llm annotation jobs pay per year?
What are the key skills and qualifications needed to thrive as an LLM Annotation Specialist, and why are they important?
What are some common challenges faced by LLM Annotation specialists, and how can they be addressed?
What is LLM annotation?
What is the difference between Llm Annotation vs Data Labeler?
| Aspect | Llm Annotation | Data Labeler |
|---|---|---|
| Required Credentials | Basic computer skills, sometimes familiarity with AI tools | Basic skills, often on-the-job training |
| Work Environment | Remote or office-based, tech-focused | Remote or on-site, varied industries |
| Industry Usage | AI, machine learning, NLP projects | Various industries including marketing, healthcare, and tech |
| Search & Comparison Intent | Understanding roles in AI data preparation | General data labeling tasks |
In summary, Llm Annotation involves specialized annotation for large language models, often requiring familiarity with AI tools, while Data Labeler is a broader role focused on labeling data across multiple industries with minimal technical requirements.
Full-time
Posted 24 days ago
Job description
About the Role
We're hiring an AI Engineer to work on the AI core of Mill Commercial - the computer vision and agentic systems that turn a stream of food waste into operational intelligence for commercial kitchens. Mill Commercial integrates a camera and onboard compute directly into our high-capacity food recycler; models running on the edge identify, classify, and quantify food scraps at the point of generation, and our vision pipeline turns that signal into procurement and operational guidance for large food service operators.
You'll join a small AI team, building the data and training pipeline that produces our edge CV models, designing the cloud-side evaluation that tells us whether those models are good enough to ship, and helping build the agentic, LLM-driven product features that turn raw waste data into customer-facing insights and recommendations. This is a hands-on senior IC role for someone who's equally comfortable fine-tuning a segmentation model, prompting a VLM, and wiring an agent into a product feature.
What You'll Do
- Build and manage the end-to-end ML training pipeline: data ingestion from deployed kitchen units, ground truth generation, annotation tooling (including foundation-model-assisted labeling), training, evaluation, and retraining cycles.
- Train and evaluate segmentation, classification, and mass-estimation models for the Mill Commercial camera pipeline - from prompting foundation models to fine-tuning ConvNets and VLMs.
- Build the cloud-side evaluation harness that tells us how our shipped edge models are actually performing in the field - automated, reproducible, and aligned to product accuracy targets across food types, kitchen environments, and deployment configurations.
- Own MLOps: reproducible training, experiment tracking, model versioning, and automated evaluation against product-defined accuracy targets.
- Export and validate models for deployment to edge devices, working closely with the edge team on optimization, quantization, and integration.
- Help design and build the LLM- and agent-powered product features that consume waste characterization data and turn it into customer-facing recommendations - purchasing suggestions, anomaly explanations, operational nudges. Define how agents call tools, ground in customer data, and stay reliable in production.
- Analyze failure cases systematically - unfamiliar food classes, novel kitchen environments, challenging lighting and clutter conditions - and drive the data and modeling decisions that close accuracy gaps.
- Strong fundamentals in computer vision and deep learning - segmentation, detection, classification, tracking. You understand the architectures well enough to make informed choices.
- Fluency with modern ML approaches - VLMs, LLMs, foundation models, and agentic systems - alongside classical deep learning. You know when to fine-tune a ConvNet, when to prompt a VLM, and when to wire up an agent, and you understand the practical realities of putting any of them into a product.
- Experience building ML training pipelines and data annotation systems at scale.
- Experience evaluating ML models rigorously - designing metrics, building the eval harness, and using results to drive product decisions rather than just publish a number.
- Proficiency with cloud ML infrastructure (AWS or equivalent) - you've managed training jobs, data pipelines, and experiment workflows in production.
- Familiarity with cloud-to-edge model deployment.
- Clear, direct communication - you can explain tradeoffs to non-technical stakeholders, push back honestly when you disagree, and write docs that others can follow.
- Genuine interest in applying AI to food waste reduction and sustainability. This is a mission-driven product and we want people who care about the mission.
Software skills: Python, PyTorch, OpenCV. Strong familiarity with MLOps on AWS infrastructure. Experience with LLM and agent frameworks. Google Cloud / Gemini experience is a plus.
Nice to Have
- Experience with video understanding (temporal consistency, tracking, video segmentation)
- Experience with foundation models for data annotation
- Experience with MLOps tooling (Weights & Biases, MLflow, SageMaker, or equivalents)
- Experience shipping LLM- or agent-powered features in a consumer or B2B product
- Hardware / IoT product experience, particularly with computer vision and cameras for embedded systems
The estimated base salary range for this position is $240 to $280k, which does not include the value of benefits or a potential equity grant. A wide range of factors are considered in making compensation decisions, including but not limited to skill sets, market conditions, experience and training, licensure and certifications, and business and organizational needs. At Mill, it is not typical for an individual to be hired at or near the top of the range for their role.
About Mill
Sourced by ZipRecruiter
Company size
1 - 10 Employees
Headquarters location
Colwell, IA, US
Year founded
1993