... independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions. • Portfolio (strongly preferred for advanced candidates): Voice samples, annotated ...
What You'll Do * Perform annotation and labeling tasks for Chinese generative AI datasets ... These tools assist our recruitment team but do not replace human judgment. Final hiring decisions ...
AI Tutor, Rufus, Amazon Rufus
Seattle, WA · On-site
This role will be responsible for conducting high-judgment evaluations and labeling data in order ... BASIC QUALIFICATIONS - Experience in natural language data labeling, data annotation, linguistic ...
Multimedia Generative AI Analyst - USA (Remote)
Charleston, WV · Remote
$28.80/hr
... judgment when reviewing ambiguous or complex scenarios. * Basic understanding of robotics behavior in real-world environments. * Comfortable working in structured annotation platforms or similar ...
Demonstrated technical judgment in designing or operating annotation systems that support machine learning training, evaluation, or model assessment * Good understanding of annotation systems and ...
Remote | Chinese STEM Translation Quality Expert -- $80-$100/hour
New York, NY · On-site +1
$80 - $100/hr
Prepare concise written judgments explaining translation quality and technical correctness ... Background in AI evaluation, data annotation, language model review, or applied research workflows
Remote | Spanish Scientific Translation Quality Expert -- $45-$60/hour
New York, NY · On-site +1
$45 - $60/hr
Prepare concise written judgments explaining translation quality and technical correctness ... Background in AI evaluation, data annotation, language model review, or applied research workflows
Conduct detailed data annotation and quality assurance of natural language datasets following ... These tools assist our recruitment team but do not replace human judgment. Final hiring decisions ...
Senior Applied Scientist - AI Evaluation & Quality Systems
Seattle, WA · On-site
$104.10K - $142.20K/yr
... types, annotation modalities, and cold start conditions • Build and maintain calibration frameworks that keep LLM evaluators anchored to human judgment over time • Develop anomaly detection ...
AI Tutor - Polish
$15.75 - $20.25/hr
Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions. * Portfolio (strongly preferred for advanced candidates)
Annotation Judge information
What is the difference between Annotation Judge vs Data Annotator?
| Aspect | Annotation Judge | Data Annotator |
|---|---|---|
| Credentials | Typically requires basic education, sometimes certification in data labeling | Usually requires similar or less formal education, often on-the-job training |
| Work Environment | Office or remote, working with data labeling platforms | Office or remote, performing data labeling tasks |
| Industry Usage | Used across AI, machine learning, and data science projects | Common in AI, machine learning, and data preparation workflows |
| Search & Comparison Intent | Often compared for roles involving data review and quality control | Compared for entry-level data labeling roles |
The main difference between an Annotation Judge and a Data Annotator is where each sits in the data pipeline. Data Annotators perform the initial labeling of data; Annotation Judges review and validate those annotations to ensure quality and accuracy. Both roles are essential in AI data pipelines, with Data Annotators focused on data preparation and Annotation Judges on quality control.

Full-time
This job post has expired today. Applications are no longer accepted.
Job description
xAI is focused on creating AI systems that enhance human understanding and knowledge. They are seeking an AI Tutor specialized in multilingual audio capabilities to train and refine their AI model, Grok, in voice interactions and speech recognition across various languages and cultural contexts.
Responsibilities:
• Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.
• Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.
• Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.
• Work with technical staff to improve annotation tools for efficient audio workflows.
Qualifications:
Required:
• Native proficiency in Arabic with exposure to diverse accents, dialects, or regional variations.
• Proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.
• Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.
• Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.
• Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.
• Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
• Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.
• Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.
• Commitment to developing AI that masters sophisticated multilingual audio capabilities.
Preferred:
• Demonstration of exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work.
• Deep understanding of, and taste for, what makes audio data good and useful.
• Strong command of advanced transcription and annotation practices, including handling disfluencies, accents, and prosodic features (intonation, stress, rhythm, emotion, etc.) with high consistency and accuracy.
• Background in linguistics (e.g., phonetics, phonology, sociolinguistics), speech sciences, cognitive science, or a related field, or equivalent practical experience, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns.
• Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.
• Professional experience in voice work, including voice acting, voice recording, podcasting with a measurable audience (e.g., X following), or similar audio production demonstrating attention to clarity and recording quality.
• Demonstrated ability to exercise independent judgment in ambiguous audio scenarios and make consistent, defensible annotation decisions.
• Portfolio (strongly preferred for advanced candidates): Voice samples, annotated transcripts, or audio-related work demonstrating quality, methodology, and attention to detail.
• Candidates with professional experience in voice, linguistics, speech data, or speech evaluation and research are especially encouraged to apply.
Company:
xAI is an artificial intelligence startup that develops AI solutions and tools to enhance reasoning and search capabilities. It is a sub-organization of SpaceX. Founded in 2023, the company is headquartered in Palo Alto, USA, and has a team of 1001-5000 employees. It is currently a late-stage company.