1

Audio Annotation Jobs in California (NOW HIRING)

MLOps, feature engineering, model training and inference. • Experience with labeling tools, audio annotation platforms, or human-in-the-loop annotation pipelines. • Experience at a high-growth ...

We are looking for a strategic thinker with strong Audio AI data annotation experience who can translate loosely defined technical objectives into executable programs and grow relationships by ...

Principal ML Engineer

Palo Alto, CA · On-site

$250K - $350K/yr

Experience with labeling tools, audio annotation platforms, or human-in-the- loop annotation pipelines. * Experience at a high-growth startup or tech company operating at scale. * Deep experience ...

We are looking for a strategic thinker with strong Audio AI data annotation experience who can translate loosely defined technical objectives into executable programs and grow relationships by ...

Work with technical staff to improve annotation tools for efficient audio workflows. BASIC QUALIFICATIONS: * Native proficiency in Bengali with exposure to diverse accents, dialects, or regional ...

Work with technical staff to improve annotation tools for efficient audio workflows. BASIC QUALIFICATIONS: * Native proficiency in Chinese with exposure to diverse accents, dialects, or regional ...

Work with technical staff to improve annotation tools for efficient audio workflows. BASIC QUALIFICATIONS: * Native proficiency in Arabic with exposure to diverse accents, dialects, or regional ...

Work with technical staff to improve annotation tools for efficient audio workflows. BASIC QUALIFICATIONS: * Native proficiency in Danish with exposure to diverse accents, dialects, or regional ...

next page

Showing results 1-20

Audio Annotation information

What are the key skills and qualifications needed to thrive as an Audio Annotator, and why are they important?

To thrive as an Audio Annotator, you need strong attention to detail, excellent listening skills, and familiarity with linguistic concepts, often supported by relevant coursework or experience in linguistics or audio processing. Proficiency in annotation tools such as ELAN, Audacity, or Praat, as well as experience with data labeling platforms, is typically required. Strong organizational skills, patience, and the ability to work independently make someone stand out in this role. These skills ensure accurate and consistent audio data labeling, which is essential for training reliable AI and speech recognition systems.

What are some common challenges faced by audio annotators, and how can they be managed effectively?

Audio annotators often encounter challenges such as distinguishing overlapping voices, dealing with low-quality recordings, and maintaining consistency in labeling. To manage these, it's important to use high-quality headphones, familiarize yourself with annotation guidelines, and communicate regularly with your team to resolve ambiguities. Many organizations also provide regular feedback sessions and quality checks to ensure accuracy and support continuous improvement.

What is audio annotation?

Audio annotation is the process of labeling or tagging audio data with relevant information, such as identifying sounds, speech, speakers, or background noises. This process helps train machine learning models to recognize and understand audio content. Audio annotation can involve tasks like transcribing speech, marking segments with specific sounds, or categorizing audio clips by genre or emotion. It is widely used in developing applications for speech recognition, virtual assistants, and audio analysis.
What job categories do people searching Audio Annotation jobs in California look for? The top searched job categories for Audio Annotation jobs in California are:
What cities in California are hiring for Audio Annotation jobs? Cities in California with the most Audio Annotation job openings:
Infographic showing various Audio Annotation job openings in California as of May 2026, with employment types broken down into 79% Full Time, 19% Part Time, and 2% Contract. Highlights an 46% Physical, 1% Hybrid, and 53% Remote job distribution.

Principal ML Engineer

Sanas

Palo Alto, CA • On-site

Full-time

Posted 3 days ago


Job description

Job Summary:
Sanas is pioneering the future of human communication with its innovative real-time speech AI platform. The Principal Machine Learning Engineer will lead the design and implementation of Machine Learning infrastructure for Voice AI products, shaping the technical vision and mentoring a team of engineers.
Responsibilities:
• Architect robust, modular ML pipelines for model experimentation, feature extraction, and production inference
• Collaborate with data engineering to improve audio dataset quality, labeling pipelines, and feature engineering
• Mentor and collaborate with other ML engineers and research scientists to ensure best practices in model development, evaluation, and deployment.
• Optimize models for latency, memory, and real-time performance on CPU/GPU/edge hardware.
• Introduce frameworks for continual learning, model versioning, and A/B testing in production.
• Stay current with advancements in Voice AI, Deep learning and multimodal model architectures
Qualifications:
Required:
• 10+ years of experience in Machine Learning Systems, ML workflows with at least 3+ years in a technical leadership capacity
• Advanced proficiency in Python and ML frameworks like PyTorch, TensorFlow, or JAX
• Strong understanding of Deep learning architectures like RNNs, LSTMs, CNNs, Transformers, CTC and their application in Accent translation, Noise cancellation, Acoustic Modeling, Language Modeling and Language Translation
• Experience deploying ML models to production (e.g., via ONNX, TensorRT, TorchScript, or custom inference stacks)
Preferred:
• Familiarity with audio data and its unique challenges, like large file sizes, time-series features, metadata handling, is a strong plus.
• Experience with Voice AI models like ASR, TTS and speaker verification.
• Familiarity with real-time data processing frameworks like Kafka, Flink, Druid and Pinot
• Familiarity with ML workflows including: MLOps, feature engineering, model training and inference.
• Experience with labeling tools, audio annotation platforms, or human-in-the-loop annotation pipelines.
• Experience at a high-growth startup or tech company operating at scale.
• Deep experience with ML tooling for training and serving models, ideally in audio or speech domains (e.g., PyTorch, ONNX, Hugging Face Transformers, torchaudio).
• Experience deploying real-time ASR, TTS, or voice synthesis models in production.
• Background in DSP, audio augmentation, or working with noisy or multilingual datasets.
Company:
Sanas is a real-time speech-understanding platform that modulates accents while preserving voices and emotions for natural interactions. Founded in 2020, the company is headquartered in Palo Alto, USA, with a team of 51-200 employees. The company is currently Growth Stage.