Audio Speech Machine Learning Jobs (NOW HIRING)

Apple

Machine Learning Engineer, Siri Speech

Cupertino, CA

$150K - $225K/yr

Good knowledge in machine learning technologies related to speech and audio processing; experience with image processing is a plus. Strong problem-solving skills and ability to work independently as ...

Dolby Laboratories, Inc.

Senior Generative AI Researcher

Atlanta, GA

... for audio, speech, and music. You will join the Machine Reasoning and Perception team to join a ... The ideal candidate would have a strong background in deep learning, both in terms of conceptual ...

Dolby

Senior Generative AI Researcher

Atlanta, GA · On-site

10a Labs

Machine Learning Engineer

Washington, DC · On-site +1

$130K - $200K/yr

Design, train, evaluate, and deploy machine learning models across text, image, audio, and ... LLMs, text classification, information extraction, retrieval systems, speech-to-text, agentic ...

HARMAN International

Audio ML Engineer (Research)

Los Angeles, CA · On-site

Artificial Intelligence & Machine Learning Worker Type Reference: Regular - Permanent Pay Rate Type ... Experience with audio ML domains (speech enhancement, denoising, source separation, spatial audio ...

femtoAI

Audio Deep Learning Engineer

San Bruno, CA · On-site

Develop and optimize deep learning models for audio processing, including tasks like speech ... Desired Skills and Experience Deep learning, Machine learning, DSP, Python, PyTorch Benefits ...

Apple

Machine Learning Engineer - Speech & Multimodal Language Modeling

Cupertino, CA · On-site

... scale audio data processing on distributed systems Experience with prompt evaluation and ... Machine Learning journals or conferences Excellent communication skills and cross-functional ...

Netflix

Machine Learning Manager - Localization Algorithms

Los Angeles, CA

$523K - $920K/yr

Responsibilities Lead a broad portfolio of end-to-end initiatives in multimodal LLM and audio ... Highly proficient in multimodal LLM and speech algorithm research, and deeply committed to staying ...

Netflix

Machine Learning Manager - Localization Algorithms

Los Gatos, CA

$523K - $920K/yr

Apple

Machine Learning Engineer - Speech & Multimodal Language Modeling

Cupertino, CA

$150K - $277K/yr

Work with Data Engineers to process large scale speech audio data for foundation model training ... in Computer Science, Machine Learning, Statistics, or other STEM field 5+ years of hands-on ...

Qualcomm

Speech & Audio Research Engineer

San Diego, CA · On-site

Fundamental research in the fields of signal processing and machine learning * Advanced use cases for human interaction that combine communication modalities from areas such as speech, audio ...

10a Labs

Machine Learning Engineer

Washington, DC · On-site

$130K - $200K/yr

PROLIM Global Corporation

Research Scientist IV

Burlingame, CA · On-site

... machine learning models for speech quality, run computational experiments and report findings. * Implement ML models that emulate aspects of human auditory perception. * Develop next-gen audio ...

Modulate

Machine Learning Engineer

Somerville, MA · On-site

$170K - $200K/yr

We're looking for a Senior Machine Learning Engineer to help advance the state of voice ... Experience with audio models or speech systems (ASR, TTS, speaker modeling, etc.) * Experience with ...

Deepgram

Research Engineer, Machine Learning Systems

San Francisco, CA · On-site +1

$150K - $250K/yr

Even if billions of hours of audio were accessible, its inherent high dimensionality creates ... speech technologies, internal tooling, and innovative data strategies. You'll work at the ...

Modulate

Machine Learning Engineer

Somerville, MA · On-site +1

$170K - $200K/yr

Adobe, Inc.

Senior Research Scientist

San Francisco, CA · On-site

$116K - $147K/yr

Deep expertise in audio and machine learning, including strong intuition for: * Speech and audio generation * Audio representations and modeling * Training large-scale neural models * Hands-on ...

NICE

Senior Machine Learning Engineer

Sandy, UT · On-site

$113K - $150K/yr

As Senior Machine Learning Engineer, you will own the evaluation and optimization of speech ... LoRA/PEFT for speech models, inference optimization (quantization, SGLang/vLLM serving for audio ...

Audio Speech Machine Learning information

What are some common challenges faced when developing machine learning models for audio speech applications?

A key challenge in audio speech machine learning roles is dealing with diverse and noisy audio data, which can significantly affect model accuracy. Additionally, models must be robust to different accents, languages, and speaking styles, requiring large and varied datasets for training and validation. Collaboration with data engineers, linguists, and software developers is often necessary to ensure high-quality data pipelines and model integration into production systems. Staying updated with the latest research and optimizing models for real-time performance are also ongoing aspects of the role.

What is an Audio Speech Machine Learning Engineer?

An Audio Speech Machine Learning Engineer is a specialized professional who designs, develops, and implements machine learning models that process and analyze audio and speech data. Their work involves tasks like speech recognition, speaker identification, and audio event detection by leveraging algorithms and large datasets. These engineers collaborate with data scientists, software developers, and linguists to create applications such as voice assistants, transcription tools, and automated customer service systems. Expertise in signal processing, deep learning frameworks, and programming languages like Python is crucial for this role.

What is the difference between an Audio Speech Machine Learning Engineer and a Speech Data Analyst?

Aspect	Audio Speech Machine Learning	Speech Data Analyst
Required Credentials	Degree in Computer Science, Data Science, or related fields; knowledge of ML frameworks	Degree in Data Analysis, Statistics, or related fields; experience with data tools
Work Environment	Research labs, tech companies, AI startups	Data analysis teams, research institutions, tech firms
Industry Usage	Developing speech recognition, voice assistants, NLP applications	Analyzing speech datasets, improving speech models, reporting insights

Audio Speech Machine Learning focuses on developing algorithms for speech recognition and processing, often involving model training and AI development. Speech Data Analysts interpret speech data, generate insights, and support model improvements. Both roles require strong analytical skills, but their core tasks differ: one builds models, the other analyzes data.

What are the key skills and qualifications needed to thrive as an Audio Speech Machine Learning Engineer, and why are they important?

To thrive as an Audio Speech Machine Learning Engineer, you need a solid background in machine learning, signal processing, and programming (typically Python), along with a relevant degree in computer science or a related field. Familiarity with tools like TensorFlow or PyTorch, audio processing libraries (such as Librosa), and experience with speech datasets and ASR systems are commonly required. Critical soft skills include problem-solving, innovation, and effective communication for collaborating with cross-functional teams. These skills are essential to develop accurate, scalable speech recognition systems that advance voice-driven technology.

Infographic showing various Audio Speech Machine Learning job openings in the United States as of July 2026, with employment types broken down into 77% Full Time, 20% Part Time, 1% Temporary, and 2% Contract. Highlights an 89% Physical, 1% Hybrid, and 10% Remote job distribution.

Machine Learning Engineer, Siri Speech

Apple

Cupertino, CA

$150K - $225K/yr

Full-time

Medical, Dental, Retirement

Re-posted 7 days ago

Apple rating

8.1

Based on 670 frontline employees who took The Breakroom Quiz

5th of 30 rated technology retailers

Job description

Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Come join us and do your best work. Here at Apple, you’ll do more than join something - you’ll add something.
Join the Siri Speech team and play a part in the next revolution in human-computer interaction. You’ll help create groundbreaking technology for large-scale systems, spoken language, big data, and artificial intelligence, building production-quality models that power natural voice experiences used by millions. In this role, you will design and develop machine learning models, fine-tune deep learning systems for speaker recognition and multimodal understanding, and collaborate across software engineering, data science, and product to integrate solutions into production.
Description
On the Siri Speech team, you will drive research and development for natural voice interaction, investigating new approaches and fine-tuning deep learning models. You will design, develop, and implement machine learning models across speech, NLP, and multimodal applications, integrate ML solutions into existing workflows and large-scale systems, and work with large quantities of data to create production-quality models at scale. You will write clean, efficient, well-documented code, participate in code reviews, and mentor junior team members. Collaboration with cross-functional partners to define requirements and deliver high-quality solutions is essential. You will contribute to research initiatives and stay current with advancements in ML, HCI, LLMs, speech recognition, and signal processing. You should be passionate about creating and shipping phenomenal products and thrive in a fast-paced environment with rapidly changing priorities.
Responsibilities
Design, develop, and implement machine learning models for speech, NLP, and multimodal applications.
Investigate and fine-tune deep learning architectures for natural voice interaction and speaker recognition.
Integrate ML solutions into production systems and existing workflows at scale.
Collaborate with data scientists, software engineers, and product managers to define requirements and deliverables.
Write clean, efficient, well-documented code and participate in code reviews.
Analyze large datasets and apply state-of-the-art methods to build production-quality models.
Mentor junior team members and contribute to engineering best practices.
Stay current with advances in ML, HCI, LLMs, speech recognition, and signal processing and contribute to research.
Preferred Qualifications
M.S. in Computer Science or a related field, or a Bachelor’s degree with equivalent experience.
Minimum Qualifications
Strong proficiency in Python; good coding skills in Bash scripting and in an OOP or functional language such as Java, C, C++, Go, or Rust.
Experience with machine learning algorithms and techniques, including deep learning.
Hands-on experience with TensorFlow and/or PyTorch; familiarity with scikit-learn.
Experience with version control systems such as Git.
Good knowledge of machine learning technologies related to speech and audio processing; experience with image processing is a plus.
Strong problem-solving skills and ability to work independently as well as in a team environment.
Excellent written and verbal communication skills.
Pay & Benefits
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $150,400 and $225,300, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

About Apple

Sourced by ZipRecruiter

Imagine what you could do here! At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, intelligent people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with all Apple Hardware products. The same real passion for innovation that goes into our products also applies to our practices strengthening our dedication to leave the world better than we found it.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Cupertino, CA, US

Year founded

1976

Website

apple.com

What Apple employees say

Pay

Most people get paid breaks

Most people get paid when they’re sick

The job rarely spills into unpaid time

Benefits

Sick days don’t use up paid time off

Most part-timers can get health insurance

Most part-timers get paid time off

Hours and flexibility

Less than 4 weeks notice of work schedule

Most people don’t worry about their hours

Only some people can choose their shifts

Workplace

Most people feel treated with respect

Most people get breaks without interruption

Most people are stressed out
