Machine Learning Infra Engineer
Seattle, WA · On-site
Familiarity with video/audio processing and storage systems
Meta Reality Labs Research is looking for experienced interns who are passionate about ground breaking research in audio signal processing, machine learning and audio visual learning to solve ...
Redmond, WA · On-site
$7K - $12K/mo
You'll design and build scalable ingestion pipelines that handle multi-modal content (PDF, audio, video, tables) using computer vision and machine learning, optimize system performance for millions ...
Redmond, WA · On-site
Understanding of machine learning techniques including predictive modeling, text and image mining ... video, audio, text and time series etc... Qualifications: * PhD degree in Computer Science ...
As an Embedded Machine Learning Engineer, you'll deploy efficient, low-power ML models directly ... Knowledge of computer vision, NLP, or audio processing in an embedded/robotics context. Experience ...
We are seeking individuals passionate in areas such as Natural Language Processing, Audio and Speech processing, Computer Vision, Machine Learning, Deep Learning, and Reinforcement Learning. Our ...
Drive the invention and development of novel AI Agent architectures and video/image/audio ... Build interface-oriented systems that use Machine Learning models, perform proof-of-concept ...
We are seeking individuals passionate in areas such as deep learning, computer vision, audio and speech processing, natural language processing, machine learning, reinforcement learning ...
... audio and images to apply from variety of techniques in computer vision, deep learning, machine learning and image processing algorithms to build content risk inspection systems. You will be ...
We are seeking individuals passionate in areas such as deep learning, computer vision, audio and speech processing, natural language processing, machine learning, reinforcement learning ...
Bellevue, WA · On-site
$7K - $12K/mo
Develop a long-term approach to generation-focused data that improves image, video, and audio ... D. in Computer Science, Machine Learning, or a related field preferred About Adobe Adobe empowers ...
Experience with Machine Learning for audio and visual synthesis About Meta: Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004 ...
$33.6K - $48.3K: 16% of jobs
$48.3K - $63K: 19% of jobs
$63K - $77.6K: 13% of jobs
$77.6K - $92.3K: 14% of jobs
$92.3K - $107K: 10% of jobs
$107K - $121.7K: 6% of jobs
$121.7K - $136.4K: 4% of jobs
$136.4K - $151.1K: 5% of jobs
$151.1K - $165.8K: 4% of jobs
$165.8K - $180.5K: 8% of jobs
$180.5K - $195.2K: 0% of jobs
$55K is the 25th percentile. Wages below this are outliers.
The median wage is $79.3K / yr.
$113.8K is the 75th percentile. Wages above this are outliers.
Chart scale: $33.6K, $96.1K, $195.2K
To thrive in Audio Machine Learning, you need a strong background in machine learning, digital signal processing, and proficiency with programming languages such as Python or MATLAB, typically supported by a relevant degree in computer science, electrical engineering, or a related field. Familiarity with frameworks like TensorFlow or PyTorch, experience with audio libraries (e.g., Librosa), and knowledge of cloud computing tools are highly valued, as are certifications in AI or data science. Strong problem-solving skills, creativity, and effective communication are essential soft skills for success in this field. These skills are crucial for developing innovative solutions, collaborating across multidisciplinary teams, and addressing complex audio data challenges in real-world projects.
Professionals in Audio Machine Learning typically spend their days designing, developing, and optimizing machine learning models tailored to audio data, such as speech or music recognition systems. They may also preprocess large datasets, extract and engineer relevant features, and collaborate closely with data scientists, audio engineers, and software developers to integrate their work into larger applications. Regular tasks often include running experiments, evaluating model performance, tuning hyperparameters, and keeping up with the latest advancements in the field. Team meetings, code reviews, and presentations of findings to stakeholders are also common parts of the workweek.
An Audio Machine Learning job involves developing algorithms and models that analyze, process, and generate audio data. Responsibilities typically include working with speech recognition, music analysis, sound classification, and audio enhancement. Professionals in this field use deep learning, signal processing, and neural networks to improve audio-based applications like voice assistants, noise reduction systems, and music recommendation engines. They often work with datasets of speech, music, or environmental sounds to build models that understand and manipulate audio signals effectively.

Other
Posted 13 days ago
Nuance Labs is building the next generation of emotionally expressive, real-time AI.
This is a critical role: you will build the infrastructure that powers our AI platform. You will own the systems that serve models at scale, orchestrate complex data workflows, and ensure our real-time video AI runs reliably with low latency for users worldwide.
Own Inference Infrastructure: Build and maintain the serving stack for multimodal AI workloads. Optimize for latency, throughput, and cost using batching strategies, autoscaling, and intelligent resource allocation.
Real-Time Video Streaming: Architect systems to handle long-lived WebRTC connections with unpredictable client behavior, ensuring smooth video and audio delivery at scale.
Orchestrate Data Workflows: Build robust pipelines for offline processing, evaluation, and training using orchestration frameworks like Dagster or Ray. Manage petabyte-scale video storage and network requirements.
GPU Cluster Management: Configure and maintain GPU clusters using Kubernetes and Terraform. Implement monitoring, autoscaling based on custom metrics, and cost optimization strategies.
Developer Tooling: Build CI/CD, evaluation, and versioning systems that enable safe, zero-downtime model deployments and rapid iteration cycles.
Infrastructure Expertise: Strong practical experience with Kubernetes, Terraform, and cloud platforms. You can design secure, scalable systems and debug complex distributed issues.
Systems Programming: Proficiency in Python and experience with systems languages (Rust or Go). Comfortable profiling workloads and resolving compute, memory, or network bottlenecks.
Orchestration & Pipelines: Experience managing large-scale offline workflows using tools like Dagster, Ray, Airflow, or similar frameworks.
Production Operations: Deep understanding of production reliability, monitoring, incident response, and capacity planning for high-traffic services.
Experience with WebRTC or real-time media pipelines in production
Experience running GPU-backed inference services at scale (vLLM, Triton Inference Server, TensorRT)
Knowledge of performance optimization and low-level systems debugging
Familiarity with video/audio processing and storage systems