Multimodal Learning Jobs (NOW HIRING)

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site

$118K - $156K/yr

Omnitag, our ML-powered multimodal data mining framework, is the engine that powers this discovery. As a Senior Machine Learning Engineer on the Data Mining team, your mission is to build the "Brain ...

Motional

Senior Machine Learning Engineer, Data Mining

San Francisco, CA · On-site +1

$144K - $190K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Software Engineering Institute

Senior Machine Learning Research Scientist - Frontier Lab

Pittsburgh, PA · On-site

$95K - $121K/yr

Mission modalities and multimodal learning, including sensor fusion and learning under noisy, sparse, or constrained data conditions (including synthetic data and weakly-/self-supervised approaches)

Vantor

Applied AI Scientist

Herndon, VA · On-site

Vision-language models (VLMs), Multimodal learning, Reasoning models, Large language models (LLMs), Computer vision or geospatial AI. • Strong programming skills in Python, with experience using ...

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Software Engineering Institute

Senior Machine Learning Research Scientist - Frontier Lab with Security Clearance

Pittsburgh, PA · On-site

$88K - $121K/yr

Cisco

Senior AI Researcher

San Francisco, CA · On-site

Our research spans foundation models, agentic AI, multimodal learning, reasoning systems, scalable training algorithms, evaluation science, inference optimization, and AI systems infrastructure. We ...

Apple

Staff Machine Learning Engineer - Ads Signals Intelligence & Information Retrieval

Cupertino, CA · On-site

You'll work on problems at the cutting edge of retrieval, multimodal learning, LLMs, and content intelligence, while contributing to Apple's mission to deliver high-performing, privacy-first ...

Predactiv

Research Scientist, Gen AI & User Representation Learning

Palo Alto, CA · On-site +1

$100K - $130K/yr

You will conduct applied research that advances representation learning, multimodal understanding, and transformer-based modeling while working closely with engineering teams to translate research ...

TikTok

Research Scientist Intern - TikTok Search / Generative AI (LLM) - Global Frontier Tech Recruitment P

San Jose, CA · On-site

$60/hr

LLMs, NLP, search/recommendation, or multimodal learning. - Proficiency in Python and experience with ML frameworks (e.g., PyTorch, TensorFlow). - Strong problem-solving skills and willingness to ...

Google

Research Software Engineer, Multimodal AI

San Jose, CA · On-site

Experience with multimodal learning, large language models or AI agents. * Experience with prompt engineering, few-shot learning, post-training techniques, and evaluations. * Familiarity with large ...

Rocky Mountain College of Art + Design

Learning Experience Designer

Lakewood, CO · On-site

$65K - $70K/yr

Design multimodal learning experiences including online, hybrid, classroom, simulations, and case-based formats. * Stay informed on, provide input for, and integrate emerging learning modalities and ...

Apple

AIML - Machine Learning Researcher - Multimodal Agent

Santa Clara, CA · On-site

We are looking for people with excellent applied machine learning, computer vision, multimodal LLM, and agent training experience and solid engineering skills. This role will have the following ...

Motional

Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$144K - $192K/yr

Omnitag, our ML-powered multimodal data mining framework, is the engine that powers this discovery. As a Machine Learning Engineer on the Data Mining team, your mission is to help build the "Brain ...

Predactiv

Research Scientist, Gen AI & User Representation Learning

Palo Alto, CA · Remote

$100K - $130K/yr

Apple

Senior Applied ML Researcher - Video Apps

Cupertino, CA · On-site

You will work on challenging problems at the intersection of computer vision, audio signal processing, and multimodal learning, enabling intelligent systems that can see, hear, and reason about the ...

Motional

Senior Machine Learning Engineer, Data Mining

$125K - $165K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site

$117K - $154K/yr

Showing results 1-20

Multimodal Learning information

Salary figures shown: $21K, $61.7K (average), $114.5K

How much do multimodal learning jobs pay per year?

As of Jul 22, 2026, the average yearly pay for multimodal learning jobs in the United States is $61,692, according to ZipRecruiter salary data. Most workers in these roles earn between $41,000 and $72,000 per year, depending on experience, location, and employer.

What is multimodal learning?

Multimodal learning is an area of machine learning that involves integrating and processing information from multiple types of data, such as text, images, audio, and video. The goal is to create models that can understand and make predictions based on more than one data modality, similar to how humans use various senses. This approach is used in applications like speech recognition with visual cues, image captioning, and video analysis. By combining different data types, multimodal learning systems can achieve better accuracy and more robust understanding.

What is the difference between a Multimodal Learning role and a Data Scientist?

Aspect	Multimodal Learning	Data Scientist
Required Credentials	Advanced degrees in AI, Machine Learning, or Computer Science	Bachelor's or Master's in Data Science, Statistics, or related fields
Work Environment	Research labs, AI development teams, academia	Business, tech companies, analytics teams
Industry Usage	AI research, multimedia applications, robotics	Data analysis, predictive modeling, business insights

Multimodal Learning focuses on developing AI models that process and integrate multiple data types like images, text, and audio. Data Scientists analyze data to extract insights, build models, and support decision-making. While both roles involve data and algorithms, Multimodal Learning is specialized in AI model development for complex data integration, whereas Data Scientists work broadly across data analysis and interpretation.

What are the key skills and qualifications needed to thrive as a Multimodal Learning Specialist, and why are they important?

To excel as a Multimodal Learning Specialist, you need a solid background in machine learning, data science, and computer vision, often supported by an advanced degree in a related field. Familiarity with deep learning frameworks like TensorFlow or PyTorch, experience integrating data from diverse sources (e.g., text, audio, images), and knowledge of relevant algorithms are crucial. Strong problem-solving abilities, creativity, and effective collaboration are standout soft skills for this role. These competencies are vital for developing innovative models that can process and interpret complex, multi-source data to drive impactful AI solutions.

What are some common challenges faced by professionals working in multimodal learning roles, and how can they be addressed?

Professionals in multimodal learning frequently encounter challenges related to integrating and aligning data from multiple sources, such as text, images, audio, or video. Ensuring data quality and consistency across modalities can be complex, and developing models that effectively combine heterogeneous information often requires advanced technical skills and innovative thinking. Collaboration with domain experts and other data scientists is key to overcoming these obstacles, as is staying up to date with the latest research and tools in machine learning. Regular team meetings and cross-disciplinary workshops can help foster a collaborative environment and promote knowledge sharing.

Infographic showing various Multimodal Learning job openings in the United States as of July 2026, with employment types broken down into 1% As Needed, 72% Full Time, 25% Part Time, 1% Temporary, and 1% Contract. Highlights an 86% Physical, 2% Hybrid, and 12% Remote job distribution, with an average salary of $61,692 per year, or $29.7 per hour.

Senior Machine Learning Engineer, Data Mining

Motional

Pittsburgh, PA • On-site

$118K - $156K/yr

Full-time

Medical, Dental, Vision, Life, Retirement

Posted 11 days ago

Job description

Mission Summary:
At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. Omnitag, our ML-powered multimodal data mining framework, is the engine that powers this discovery.
As a Senior Machine Learning Engineer on the Data Mining team, your mission is to build the "Brain" of this engine: designing massive multimodal Teacher models that understand the world, and distilling them into hyper-efficient Student models that can scour exabytes of data in near real-time. You will work at the intersection of large-scale representation learning, retrieval optimization, and reasoning systems. Your work will directly influence how we compress knowledge into efficient encoders for fast search, and how we apply reinforcement learning to optimize data discovery workflows and intelligent querying. By building smarter mining tools, you will accelerate the entire model improvement lifecycle for teams working on post-training analysis, error diagnosis, and dataset curation.
What You'll Do:

Architect and Train Distilled Models: Design and implement teacher-student model frameworks for multimodal sensor data. Develop training pipelines for knowledge distillation. Ensure student models maintain high accuracy while drastically reducing inference latency and memory footprint.
Reinforcement Learning for Data Discovery: Build RL-based policy learning and reasoning systems for autonomous driving applications. Implement and scale RL training workflows (e.g., PPO, DQN, actor-critic methods) for simulation and real-world interaction. Explore reward shaping, environment modeling, and multi-agent RL where applicable.
Optimize Model Deployment for Real-Time Inference: Collaborate with backend engineers to deploy distilled and RL models into production. Optimize for latency, throughput, and hardware efficiency across GPU/CPU clusters. Implement model versioning, A/B testing, and monitoring for performance regressions.
Research and Integrate Agentic Systems: Explore and prototype agentic workflows for autonomous reasoning, chain-of-thought prompting, and goal-directed behavior. Integrate such systems into our broader autonomy stack as experimental or production components.
Drive Production Reliability: Establish patterns for graceful degradation, fault tolerance, and cost optimization. Operate Omnitag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence.
Mentor and Collaborate: Work closely with ML scientists, data engineers, and autonomy teams to translate research advances into scalable engineering solutions. Guide junior engineers in best practices for model training, evaluation, and deployment.

What We're Looking For:

BS in Computer Science, Machine Learning, or related field, or equivalent professional experience.
6+ years of hands-on experience in machine learning engineering, with a focus on model post-training, optimization, and deployment.
Strong experience with model distillation or teacher-student training - practical knowledge of loss functions, training strategies, and evaluation of compressed models.
Proven experience with reinforcement learning in production or research settings: policy optimization, reward design, simulation environments, and RL-based reasoning.
Expert-level proficiency in Python and ML frameworks (PyTorch, TensorFlow, or JAX).
Strong software engineering fundamentals: testing, CI/CD, containerization, and system design.
Experience deploying ML models in cloud environments (AWS, GCP, or Azure) and optimizing for inference.
Demonstrated track record of shipping robust, well-tested, production-grade ML systems and mentoring junior engineers.

Bonus Points (Nice-to-Haves):

MS/PhD in Computer Science, Machine Learning, or related field.
Experience with agentic systems, autonomous reasoning, chain-of-thought models, or LLM-based planning.
Background in autonomous driving, robotics, or real-time decision-making systems.
Familiarity with multimodal learning, sensor fusion, or embodied AI.
Experience building active learning loops, using the model to find the data that breaks the model.
Experience with ML-based data mining, active learning, or contrastive learning.
Knowledge of model serving tools (TF Serving, Triton, TorchServe) and MLOps platforms.
Publications or open-source contributions in RL, distillation, or efficient ML.

We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.
The salary range for this role is an estimate based on a wide range of compensation factors including but not limited to specific skills, experience and expertise, role location, certifications, licenses, and business needs. The estimated compensation range listed in this job posting reflects base salary only. This role may include additional forms of compensation such as a bonus or company equity. The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process.
Candidates for certain positions are eligible to participate in Motional's benefits program. Motional's benefits include but are not limited to medical, dental, vision, 401k with a company match, health saving accounts, life insurance, pet insurance, and more.
Salary Range
$172,000-$229,000 USD
Motional is a driverless technology company making autonomous vehicles a safe, reliable, and accessible reality. We're driven by something more.
Our journey is always people first.
We aren't just developing driverless cars; we're creating safer roadways, more equitable transportation options, and making our communities better places to live, work, and connect. Our team is made up of engineers, researchers, innovators, dreamers and doers, who are creating a technology with the potential to transform the way we move.
Higher purpose, greater impact.
We're creating first-of-its-kind technology that will transform transportation. To do so successfully, we must design for everyone in our cities and on our roads. We believe in building a great place to work through a progressive, global culture that is diverse, inclusive, and ensures people feel valued at every level of the organization. Diversity helps us to see the world differently; it's not only good for our business, it's the right thing to do.
Scale up, not starting up.
Our team is behind some of the industry's largest leaps forward, including the first fully-autonomous cross-country drive in the U.S., the launch of the world's first robotaxi pilot, and operation of the world's longest-standing public robotaxi fleet. We're driven to scale; we're moving towards commercialization of our technology, and we need team members who are ready to embrace change and challenges.
Formed as a joint venture between Hyundai Motor Group and Aptiv, Motional is fundamentally changing how people move through their lives. Headquartered in Boston, Motional has operations in the U.S. and Asia. For more information, visit www.Motional.com and follow us on Twitter, LinkedIn, Instagram and YouTube.
Motional AD Inc. is an EOE. We celebrate diversity and are committed to creating an inclusive environment for all employees. To comply with Federal Law, we participate in E-Verify. All newly-hired employees are queried through this electronic system established by the DHS and the SSA to verify their identity and employment eligibility.

About Motional

Sourced by ZipRecruiter

Industry: Motor vehicle manufacturing

Company size: 501 - 1,000 employees

Headquarters location: Boston, MA, US

Year founded: 2020

Website: motional.com
