Multimodal Learning Jobs in California (NOW HIRING)

Research Engineer

San Francisco, CA · On-site

$175K - $275K/yr

Develop and refine training methodologies, including fine-tuning, reinforcement learning, and large-scale multimodal learning * Design and generate training and evaluation datasets from simulation ...

... multimodal learning. * Experience developing and implementing deep learning models and algorithms using modern software libraries such as PyTorch, TensorFlow, or similar, as evidenced by publications ...

Multimodal Learning information

What is multimodal learning?

Multimodal learning is an area of machine learning that involves integrating and processing information from multiple types of data, such as text, images, audio, and video. The goal is to create models that can understand and make predictions based on more than one data modality, similar to how humans use various senses. This approach is used in applications like speech recognition with visual cues, image captioning, and video analysis. By combining different data types, multimodal learning systems can achieve better accuracy and more robust understanding.
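
As a rough illustration of "combining different data types", the sketch below joins a text feature vector and an image feature vector by concatenation and scores the result with a linear model. This is one simple fusion strategy among many, not a description of any particular employer's system. The function names, feature values, and weights are all hypothetical.

```python
# Illustrative only: the feature values and weights below are made up.

def fuse_by_concatenation(text_features, image_features):
    """Feature-level fusion: join per-modality vectors into one vector."""
    return list(text_features) + list(image_features)


def linear_score(features, weights, bias=0.0):
    """Score a fused feature vector with a simple linear model."""
    if len(features) != len(weights):
        raise ValueError("features and weights must have the same length")
    return sum(f * w for f, w in zip(features, weights)) + bias


# Hypothetical embeddings for one example (e.g. a caption and its image).
text_features = [0.5, 1.0]
image_features = [2.0, 0.0, 1.0]

fused = fuse_by_concatenation(text_features, image_features)
weights = [1.0, 0.5, 0.25, 1.0, 2.0]  # hypothetical, hand-picked weights
score = linear_score(fused, weights)
print(fused, score)
```

In practice the per-modality vectors would come from learned encoders and the scorer would be trained, but the shape of the computation is the same: each modality contributes features to a single joint prediction.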

What is the difference between a Multimodal Learning role and a Data Scientist role?

Aspect | Multimodal Learning | Data Scientist
Required Credentials | Advanced degrees in AI, Machine Learning, or Computer Science | Bachelor's or Master's in Data Science, Statistics, or related fields
Work Environment | Research labs, AI development teams, academia | Business, tech companies, analytics teams
Industry Usage | AI research, multimedia applications, robotics | Data analysis, predictive modeling, business insights

Multimodal Learning focuses on developing AI models that process and integrate multiple data types like images, text, and audio. Data Scientists analyze data to extract insights, build models, and support decision-making. While both roles involve data and algorithms, Multimodal Learning is specialized in AI model development for complex data integration, whereas Data Scientists work broadly across data analysis and interpretation.

What are the key skills and qualifications needed to thrive as a Multimodal Learning Specialist, and why are they important?

To excel as a Multimodal Learning Specialist, you need a solid background in machine learning, data science, and computer vision, often supported by an advanced degree in a related field. Familiarity with deep learning frameworks like TensorFlow or PyTorch, experience integrating data from diverse sources (e.g., text, audio, images), and knowledge of relevant algorithms are crucial. Strong problem-solving abilities, creativity, and effective collaboration are standout soft skills for this role. These competencies are vital for developing innovative models that can process and interpret complex, multi-source data to drive impactful AI solutions.

What are some common challenges faced by professionals working in multimodal learning roles, and how can they be addressed?

Professionals in multimodal learning frequently encounter challenges related to integrating and aligning data from multiple sources, such as text, images, audio, or video. Ensuring data quality and consistency across modalities can be complex, and developing models that effectively combine heterogeneous information often requires advanced technical skills and innovative thinking. Collaboration with domain experts and other data scientists is key to overcoming these obstacles, as is staying up to date with the latest research and tools in machine learning. Regular team meetings and cross-disciplinary workshops can help foster a collaborative environment and promote knowledge sharing.
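
To make the alignment challenge concrete, here is a minimal sketch that pairs each video frame with the nearest audio window by timestamp and leaves a frame unmatched when nothing falls within a tolerance. It assumes both streams carry timestamps in seconds and that the audio timestamps are sorted. The function name, the sample timestamps, and the tolerance are hypothetical.

```python
from bisect import bisect_left


def align_nearest(frame_times, audio_times, tolerance):
    """For each frame time, return the index of the nearest audio time,
    or None if nothing lies within `tolerance` seconds.
    `audio_times` must be sorted in ascending order."""
    matches = []
    for t in frame_times:
        i = bisect_left(audio_times, t)
        # Candidates: the neighbor on each side of the insertion point.
        candidates = [j for j in (i - 1, i) if 0 <= j < len(audio_times)]
        best = min(candidates, key=lambda j: abs(audio_times[j] - t), default=None)
        if best is not None and abs(audio_times[best] - t) <= tolerance:
            matches.append(best)
        else:
            matches.append(None)
    return matches


frame_times = [0.0, 0.5, 1.0, 3.0]      # hypothetical video frame timestamps (s)
audio_times = [0.02, 0.48, 0.97, 1.5]   # hypothetical audio window timestamps (s)
matches = align_nearest(frame_times, audio_times, tolerance=0.1)
print(matches)
```

Returning None for unmatched frames keeps the gap visible, so a later step can decide whether to drop, interpolate, or flag those examples.
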

What cities in California are hiring for Multimodal Learning jobs?

Cities in California with the most Multimodal Learning job openings:

[Infographic: Multimodal Learning job openings in California as of June 2026. Employment types: 7% Locum Tenens, 35% As Needed, 26% Full Time, 1% Part Time, 30% Temporary, 1% Contract. Work location: 85% Physical, 6% Hybrid, 9% Remote.]

Applied Machine Learning Research Engineer - Multimodal for Human Understanding

Apple

Sunnyvale, CA • On-site

$147K - $272K/yr

Other

Medical, Dental, Retirement

Posted 25 days ago


Apple rating

Company rating: 8.1 out of 10

Based on 661 frontline employees who took The Breakroom Quiz

6th of 30 rated technology retailers


Job description

Applied Machine Learning Research Engineer - Multimodal for Human Understanding

We're starting to see the incredible potential of multimodal foundation and large language models, and many applications in the computer vision and machine learning domain that previously appeared infeasible are now within reach. We are looking for a highly motivated and skilled Applied Machine Learning Research Engineer to join our team in the Video Computer Vision group and help us push the boundaries of human understanding. The Video Computer Vision org has pioneered human-centric real-time features such as FaceID, FaceKit, and Gaze and Hand gesture control which have changed the way millions of users interact with their devices. We balance research and product requirements to deliver Apple quality, pioneering experiences, innovating through the full stack, and partnering with HW, SW and AI teams to shape Apple's products and bring our vision to life.

In this role, you will drive groundbreaking development at the intersection of AI, generative modeling, and computer vision. You will work across the full lifecycle (from foundational investigation to practical applications), designing, implementing, and evaluating novel algorithms and models. Your primary focus will be human understanding, including human motion, activities, and representation learning. A major aspect of the role involves designing, implementing, evaluating, and productizing ML systems capable of human and activity understanding. This position offers a unique opportunity to innovate, build, and ship: you will take your conceptual ideas to products that reach millions of users worldwide. You will collaborate with a diverse group of experts (research scientists, ML engineers, software engineers, data scientists, human-interface designers, and domain specialists), working in an environment that values experimentation, ownership, and continuous learning. By staying at the forefront of advancements in AI, machine learning, and computer vision, you will play a direct role in driving innovation, influencing the evolution of Apple products, and meaningfully enhancing user experience on a global scale.

Minimum Qualifications
  • Strong experience developing machine learning models.
  • Proficiency in Python and solid software engineering fundamentals.
  • Experience with at least one deep learning framework (e.g., PyTorch, JAX, or equivalent).
  • Master's degree in Computer Science or a related field, plus 3 years of relevant industry experience.
Preferred Qualifications
  • Hands-on experience training and deploying production-grade ML models.
  • Experience developing multimodal LLMs or generative models.
  • Production-level experience with a compiled language (e.g., Swift, C++).
  • Expertise in one or more areas: computer vision, machine learning, multimodal LLMs, Reinforcement Learning, Agentic AI.
  • PhD in Computer Science, Electrical Engineering, or a related field with a focus on computer vision, machine learning, or multimodal systems.
  • Demonstrated problem-solving ability, a strong sense of ownership, and a track record of shipping products.

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

At Apple, we believe accessibility is a fundamental human right. You'll find that idea reflected in everything here: in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong. Learn about accessibility in Apple's workplace. Learn about reasonable accommodations for job applicants.

Apple accepts applications to this posting on an ongoing basis.


About Apple

Sourced by ZipRecruiter

Imagine what you could do here! At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, intelligent people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with Apple hardware products. The same passion for innovation that goes into our products also applies to our practices, strengthening our dedication to leave the world better than we found it.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Cupertino, CA, US

Year founded

1976