1

Multimodal Learning Jobs in Texas (NOW HIRING)

You'll architect and implement Vision-Language-Action (VLA) models, advance reinforcement learning applications, and push the boundaries of multimodal AI integration. This role combines deep ...

Design, train, and optimize machine learning models including LLMs, multimodal models, transformers, and diffusion architectures * Conduct research on model efficiency, quantization, compression, and ...

Apply adult learning principles, including Bloom's Taxonomy and multimodal learning strategies * Translate complex technical topics into clear, engaging, and easy-to-understand content * Validate all ...

Apply adult learning principles, including Bloom's Taxonomy and multimodal learning strategies * Translate complex technical topics into clear, engaging, and easy-to-understand content * Validate all ...

... and Multimodal Large Language Models (MLLMs). These models power both onboard and offboard ... Design, implement, and refine deep learning models to ensure efficiency, scalability, and ...

MultiModal AI Modeling - Strong track record fusing logs, time series, traces, tabular data, and ... MLOps & Continuous Learning - Fluency in automated retraining, drift detection, incremental updates ...

MultiModal AI Modeling - Strong track record fusing logs, time series, traces, tabular data, and ... MLOps & Continuous Learning - Fluency in automated retraining, drift detection, incremental updates ...

MultiModal AI Modeling - Strong track record fusing logs, time series, traces, tabular data, and ... MLOps & Continuous Learning - Fluency in automated retraining, drift detection, incremental updates ...

... and Multimodal Large Language Models (MLLMs). These models power both onboard and offboard ... About the Role We are looking for an experienced Machine Learning Engineer with a strong background ...

Senior / Staff Machine Learning Engineer

Austin, TX · On-site

$124K - $171K/yr

... and Multimodal Large Language Models (MLLMs). These models power both onboard and offboard ... About the Role We are hiring experienced Machine Learning Engineers across Senior, Staff, and ...

next page

Showing results 1-20

Multimodal Learning information

What is multimodal learning?

Multimodal learning is an area of machine learning that involves integrating and processing information from multiple types of data, such as text, images, audio, and video. The goal is to create models that can understand and make predictions based on more than one data modality, similar to how humans use various senses. This approach is used in applications like speech recognition with visual cues, image captioning, and video analysis. By combining different data types, multimodal learning systems can achieve better accuracy and more robust understanding.

What is the difference between Multimodal Learning vs Data Scientist?

AspectMultimodal LearningData Scientist
Required CredentialsAdvanced degrees in AI, Machine Learning, or Computer ScienceBachelor's or Master's in Data Science, Statistics, or related fields
Work EnvironmentResearch labs, AI development teams, academiaBusiness, tech companies, analytics teams
Industry UsageAI research, multimedia applications, roboticsData analysis, predictive modeling, business insights

Multimodal Learning focuses on developing AI models that process and integrate multiple data types like images, text, and audio. Data Scientists analyze data to extract insights, build models, and support decision-making. While both roles involve data and algorithms, Multimodal Learning is specialized in AI model development for complex data integration, whereas Data Scientists work broadly across data analysis and interpretation.

What are the key skills and qualifications needed to thrive as a Multimodal Learning Specialist, and why are they important?

To excel as a Multimodal Learning Specialist, you need a solid background in machine learning, data science, and computer vision, often supported by an advanced degree in a related field. Familiarity with deep learning frameworks like TensorFlow or PyTorch, experience integrating data from diverse sources (e.g., text, audio, images), and knowledge of relevant algorithms are crucial. Strong problem-solving abilities, creativity, and effective collaboration are standout soft skills for this role. These competencies are vital for developing innovative models that can process and interpret complex, multi-source data to drive impactful AI solutions.

What are some common challenges faced by professionals working in multimodal learning roles, and how can they be addressed?

Professionals in multimodal learning frequently encounter challenges related to integrating and aligning data from multiple sources, such as text, images, audio, or video. Ensuring data quality and consistency across modalities can be complex, and developing models that effectively combine heterogeneous information often requires advanced technical skills and innovative thinking. Collaboration with domain experts and other data scientists is key to overcoming these obstacles, as is staying up to date with the latest research and tools in machine learning. Regular team meetings and cross-disciplinary workshops can help foster a collaborative environment and promote knowledge sharing.
What cities in Texas are hiring for Multimodal Learning jobs? Cities in Texas with the most Multimodal Learning job openings:
Advanced Analytics Research Scientist - VLA

Advanced Analytics Research Scientist - VLA

Rockwell Automation, Inc.

Austin, TX • On-site

Full-time

Medical, Dental, Vision, Retirement, PTO

Posted 5 days ago


Rockwell Automation rating

7.9

Company rating: 7.9 out of 10

Based on 32 frontline employees who took The Breakroom Quiz

158th of 417 rated machine equipment manufacturers


Job description

Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile. With more than 28,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.
We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work. And if that's you we would love to have you join us!
Job Description
The AI Center of Excellence (COE) at Rockwell Automation is looking for an AI Scientist. This scientist will specialize in Vision-Language-Action (VLA) systems. The goal is to help develop the next generation of multimodal AI solutions for industrial environments. You will focus on translating recent advances in multimodal foundation models, embodied AI, and agentic systems into deployable technologies for real-world industrial applications.
You will work with a diverse team of AI researchers, control engineers, and software developers to design systems that integrate perception, reasoning, and action for complex operational settings.
This position offers the opportunity to apply state-of-the-art research in multimodal AI to large-scale industrial systems spanning manufacturing, robotics, and automation.
Responsibilities
  • Lead the development of vision-language-action architectures for industrial applications
  • Design and implement multimodal learning systems that integrate perception, language understanding, and decision making.
  • Adapt and fine-tune multimodal foundation models for domain-specific industrial tasks.
  • Develop Agentic AI frameworks that connect perception models with tools, control systems, or operational workflows.
  • Evaluate system robustness, safety, and reliability in real-world environments
  • Collaborate with cross-functional teams to translate research prototypes into deployable solutions
  • Contribute to the organization's technical leadership through publications, internal research reports, and technical mentoring

The Essentials - You Will Have:
  • Bachelor's Degree in Relevant Field

The Preferred - You Might Also Have:
  • Typically requires 8+ years relevant experience
  • M.Sc or Ph.D in Computer Science, Robotics, Machine Learning, or a related field
  • Familiarity with agent-based systems, tool-use frameworks, or LLM-based planners
  • Expertise in deep learning and multimodal machine learning
  • Hands-on experience with vision-language models, multimodal transformers, or embodied AI systems
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch)
  • Experience designing and training large-scale transformer architectures
  • Analytical thinking and the ability to translate research ideas into working systems

What We Offer
  • Health Insurance including Medical, Dental and Vision
  • 401k
  • Paid Time off
  • Parental and Caregiver Leave
  • Flexible Work Schedule where you will work with your manager to enjoy a work schedule that can be flexible with your personal life.
  • To learn more about our benefits package, please visit at www.raquickfind.com.

This position is part of a job family. Experience will be the determining factor for position level and compensation
At Rockwell Automation we are dedicated to building a diverse, inclusive and authentic workplace, so if you're excited about this role but your experience doesn't align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right person for this or other roles.
#LI-PD1
#LI-hybrid
#lifeatROK
We are an Equal Opportunity Employer including disability and veterans.
If you are an individual with a disability and you need assistance or a reasonable accommodation during the application process, please contact our services team at +1 (844) 404-7247.
Rockwell Automation's hybrid policy aligns that employees are expected to work at a Rockwell location at least Mondays, Tuesdays, and Thursdays unless they have a business obligation out of the office.

What Rockwell Automation employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Rockwell Automation logo

About Rockwell Automation

Sourced by ZipRecruiter

Rockwell Automation is a global technology leader focused on helping the world's manufacturers be more productive, sustainable, and agile. With more than 25,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.

Industry

Industrial automation equipment manufacturing

Company size

10,000+ Employees

Headquarters location

Milwaukee, WI, US

Year founded

1903

Social media