1

Multimodal Learning Jobs in Florida (NOW HIRING)

... of modern, multimodal roadway systems and ensure safe, efficient, and reliable travel for all ... Exposure to NDT methods or interest in learning them * FAA Part 107 drone certification

... of modern, multimodal roadway systems and ensure safe, efficient, and reliable travel for all ... Exposure to NDT methods or interest in learning them * FAA Part 107 drone certification

... of modern, multimodal roadway systems and ensure safe, efficient, and reliable travel for all ... learning among team members. What you will bring to our firm: * NBIS Bridge Inspection Team Leader ...

Develop and optimize AI inference systems for large language models and multimodal AI * Build ... Experience with machine learning frameworks such as PyTorch, TensorFlow, or JAX * Familiarity with ...

... of modern, multimodal roadway systems and ensure safe, efficient, and reliable travel for all ... learning among team members. What you will bring to our firm: * NBIS Bridge Inspection Team Leader ...

... of modern, multimodal roadway systems and ensure safe, efficient, and reliable travel for all ... learning among team members. What you will bring to our firm: * NBIS Bridge Inspection Team Leader ...

Join a team committed to continuous learning and professional development with opportunities for ... Utilizes a multimodal approach to include but not limited to medication, physical therapy and ...

Applied AI Engineer

Miami, FL · On-site

$133K/yr

... Learning, and Data Science. We stand out as an agency that's deeply embedded in our clients ... Background in document understanding, embeddings, or multimodal context assembly . * Familiarity ...

Join CEVA Logistics, and you will be part of a team that values imagination and continued learning ... Experience with air-freight-led solutions and integrated multimodal logistics * Background in ...

next page

Showing results 1-20

Multimodal Learning information

What is multimodal learning?

Multimodal learning is an area of machine learning that involves integrating and processing information from multiple types of data, such as text, images, audio, and video. The goal is to create models that can understand and make predictions based on more than one data modality, similar to how humans use various senses. This approach is used in applications like speech recognition with visual cues, image captioning, and video analysis. By combining different data types, multimodal learning systems can achieve better accuracy and more robust understanding.

What is the difference between Multimodal Learning vs Data Scientist?

AspectMultimodal LearningData Scientist
Required CredentialsAdvanced degrees in AI, Machine Learning, or Computer ScienceBachelor's or Master's in Data Science, Statistics, or related fields
Work EnvironmentResearch labs, AI development teams, academiaBusiness, tech companies, analytics teams
Industry UsageAI research, multimedia applications, roboticsData analysis, predictive modeling, business insights

Multimodal Learning focuses on developing AI models that process and integrate multiple data types like images, text, and audio. Data Scientists analyze data to extract insights, build models, and support decision-making. While both roles involve data and algorithms, Multimodal Learning is specialized in AI model development for complex data integration, whereas Data Scientists work broadly across data analysis and interpretation.

What are the key skills and qualifications needed to thrive as a Multimodal Learning Specialist, and why are they important?

To excel as a Multimodal Learning Specialist, you need a solid background in machine learning, data science, and computer vision, often supported by an advanced degree in a related field. Familiarity with deep learning frameworks like TensorFlow or PyTorch, experience integrating data from diverse sources (e.g., text, audio, images), and knowledge of relevant algorithms are crucial. Strong problem-solving abilities, creativity, and effective collaboration are standout soft skills for this role. These competencies are vital for developing innovative models that can process and interpret complex, multi-source data to drive impactful AI solutions.

What are some common challenges faced by professionals working in multimodal learning roles, and how can they be addressed?

Professionals in multimodal learning frequently encounter challenges related to integrating and aligning data from multiple sources, such as text, images, audio, or video. Ensuring data quality and consistency across modalities can be complex, and developing models that effectively combine heterogeneous information often requires advanced technical skills and innovative thinking. Collaboration with domain experts and other data scientists is key to overcoming these obstacles, as is staying up to date with the latest research and tools in machine learning. Regular team meetings and cross-disciplinary workshops can help foster a collaborative environment and promote knowledge sharing.
What cities in Florida are hiring for Multimodal Learning jobs? Cities in Florida with the most Multimodal Learning job openings:
Research Engineer -- Post-Training & Small Language Models (SLMs), Healthcare AI

Research Engineer -- Post-Training & Small Language Models (SLMs), Healthcare AI

Deloitte

Tampa, FL • On-site

Full-time

Posted 4 days ago


Deloitte rating

8.1

Company rating: 8.1 out of 10

Based on 86 frontline employees who took The Breakroom Quiz

58th of 138 rated financial services


Job description

Job Summary:
Deloitte is leading an AI-first initiative aimed at transforming the healthcare decision-making process through advanced modeling and reasoning systems. As a Research Engineer, you will design, train, and evaluate models that enhance clinical and operational decision-making, focusing on post-training methodologies and ensuring model behavior aligns with healthcare standards.
Responsibilities:
• Design and execute post-training pipelines: supervised fine-tuning (SFT), preference optimization, and reinforcement learning / alignment workflows.
• Build and optimize training using techniques such as SFT, RLHF, PPO, DPO, GRPO, RLAIF, and Constitutional AI, and understand how each affects reasoning quality, safety, latency, cost, and reliability.
• Train reasoning models for healthcare decisioning using verifiable-reward RL - designing reward signals and verifiers grounded in clinical guidelines, policy and criteria, and adjudicated outcomes.
• Develop reward models and preference datasets to improve reasoning quality, factuality, safety, policy adherence, and task performance.
• Curate, clean, synthesize, and evaluate large-scale instruction, preference, and domain-specific datasets, with rigorous filtering, deduplication, and quality control.
• Build verification and reward pipelines from our proprietary clinical, claims, and operational data and from clinical-expert labeling - turning guidelines, policy, and adjudicated outcomes into checkable reward signals at scale.
• Implement efficient fine-tuning strategies including LoRA, QLoRA, PEFT, and adapter-based approaches; build scalable distributed training using DeepSpeed, FSDP, Megatron-LM, Ray, or equivalent.
• Optimize inference performance - latency, throughput, quantization, and deployment efficiency - for production, including frameworks such as vLLM, TensorRT-LLM, or TGI.
• Train and optimize open-weight models such as Llama, Qwen, Mistral, or DeepSeek; build specialized small language models (SLMs) for on-premise and cloud-hybrid deployment with strong performance-per-dollar.
• Design evaluation frameworks covering reasoning, hallucination detection, factuality, instruction following, structured outputs, and domain-specific metrics.
• Build healthcare-grade evaluation - held-out clinical benchmarks, deployment regression gates, calibration and uncertainty, factuality against ground truth, and bias/fairness evaluation across patient populations and subgroups - co-designed with clinical experts.
• Apply PHI/HIPAA-aware data handling and produce model documentation suitable for regulated clinical use.
• Perform red teaming and adversarial testing to identify alignment failures, unsafe behaviors, jailbreak vulnerabilities, and regression risks; collaborate with agentic and application teams to improve tool use, grounding, and long-horizon reasoning.
Qualifications:
Required:
• Bachelor's degree in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, Computational Linguistics, or a related field.
• Demonstrated depth training and post-training large transformer-based language models in production or research - this is your craft, not coursework or a one-off fine-tune. Genuine depth including SFT and at least one preference-optimization or RL method, evidenced by shipped models, releases, or research.
• Hands-on experience with reasoning-model training and/or verifiable-reward (RLVR) workflows.
• Strong understanding of modern post-training techniques: SFT, RLHF, PPO, DPO, GRPO, RLAIF, and preference optimization workflows.
• Experience with open-weight foundation models such as Llama, Qwen, Mistral, DeepSeek, or equivalent architectures.
• Strong expertise in PyTorch and modern deep-learning tooling; experience with distributed training frameworks such as DeepSpeed, FSDP, Megatron-LM, or Ray.
• Experience implementing efficient fine-tuning techniques such as LoRA, QLoRA, PEFT, and quantization-aware workflows.
• Deep understanding of transformer architectures, tokenization, attention mechanisms, decoding strategies, and model scaling trade-offs.
• Strong grasp of LLM evaluation methodologies, benchmarking, reward modeling, and alignment trade-offs; experience with large-scale and synthetic datasets, filtering, deduplication, and quality-control pipelines.
• Strong Python engineering skills and production-grade software practices; ability to work through ambiguous, highly complex technical problems in fast-moving environments.
• Ability to travel 0-50%, on average, based on the work you do and the clients and industries/sectors you serve.
• Limited immigration sponsorship may be available.
Preferred:
• Experience building or optimizing reasoning models, agentic models, or tool-using LLM systems.
• Familiarity with inference optimization frameworks such as vLLM, TensorRT-LLM, TGI, or Ollama.
• Experience with multimodal models, speech models, or domain-specific foundation models; experience using large-scale GPU clusters and distributed compute.
• Contributions to open-source AI projects, research publications, benchmark development, or model releases.
• Familiarity with safety, governance, and responsible-AI practices; experience in regulated or high-stakes industries such as healthcare, finance, insurance, or public sector.
Company:
Deloitte drives progress. Our firms around the world help clients become leaders wherever they choose to compete. Founded in 1900, the company is headquartered in Marunouchi, JPN, with a team of 10001+ employees. The company is currently Late Stage.

What Deloitte employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom