AI Solutions Architect
Houston, TX · On-site
The ideal candidate will have advanced working knowledge of data analytics, modern machine learning algorithms, foundation models, large language models, vision-language models, small language models ...
Houston, TX · On-site
The ideal candidate will have advanced working knowledge of data analytics, modern machine learning algorithms, foundation models, large language models, vision-language models, small language models ...
Houston, TX · On-site
The ideal candidate will have advanced working knowledge of data analytics, modern machine learning algorithms, foundation models, large language models, vision-language models, small language models ...
New York, NY · On-site +1
$50/hr
Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLM), unified models, automatic model compression, optimization and deployement on cloud and edge.
New York, NY · On-site +1
$50/hr
Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLM), unified models, automatic model compression, optimization and deployement on cloud and edge.
Houston, TX · On-site
The ideal candidate will have advanced working knowledge of data analytics, modern machine learning algorithms, foundation models, large language models, vision-language models, small language models ...
Quick apply
Houston, TX · On-site
The ideal candidate will have advanced working knowledge of data analytics, modern machine learning algorithms, foundation models, large language models, vision-language models, small language models ...
We're looking for a Vision Language Model (VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer to operate at the forefront of visual and multi-modal intelligence deployment in industry ...
We're looking for a Vision Language Model (VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer to operate at the forefront of visual and multi-modal intelligence deployment in industry ...
Palo Alto, CA · On-site
$150K - $300K/yr
We're looking for a Vision Language Model (VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer to operate at the forefront of visual and multi-modal intelligence deployment in industry ...
Quick apply
Palo Alto, CA · On-site
$150K - $300K/yr
We're looking for a Vision Language Model (VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer to operate at the forefront of visual and multi-modal intelligence deployment in industry ...
Palo Alto, CA · On-site
$150K - $300K/yr
We're looking for a Vision Language Model (VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer to operate at the forefront of visual and multi-modal intelligence deployment in industry ...
Palo Alto, CA · On-site
$150K - $300K/yr
We're looking for a Vision Language Model (VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer to operate at the forefront of visual and multi-modal intelligence deployment in industry ...
We are looking for a detail-oriented and technically capable Vision-Language-Action (VLA) Annotator ... models. Your work directly impacts the safety and performance of AI systems operating in the real ...
Quick apply
We are looking for a detail-oriented and technically capable Vision-Language-Action (VLA) Annotator ... models. Your work directly impacts the safety and performance of AI systems operating in the real ...
We are looking for a detail-oriented and technically capable Vision-Language-Action (VLA) Annotator ... models. Your work directly impacts the safety and performance of AI systems operating in the real ...
Quick apply
We are looking for a detail-oriented and technically capable Vision-Language-Action (VLA) Annotator ... models. Your work directly impacts the safety and performance of AI systems operating in the real ...
San Jose, CA · On-site
$244.80K - $450K/yr
We are looking for researchers in LLM, VLM and Omni Model domain who are experienced in single ... Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with ...
San Jose, CA · On-site
$244.80K - $450K/yr
We are looking for researchers in LLM, VLM and Omni Model domain who are experienced in single ... Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with ...
San Diego, CA · On-site
$110.90K - $152.40K/yr
Develop and prototype novel prompting strategies for Vision-Language Models (VLMs) to elicit complex, causal reasoning about driving scenarios. * Collaborate closely with the ML Infra, Perception ...
San Diego, CA · On-site
$110.90K - $152.40K/yr
Develop and prototype novel prompting strategies for Vision-Language Models (VLMs) to elicit complex, causal reasoning about driving scenarios. * Collaborate closely with the ML Infra, Perception ...
We are looking for a detail-oriented and technically capable Vision-Language-Action (VLA) Annotator ... models. Your work directly impacts the safety and performance of AI systems operating in the real ...
Quick apply
We are looking for a detail-oriented and technically capable Vision-Language-Action (VLA) Annotator ... models. Your work directly impacts the safety and performance of AI systems operating in the real ...
San Jose, CA · On-site
$244.80K - $450K/yr
We are looking for researchers in LLM, VLM and Omni Model domain who are experienced in single ... Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with ...
San Jose, CA · On-site
$244.80K - $450K/yr
We are looking for researchers in LLM, VLM and Omni Model domain who are experienced in single ... Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with ...
Austin, TX · On-site +1
You'll architect and implement Vision-Language-Action (VLA) models, advance reinforcement learning applications, and push the boundaries of multimodal AI integration. This role combines deep ...
Austin, TX · On-site +1
You'll architect and implement Vision-Language-Action (VLA) models, advance reinforcement learning applications, and push the boundaries of multimodal AI integration. This role combines deep ...
This position will play a vital role in driving high-quality outcomes in artificial intelligence in medicine, with a focus on large language models, natural language processing (NLP), and vision ...
This position will play a vital role in driving high-quality outcomes in artificial intelligence in medicine, with a focus on large language models, natural language processing (NLP), and vision ...
Bellevue, WA · On-site
$184K - $257K/yr
... vision, NLP, speech • Experience writing software and executing complex experiments involving large AI models and datasets • Must obtain work authorization in the country of employment at the ...
Bellevue, WA · On-site
$184K - $257K/yr
... vision, NLP, speech • Experience writing software and executing complex experiments involving large AI models and datasets • Must obtain work authorization in the country of employment at the ...
San Diego, CA · On-site
$200.80K - $301.20K/yr
Design and develop models that connect vision, language, and action for real world robotic ... Knowledge of model compression techniques (quantization, distillation) for efficient deployment.
San Diego, CA · On-site
$200.80K - $301.20K/yr
Design and develop models that connect vision, language, and action for real world robotic ... Knowledge of model compression techniques (quantization, distillation) for efficient deployment.
A leading AI research company in California is seeking an experienced Machine Learning Engineer to develop Vision-Language-Action models for robotics. The ideal candidate has over 5 years of ...
A leading AI research company in California is seeking an experienced Machine Learning Engineer to develop Vision-Language-Action models for robotics. The ideal candidate has over 5 years of ...
Cupertino, CA · On-site
$252.90K/yr
This role requires experience in vision-language models, and ability to fine-tune/adapt/distill multi-modal LLMs. You will be part of a fast-paced, impact-driven Applied Research organization working ...
Cupertino, CA · On-site
$252.90K/yr
This role requires experience in vision-language models, and ability to fine-tune/adapt/distill multi-modal LLMs. You will be part of a fast-paced, impact-driven Applied Research organization working ...
Manhattan, NY · Remote
$300K/yr
You'll play a pivotal role in building advanced vision pipelines (detection, segmentation, transformers, 3D vision) and integrating them with large language models (LLMs) and vision-language models ...
Manhattan, NY · Remote
$300K/yr
You'll play a pivotal role in building advanced vision pipelines (detection, segmentation, transformers, 3D vision) and integrating them with large language models (LLMs) and vision-language models ...
Herndon, VA · On-site +1
$175K - $250K/yr
AI/Machine Learning Engineer - Vision Language Models / Multimodal AI (NGA) Location: Springfield or Herndon, VA (onsite) Clearance: TS/SCI (CI Poly preferred) Position Type: Full-Time, Direct Hire ...
New
Herndon, VA · On-site +1
$175K - $250K/yr
AI/Machine Learning Engineer - Vision Language Models / Multimodal AI (NGA) Location: Springfield or Herndon, VA (onsite) Clearance: TS/SCI (CI Poly preferred) Position Type: Full-Time, Direct Hire ...
New
$10.34 - $15.49
18% of jobs
$16.94 is the 25th percentile. Wages below this are outliers.
$15.49 - $20.65
25% of jobs
$20.65 - $25.81
4% of jobs
The median wage is $27.96 / hr.
$25.81 - $30.97
6% of jobs
$30.97 - $36.12
21% of jobs
$36.31 is the 75th percentile. Wages above this are outliers.
$36.12 - $41.28
7% of jobs
$41.28 - $46.44
9% of jobs
$46.44 - $51.60
6% of jobs
$51.60 - $56.75
1% of jobs
$56.75 - $61.91
0% of jobs
$61.91 - $67.07
1% of jobs
$10
$31
$67
| Aspect | Vision Language Model | Computer Vision Engineer |
|---|---|---|
| Required credentials | Advanced degrees in AI, Machine Learning, or related fields | Degree in Computer Science, Electrical Engineering, or related fields |
| Work environment | Research labs, AI startups, tech companies focusing on multimodal AI | Tech companies, research institutions, industries applying image analysis |
| Industry usage | Developing multimodal AI systems combining vision and language | Creating algorithms for image recognition, object detection, and analysis |
| Search and comparison intent | Understanding roles in AI development involving vision and language | Focus on technical image processing and computer vision applications |
While both roles involve working with visual data, a Vision Language Model specializes in integrating visual and textual information using advanced AI techniques, often in research or product development. In contrast, a Computer Vision Engineer focuses on developing algorithms for analyzing and interpreting visual data, primarily in applications like image recognition and object detection.

Other
This job post has expired today. Applications are no longer accepted.