1

Video Model Jobs (NOW HIRING)

Senior Vision Language Model Engineer

Santa Clara, CA ยท On-site

$121K - $167K/yr

Build, curate, and maintain highโ€‘quality multimodal datasets (e.g., video, sensor, language ... Collaborate with research, model development, performance, and product teams. * Contribute to ...

Additional experience with game engines like Unity or Unreal Engine, or 3D modeling software like Maya a plus. * Experienced with capturing video and live camera switching, for livestream and ...

Senior Vision Language Model Engineer

Santa Clara, CA ยท On-site

$122K - $168K/yr

Build, curate, and maintain highquality multimodal datasets (e.g., video, sensor, language/action ... Collaborate with research, model development, performance, and product teams. * Contribute to ...

Video Supervisor

Boston, MA ยท On-site

$68K - $85K/yr

Additional experience with game engines like Unity or Unreal Engine, or 3D modeling software like Maya a plus. * Experienced with capturing video and live camera switching, for livestream and ...

ML Engineer, Generative Video

New York, NY ยท On-site

$175K - $275K/yr

Our models leverage contextual awareness to execute the same creative decisions a professional editor would -- dramatically improving productivity for experienced teams, while making video creation ...

Additional experience with game engines like Unity or Unreal Engine, or 3D modeling software like Maya a plus. * Experienced with capturing video and live camera switching, for livestream and ...

ML Engineer, Generative Video

New York, NY ยท On-site

$175K - $275K/yr

Our models leverage contextual awareness to execute the same creative decisions a professional editor would - dramatically improving productivity for experienced teams, while making video creation ...

It ships a coordinated stack of image, video, animation, and 3D generation models that need to work together. You'll design the orchestration layer: how a character generated in the image model ...

Video Producer

Alexandria, VA ยท On-site

$80K - $130K/yr

The Video Producer will support Barrow Wise's NSF project and perform the following duties ... We are confident that Barrow Wise's core values, business model, and team focus create positive ...

next page

Showing results 1-20

Video Model information

See salary details

$14

$31

$60

How much do video model jobs pay per hour?

As of Jun 27, 2026, the average hourly pay for video model in the United States is $31.42, according to ZipRecruiter salary data. Most workers in this role earn between $23.80 and $35.34 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Video Model, and why are they important?

To thrive as a Video Model, you need a strong on-camera presence, physical fitness, and the ability to follow creative direction, often supported by a portfolio or prior modeling experience. Familiarity with video production processes, lighting setups, and sometimes editing tools is beneficial. Confidence, adaptability, and excellent communication skills help you stand out by working effectively with directors, photographers, and other team members. These skills are crucial for delivering compelling performances that meet client expectations and contribute to successful productions.

What are video models?

Video models are individuals who appear in video productions, such as music videos, commercials, fashion films, online campaigns, and promotional content. Their role is to visually represent a brand, product, or concept, often through posing, acting, or dancing on camera. Video models work closely with directors, photographers, and stylists to achieve the desired look and message for the project. The work can vary from showcasing clothing and products to telling a story through performance. Successful video models are comfortable in front of the camera, can take direction well, and adapt to different creative concepts.

What is the difference between Video Model vs Video Editor?

AspectVideo ModelVideo Editor
Primary RoleShowcases products or brands through modeling in videosCreates, edits, and produces video content
Required SkillsPresentation, acting, understanding of brandingEditing software proficiency, storytelling, technical skills
Work EnvironmentOn-camera, photoshoots, filming locationsEditing suites, post-production studios
Industry UsageFashion, advertising, influencer marketingMedia, entertainment, marketing campaigns

While both roles involve video content, a Video Model primarily appears on camera to promote products or brands, focusing on presentation and on-screen presence. In contrast, a Video Editor works behind the scenes to craft and refine video footage, emphasizing technical editing skills. Both roles are essential in video production but serve different functions within the industry.

What are some common challenges faced by video models during shoots, and how can they be addressed?

Video models often encounter challenges such as long hours on set, frequent changes in direction, and the need to quickly adapt to different creative visions. Maintaining energy and professionalism during repetitive takes can be demanding. Building strong communication with the director and crew, staying prepared with proper wardrobe and makeup, and practicing self-care between shoots are effective ways to overcome these challenges and deliver consistent performances.
More about Video Model jobs
Senior Vision Language Model Engineer

Senior Vision Language Model Engineer

Nvidia Corporation

Santa Clara, CA โ€ข On-site

$122K - $168K/yr

Full-time

Posted 15 days ago


Job description

NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a senior vision language model engineer to design and build agentic data and training workflows for Autonomous Vehicles, Robotics, and Medical applications. The right person for this role brings technical innovation and collaborative culture to change the way NVIDIA builds dataset search platforms for physical AI developers. Our dataset search offerings are ease to use, performant and scalable. Your work will redefine the dataset search and model training capabilities in NVIDIA product offerings and impact the most iconic companies in Physical AI.
What you'll be doing:
  • Partner with our researchers to develop and evaluate prototypes of our latest models, such as VLMs and VLAs, for video search, video understanding, and more. Enable fundamental advances in autonomous driving, healthcare, and robotics.
  • Design and implement agentic data workflows that automate data discovery, labeling, evaluation, and retraining to maximize development velocity.
  • Build, curate, and maintain high-quality multimodal datasets (e.g., video, sensor, language/action traces) tailored for end-to-end physical AI problems, such as autonomous driving.
  • Explore and productize new data sources including simulation and synthetic data.
  • Use agentic AI workflows across the full applied research lifecycle.
  • Collaborate with research, model development, performance, and product teams.
  • Contribute to NVIDIA Cosmos Dataset Search and other core NVIDIA platforms and products.

What we need to see:
  • PhD with 4+ years, MS with 6+ years, or BS (or equivalent experience) with 8+ years of relevant experience in Computer Science, Computer Engineering, or a related technical field
  • Strong background in modern deep learning, including transformer-based architectures, video modeling, and multimodal VLM/VLA or foundation models.
  • Excellent experience training and deploying deep learning models on real-world datasets: data preprocessing, distributed training, evaluation, debugging, and iterative improvement.
  • Excellent experience with python and at least one deep learning framework.
  • Current with the latest research on image and video search in autonomous vehicles, healthcare, robotics, or related physical AI applications.
  • Fluent with agentic AI workflows across the full applied research lifecycle, including prototyping novel algorithms and search pipelines, benchmarking, and integrating prototypes in production codebases.
  • Clear and effective communication skills, with experience working well in a dynamic, product- and research-focused team.

Ways to Stand Out from the Crowd:
  • Strong track record publishing in top-tier conference such as CVPR, NeuRIPS, ICML, ECCV
  • Patents in video retrieval or related field
  • Strong coding architecture skills demonstrated through contributions to large internal or open-source projects.
  • Experience in robotic systems such as autonomous vehicles or humanoid robotics.

Come join us at NVIDIA and contribute to a team that is pushing the edges of what can be done in AI and computer vision. We're looking for candidates who are innovative, ambitious, and ready to leave a lasting mark on the world!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until May 17, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Nvidia logo

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993