1

Contractual Computer Vision Deep Learning Engineer Jobs

Computer Vision AI Engineer

Aurora, CO · On-site

$113.20K - $133.50K/yr

Responsibilities : • Develop and implement deep learning computer vision models, with a focus on ... GPU programming, including CUDA or RAPIDs, to optimize the performance of computer vision ...

Computer Vision AI Engineer

El Segundo, CA · On-site

$118.80K - $140.10K/yr

Responsibilities : • Develop and implement deep learning computer vision models, with a focus on ... GPU programming, including CUDA or RAPIDs, to optimize the performance of computer vision ...

Senior Deep Learning Engineer

New York, NY · On-site

$114.30K - $157K/yr

We're looking for a Senior Deep Learning Engineer with extensive experience in modern neural network techniques and PyTorch to help us push the boundaries of computer vision in real-world ...

Computer Vision AI Engineer

$114.10K - $134.60K/yr

Responsibilities : • Develop and implement deep learning computer vision models, with a focus on ... GPU programming, including CUDA or RAPIDs, to optimize the performance of computer vision ...

Senior Deep Learning Engineer

Manhattan, NY

$115.20K - $158.20K/yr

We're looking for a Senior Deep Learning Engineer with extensive experience in modern neural network techniques and PyTorch to help us push the boundaries of computer vision in real-world ...

next page

Showing results 1-20

Contractual Computer Vision Deep Learning Engineer information

See salary details

$19

$59

$81

How much do contractual computer vision deep learning engineer jobs pay per hour?

As of Jun 4, 2026, the average hourly pay for contractual computer vision deep learning engineer in the United States is $59.59, according to ZipRecruiter salary data. Most workers in this role earn between $51.92 and $66.35 per hour, depending on experience, location, and employer.
More about Contractual Computer Vision Deep Learning Engineer jobs
What cities are hiring for Contractual Computer Vision Deep Learning Engineer jobs? Cities with the most Contractual Computer Vision Deep Learning Engineer job openings:
What are the most commonly searched types of Computer Vision Deep Learning Engineer jobs? The most popular types of Computer Vision Deep Learning Engineer jobs are:
What states have the most Contractual Computer Vision Deep Learning Engineer jobs? States with the most job openings for Contractual Computer Vision Deep Learning Engineer jobs include:
Infographic showing various Contractual Computer Vision Deep Learning Engineer job openings in the United States as of May 2026, with employment types broken down into 1% As Needed, 30% Full Time, 49% Part Time, and 20% Contract. Highlights an 87% Physical, 2% Hybrid, and 11% Remote job distribution, with an average salary of $123,952 per year, or $59.6 per hour.
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Zensors

San Francisco, CA

$126.70K - $166.10K/yr

Other

Posted 27 days ago


Job description

Machine Learning Engineer In ML Runtime & Optimization

Zensors is the spatial intelligence platform for the physical world. Our AI platform provides real-time insights—from airport queue times to office utilization—helping organizations make smarter operational decisions.

Zensors is processing massive streams of video data 24/7 with human-level accuracy. To do this at scale, we rely on cutting-edge optimization to ensure our vision transformer and detection models run efficiently on both cloud and edge compute resources.

The AI Infrastructure team at Zensors builds the engine that powers our visual sensing platform. We provide the tools to automate the lifecycle of our AI workflow, including model development, evaluation, optimization, deployment, and monitoring across thousands of video streams.

As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of computer vision models that power smart spaces and cities.

Your responsibilities will include:

  • Optimizing Core ML Pipelines: Identifying key bottlenecks in our current video analytics pipeline and performing in-depth analysis to ensure the best possible performance on current server and edge compute architectures.
  • Cross-Stack Collaboration: Collaborating closely with AI research and platform engineering teams to optimize core parallel algorithms and influence the design of our next-generation inference infrastructure.
  • Model Acceleration: Applying advanced model optimization techniques—such as quantization (Int8/FP16), pruning, and layer fusion—to our Vision Transformers (ViTs) and CNNs to maximize throughput and minimize latency.
  • Building Efficient Operators: Working across the entire ML framework/compiler stack (e.g., PyTorch, CUDA, TensorRT, and NVIDIA DeepStream) to write custom optimized ML operator libraries.
  • Resource Efficiency: Reducing the compute cost per video stream to enable massive scalability of our SaaS product.
  • Data Management: Building, improving, maintaining, and operating systems to facilitate the collection, labeling, and use of visual data for ML training.

Requirements:

  • BS/MS or Ph.D. in Computer Science, Electrical Engineering, or a related discipline.
  • Strong programming skills in C/C++ and Python.
  • Experience with model optimization, quantization, and efficient deep learning techniques (e.g., knowledge distillation, pruning).
  • Deep understanding of GPU hardware performance, including execution models, thread hierarchy, memory/cache management, and the cost/performance trade-offs of video processing.
  • Experience with profiling and benchmarking tools (e.g., Nsight Systems, Nsight Compute) to validate performance on complex architectures.
  • Experience identifying and resolving compute and data flow bottlenecks, particularly in high-bandwidth video processing pipelines.
  • Strong communication skills and the ability to work cross-functionally between research and infrastructure teams.

Preferred Qualifications:

  • Familiarity with database systems (e.g., SQL, Neo4j).
  • Work in Computer Vision, Deep Learning, and Vision Transformers.
  • Experience with video processing frameworks such as NVIDIA DeepStream, DALI, or FFmpeg.
  • Familiarity with ML compilers (e.g., TVM, MLIR) or inference engines like TensorRT or ONNX Runtime.
  • Knowledge of distributed training systems or cloud-scale inference serving (e.g., Triton Inference Server).