Deep Learning Quantization Jobs in California (NOW HIRING)

Senior Software Engineer, Deep Learning - MLIR TRT

$143K - $189K/yr

They are seeking a Senior Deep Learning Software Engineer to develop and productize deep learning ... and quantization to low-level kernel development and performance optimization. • Develop ...

NVIDIA

Senior Software Engineer, Deep Learning - MLIR TRT

Santa Clara, CA · On-site

$143K - $189K/yr

Nvidia Corporation

Senior Research Scientist, Efficient Deep Learning

Santa Clara, CA · On-site

$115K - $147K/yr

NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join ... A background in pruning, quantization, NAS, efficient backbones is required. * Experience with ...

Nvidia Corporation

Senior Research Scientist, Efficient Deep Learning

Santa Clara, CA · On-site

$115K - $147K/yr

Nvidia

Senior Research Scientist, Efficient Deep Learning

Santa Clara, CA

$115K - $147K/yr

Nvidia

Senior Research Scientist, Efficient Deep Learning

Santa Clara, CA

$115K - $147K/yr

NVIDIA

Senior Research Scientist, Efficient Deep Learning

Santa Clara, CA · On-site

$115K - $147K/yr

NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join ... background in pruning, quantization, NAS, efficient backbones is required. • Excellent ...

NVIDIA

Senior Research Scientist, Efficient Deep Learning

Santa Clara, CA · On-site

$115K - $147K/yr

Nvidia

Senior Applied Deep Learning Research Scientist, Efficiency

Santa Clara, CA

$115K - $147K/yr

We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR ... Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and ...

Nvidia

Senior Applied Deep Learning Research Scientist, Efficiency

Santa Clara, CA

$115K - $147K/yr

Nvidia Corporation

Senior Applied Deep Learning Research Scientist, Efficiency

Santa Clara, CA · On-site

$115K - $147K/yr

Nvidia Corporation

Senior Applied Deep Learning Research Scientist, Efficiency

Santa Clara, CA · On-site

$115K - $147K/yr

Harbinger Motors Inc.

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA

$123K - $169K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...

Harbinger Motors Inc.

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA

$123K - $169K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...

Nvidia

Senior Software Engineer, Deep Learning - MLIR TRT

Santa Clara, CA

$143K - $189K/yr

We are looking for outstanding Senior Deep Learning Software Engineers to develop and productize ... Working across a wide range of abstractions from model fine-tuning and quantization to low-level ...

Nvidia

Senior Software Engineer, Deep Learning - MLIR TRT

Santa Clara, CA

$143K - $189K/yr

Nvidia

Research Scientist, Efficient Deep Learning - New College Grad 2026

Santa Clara, CA

NVIDIA is searching for an outstanding researcher working on efficient deep learning to join the ... Background in pruning, quantization, NAS, efficient backbones, and so on, is a plus. * Experience ...

Nvidia

Research Scientist, Efficient Deep Learning - New College Grad 2026

Santa Clara, CA

Nvidia Corporation

Research Scientist, Efficient Deep Learning - New College Grad 2026

Santa Clara, CA · On-site

Nvidia Corporation

Research Scientist, Efficient Deep Learning - New College Grad 2026

Santa Clara, CA · On-site

Nvidia Corporation

Senior Software Engineer, Deep Learning - MLIR TRT

Santa Clara, CA · On-site

$143K - $189K/yr

Nvidia Corporation

Senior Software Engineer, Deep Learning - MLIR TRT

Santa Clara, CA · On-site

$143K - $189K/yr

Harbinger

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$124K - $170K/yr

They are seeking a highly skilled Senior Deep Learning Engineer to develop and deploy advanced ... quantization, pruning, and knowledge distillation. • Doctorate (Ph.D.) in Computer Science ...

Harbinger

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$124K - $170K/yr

Qualcomm

Staff Machine Learning Engineer - Model Optimization & Quantization

San Diego, CA · On-site

Implement advanced quantization techniques including weight-only quantization, activation ... Preferred Qualifications: * 3+ years of industry experience in machine learning, deep learning, or ...

Qualcomm

Staff Machine Learning Engineer - Model Optimization & Quantization

San Diego, CA · On-site

Harbinger

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$124K - $170K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... quantization, pruning, and knowledge distillation. • Doctorate (Ph.D.) in Computer Science ...

Harbinger

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$124K - $170K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... quantization, pruning, and knowledge distillation. • Doctorate (Ph.D.) in Computer Science ...

Harbinger

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$124K - $170K/yr

Responsibilities : • Design and implement advanced deep learning architectures to enhance ... quantization, pruning, and knowledge distillation. • Doctorate (Ph.D.) in Computer Science ...

Harbinger

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$124K - $170K/yr

Xforia, Inc.

Machine Learning Scientist

Laguna Hills, CA · On-site

Knowledge of deep learning quantization techniques. Experience with Docker and Git. Experience with Scrum/Agile software development methodologies (e.g., Jira). Language Skills: Exceptional ...

Xforia, Inc.

Machine Learning Scientist

Laguna Hills, CA · On-site

Knowledge of deep learning quantization techniques. Experience with Docker and Git. Experience with Scrum/Agile software development methodologies (e.g., Jira). Language Skills: Exceptional ...

Nvidia Corporation

Senior Deep Learning Software Engineer, TensorRT Performance

Santa Clara, CA · On-site

$143K - $189K/yr

... like quantization, scheduling, memory management, and distributed inference to set the gold ... Scale performance of deep learning models across different architectures and types of NVIDIA ...

Nvidia Corporation

Senior Deep Learning Software Engineer, TensorRT Performance

Santa Clara, CA · On-site

$143K - $189K/yr

Nvidia

Senior Deep Learning Software Engineer, TensorRT Performance

Santa Clara, CA · Hybrid

$143K - $189K/yr

Nvidia

Senior Deep Learning Software Engineer, TensorRT Performance

Santa Clara, CA · Hybrid

$143K - $189K/yr

Harbinger Motors Inc.

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$180K - $240K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...

Harbinger Motors Inc.

Sr. Computer Vision Engineer (Deep Learning)

Mountain View, CA · On-site

$180K - $240K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...

Phantom AI

Senior/Staff Computer Vision Engineer - Deep Learning Focus

Mountain View, CA · On-site

$180K - $240K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...

Phantom AI

Senior/Staff Computer Vision Engineer - Deep Learning Focus

Mountain View, CA · On-site

$180K - $240K/yr

We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...

Showing results 1-20

Deep Learning Quantization Jobs in California

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

To excel as a Deep Learning Quantization Engineer, you need a strong background in machine learning, applied mathematics, and computer science, usually supported by an advanced degree in a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), quantization toolkits, and hardware acceleration platforms is crucial. Analytical thinking, problem-solving, and clear technical communication are standout soft skills in this role. These abilities are essential for efficiently optimizing models for deployment on resource-constrained hardware while maintaining accuracy and performance.

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

Aspect	Deep Learning Quantization	Machine Learning Engineer
Required Credentials	Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks	Bachelor's or Master's in CS, Data Science, or related fields; programming skills
Work Environment	Research labs, AI development teams, hardware optimization settings	Software development teams, data-driven projects, product-focused environments
Industry Usage	AI hardware optimization, model deployment, edge computing	Model development, data analysis, software solutions across industries

Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

What is deep learning quantization?

Deep learning quantization is the process of reducing the precision of the numbers used to represent a neural network's parameters, activations, or both. By converting the typically used 32-bit floating-point values to lower bit-width formats such as 16-bit or 8-bit integers, quantization significantly reduces the memory footprint and computational requirements of deep learning models. This technique helps deploy models efficiently on edge devices and mobile hardware while maintaining acceptable accuracy levels. Quantization is widely used in model optimization for faster inference and lower power consumption.

What are some common challenges faced when implementing deep learning quantization in production environments?

One of the main challenges in implementing deep learning quantization is balancing model accuracy with computational efficiency, as quantization can sometimes lead to a drop in model performance. Additionally, ensuring hardware compatibility and optimizing for different devices (such as CPUs, GPUs, or edge devices) can require extensive testing and tuning. Collaboration with data scientists, software engineers, and hardware specialists is often essential to successfully deploy quantized models at scale. Staying updated with the latest quantization techniques and frameworks is also important for overcoming these challenges.

What are popular job titles related to Deep Learning Quantization jobs in California? For Deep Learning Quantization jobs in California, the most frequently searched job titles are:

What job categories do people searching Deep Learning Quantization jobs in California look for? The top searched job categories for Deep Learning Quantization jobs in California are:

What cities in California are hiring for Deep Learning Quantization jobs? Cities in California with the most Deep Learning Quantization job openings:

Deep Learning Quantization jobs near you

Infographic showing various Deep Learning Quantization job openings in California as of July 2026, with employment types broken down into 74% Full Time, 23% Part Time, 1% Temporary, and 2% Contract. Highlights an 72% Physical, 2% Hybrid, and 26% Remote job distribution.

Senior Software Engineer, Deep Learning - MLIR TRT

NVIDIA

Santa Clara, CA • On-site

Apply

$143K - $189K/yr

Full-time

Re-posted yesterday

Nvidia rating

9.3

Based on 5 frontline employees who took The Breakroom Quiz

15th of 209 rated software companies

Job description

Job Summary:
NVIDIA has been a leader in computer graphics and accelerated computing for over 25 years and is now focusing on AI to revolutionize computing. They are seeking a Senior Deep Learning Software Engineer to develop and productize deep learning solutions for autonomous vehicles, including optimizing compiler technology for NVIDIA's hardware architecture.
Responsibilities:
• Developing compiler technologies to accelerate deep learning inference on NVIDIA hardware platforms for Physical AI.
• Working across a wide range of abstractions from model fine-tuning and quantization to low-level kernel development and performance optimization.
• Develop workflows that let users leverage frameworks (e.g. PyTorch, JAX) and compiler technologies tools (e.g. MLIR, Triton) without forgoing performance
• Work with customers to help accelerate their workloads on NVIDIA platforms.
• Stay up to date with the latest research and innovations in deep learning, implement and experiment with new insights to improve NVIDIA's Physical AI DNNs.
Qualifications:
Required:
• MS or PhD degree in computer science, computer vision, robotics, computer architecture or equivalent experience in technical field (or equivalent experience)
• 5+ years of work experience in software development.
• 2+ years of experience in developing deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX, etc.) or compiler technologies (e.g. LLVM, MLIR, TVM, Triton, etc.).
• Domain experience in technologies used for GPU programming (e.g. CUDA C++ and/or DSLs like OpenAI Triton) or with system-level optimization for deep learning training or inference.
• Strong C/C++ programming skills
• Familiar with start-of-the-art deep learning techniques for inference and training.
• Willing to take action and have strong analytical skills.
Preferred:
• Experience with MLIR or LLVM or similar compiler technologies
• Background with low precision inference, quantization, compression of DNNs
• Experience with GPU programming
• Experience with building DSLs or optimizing compilers (e.g. graph compiler or kernel generator) for GPUs or other accelerated computing platforms.
• Open source project ownership or contribution, healthy GitHub repositories, guiding and/or mentoring experience
Company:
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. Founded in 1993, the company is headquartered in Santa Clara, USA, with a team of 10001+ employees. The company is currently Late Stage.

What Nvidia employees say

Hours and flexibility

Workplace

Get the full story on Breakroom

About Nvidia

Sourced by ZipRecruiter

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Industry

Computer and electronic product manufacturing

Company size

10,000+ Employees

Headquarters location

Santa Clara, CA, US

Year founded

1993

Website

nvidia.com

Social media

View All Nvidia Jobs

Apply

Deep Learning Quantization Jobs in California (NOW HIRING)

Senior Software Engineer, Deep Learning - MLIR TRT

Senior Software Engineer, Deep Learning - MLIR TRT

Senior Research Scientist, Efficient Deep Learning

Senior Research Scientist, Efficient Deep Learning

Senior Research Scientist, Efficient Deep Learning

Senior Research Scientist, Efficient Deep Learning

Senior Research Scientist, Efficient Deep Learning

Senior Research Scientist, Efficient Deep Learning

Senior Applied Deep Learning Research Scientist, Efficiency

Senior Applied Deep Learning Research Scientist, Efficiency

Senior Applied Deep Learning Research Scientist, Efficiency

Senior Applied Deep Learning Research Scientist, Efficiency

Sr. Computer Vision Engineer (Deep Learning)

Sr. Computer Vision Engineer (Deep Learning)

Senior Software Engineer, Deep Learning - MLIR TRT

Senior Software Engineer, Deep Learning - MLIR TRT

Research Scientist, Efficient Deep Learning - New College Grad 2026

Research Scientist, Efficient Deep Learning - New College Grad 2026

Research Scientist, Efficient Deep Learning - New College Grad 2026

Research Scientist, Efficient Deep Learning - New College Grad 2026

Senior Software Engineer, Deep Learning - MLIR TRT

Senior Software Engineer, Deep Learning - MLIR TRT

Sr. Computer Vision Engineer (Deep Learning)

Sr. Computer Vision Engineer (Deep Learning)

Staff Machine Learning Engineer - Model Optimization & Quantization

Staff Machine Learning Engineer - Model Optimization & Quantization

Sr. Computer Vision Engineer (Deep Learning)

Sr. Computer Vision Engineer (Deep Learning)

Sr. Computer Vision Engineer (Deep Learning)

Sr. Computer Vision Engineer (Deep Learning)

Machine Learning Scientist

Machine Learning Scientist

Senior Deep Learning Software Engineer, TensorRT Performance

Senior Deep Learning Software Engineer, TensorRT Performance

Senior Deep Learning Software Engineer, TensorRT Performance

Senior Deep Learning Software Engineer, TensorRT Performance

Sr. Computer Vision Engineer (Deep Learning)

Sr. Computer Vision Engineer (Deep Learning)

Senior/Staff Computer Vision Engineer - Deep Learning Focus

Senior/Staff Computer Vision Engineer - Deep Learning Focus

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

What is deep learning quantization?

What are some common challenges faced when implementing deep learning quantization in production environments?

Senior Software Engineer, Deep Learning - MLIR TRT

Share this job

Nvidia rating

Get the real story on frontline employers

Job description

What Nvidia employees say

Get the real story on frontline employers

Hours and flexibility

Easy to get time off

Most people find it easy to take sick days

Workplace

Most people get breaks without interruption

Some people are stressed out

Only some people recommend their team

About Nvidia

Industry

Company size

Headquarters location

Year founded

Website

Social media

Share this job