1

Deep Learning Quantization Jobs in California (NOW HIRING)

Role Summary We are looking for a Research Scientist with deep expertise in quantized deep learning ... Design and implement hardware-aware optimizations, including quantization strategies, model ...

Role Summary We are looking for a Research Scientist with deep expertise in quantized deep learning ... Design and implement hardware-aware optimizations, including quantization strategies, model ...

Senior Perception Learning Engineer

Sunnyvale, CA · On-site

$122K - $167K/yr

... deep learning approaches. • Expertise in model acceleration, quantization, or compression (TensorRT, ONNX Runtime). • Familiarity with real-time frameworks and middleware such as ROS 2, GStreamer ...

Senior Perception Learning Engineer

Sunnyvale, CA · On-site

$122K - $168K/yr

Strong classical computer vision skills (geometry-based methods, feature extraction) complementing deep learning approaches. * Expertise in model acceleration, quantization, or compression (TensorRT ...

Senior Perception Learning Engineer

Sunnyvale, CA · On-site

$122K - $167K/yr

... deep learning approaches. • Expertise in model acceleration, quantization, or compression (TensorRT, ONNX Runtime). • Familiarity with real-time frameworks and middleware such as ROS 2, GStreamer ...

next page

Showing results 1-20

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

To excel as a Deep Learning Quantization Engineer, you need a strong background in machine learning, applied mathematics, and computer science, usually supported by an advanced degree in a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), quantization toolkits, and hardware acceleration platforms is crucial. Analytical thinking, problem-solving, and clear technical communication are standout soft skills in this role. These abilities are essential for efficiently optimizing models for deployment on resource-constrained hardware while maintaining accuracy and performance.

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

AspectDeep Learning QuantizationMachine Learning Engineer
Required CredentialsAdvanced degrees in AI, Computer Science, or related fields; knowledge of neural networksBachelor's or Master's in CS, Data Science, or related fields; programming skills
Work EnvironmentResearch labs, AI development teams, hardware optimization settingsSoftware development teams, data-driven projects, product-focused environments
Industry UsageAI hardware optimization, model deployment, edge computingModel development, data analysis, software solutions across industries

Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

What is deep learning quantization?

Deep learning quantization is the process of reducing the precision of the numbers used to represent a neural network's parameters, activations, or both. By converting the typically used 32-bit floating-point values to lower bit-width formats such as 16-bit or 8-bit integers, quantization significantly reduces the memory footprint and computational requirements of deep learning models. This technique helps deploy models efficiently on edge devices and mobile hardware while maintaining acceptable accuracy levels. Quantization is widely used in model optimization for faster inference and lower power consumption.

What are some common challenges faced when implementing deep learning quantization in production environments?

One of the main challenges in implementing deep learning quantization is balancing model accuracy with computational efficiency, as quantization can sometimes lead to a drop in model performance. Additionally, ensuring hardware compatibility and optimizing for different devices (such as CPUs, GPUs, or edge devices) can require extensive testing and tuning. Collaboration with data scientists, software engineers, and hardware specialists is often essential to successfully deploy quantized models at scale. Staying updated with the latest quantization techniques and frameworks is also important for overcoming these challenges.
What cities in California are hiring for Deep Learning Quantization jobs? Cities in California with the most Deep Learning Quantization job openings:
Infographic showing various Deep Learning Quantization job openings in California as of May 2026, with employment types broken down into 6% Internship, and 94% Full Time. Highlights an 94% In-person, and 6% Remote job distribution.
Machine Learning/AI Engineer

Machine Learning/AI Engineer

Rivian

Palo Alto, CA • On-site

$139K - $155K/yr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 15 days ago


Rivian rating

7.4

Company rating: 7.4 out of 10

Based on 154 frontline employees who took The Breakroom Quiz

17th of 44 rated automakers


Job description

About Rivian

Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. 

As a company, we constantly challenge what’s possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate comfortably in areas that are unknown. Our backgrounds are diverse, but our team shares a love of the outdoors and a desire to protect it for future generations. 


Role Summary

We are looking for a Research Scientist with deep expertise in quantized deep learning models for hardware acceleration in autonomous systems. In this cross-disciplinary role, you will bridge perception model design and hardware-aware deployment, enabling efficient execution of high-performance perception algorithms across embedded compute platforms.  You will focus on researching state of the art perception models and develop optimization pipelines for the quantized versions of these models customized to provide real-time performance and energy efficiency on next-generation autonomy hardware.


Responsibilities
  • Research state of the art perception models in collaboration with the ADAS SW teams
  • Contribute to the development of optimizations for mapping quantized perception models (e.g., CNNs, Transformers, LLMs, VLMs) to embedded and heterogeneous hardware platforms.
  • Design and implement hardware-aware optimizations, including quantization strategies, model compression, memory-efficient representations, and operator fusion, targeted to custom accelerators
  • Collaborate with hardware teams to co-optimize model architecture and compute pipeline under real-time constraints (latency, throughput, power).
  • Benchmark and analyze system performance across platforms and iterate to achieve optimal deployment efficiency.
  • Partner with perception, systems, and autonomy teams to align model optimization efforts with hardware roadmap and real-world autonomy requirements.

Qualifications

Basic Qualifications:
B.S. in Computer Engineering, Electrical Engineering, Computer Science, or related field with a focus on ML compilers, embedded systems, or hardware-aware AI.
Hands-on experience with quantized model deployment, ML design stacks, and code generation for embedded or heterogeneous compute systems.
Understanding of computer vision models (e.g., object detection, segmentation) and their optimization for edge inference.
Experience in deep learning frameworks (e.g., PyTorch, TensorFlow) and their low-level IRs or export formats (e.g., ONNX).
Solid programming skills in C++, Python
Familiarity with CUDA/OpenCL (or other accelerator programming models).

Preferred Qualifications:
Prior experience working with hardware-software co-design, especially for autonomous or robotics platforms.
Knowledge of numerical precision trade-offs, quantization-aware training (QAT), and dynamic/static quantization flows.
Familiarity with embedded real-time constraints and hardware profiling/debugging tools.
Familiarity with rearchitecting models to best suit hardware capabilities
Publication record in top-tier ML/Systems conferences (e.g., MLSys, NeurIPS, DAC,
ICCAD).


Pay Disclosure

Pay Range: The salary range for this role is $139,000 - $155,000 annually for Bay Area based applicants. This is the lowest to highest salary we in good faith believe we would pay for this role at the time of this posting. An employee’s position within the salary range will be based on several factors including, but not limited to, specific competencies, relevant education, qualifications, certifications, experience, skills, geographic location, shift, and organizational needs. The successful candidate may be eligible for annual performance bonus and equity awards.


Benefits: We offer a comprehensive package of benefits for full-time and part-time employees, their spouse or domestic partner, and children up to age 26, including but not limited to paid vacation, paid sick leave, and a competitive portfolio of insurance benefits including life, medical, dental, vision, short-term disability insurance, and long-term disability insurance to eligible employees. You may also have the opportunity to participate in Rivian’s 401(k) Plan and Employee Stock Purchase Program if you meet certain eligibility requirements. Full-time employee coverage is effective on their first day of employment. Part-time employee coverage is effective the first of the month following 90 days of employment. More information about benefits is available at rivianbenefits.com.

You can apply for this role through careers.rivian.com (or through internal-careers-rivian.icims.com if you are a current employee). This job is not expected to be closed any sooner than 4/30/2026.



Equal Opportunity

Rivian is an equal opportunity employer and complies with all applicable federal, state, and local fair employment practices laws. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, sex, sexual orientation, gender, gender expression, gender identity, genetic information or characteristics, physical or mental disability, marital/domestic partner status, age, military/veteran status, medical condition, or any other characteristic protected by law.

Rivian is committed to ensuring that our hiring process is accessible for persons with disabilities. If you have a disability or limitation, such as those covered by the Americans with Disabilities Act, that requires accommodations to assist you in the search and application process, please email us at candidateaccommodations@rivian.com.

Candidate Data Privacy

Rivian may collect, use and disclose your personal information or personal data (within the meaning of the applicable data protection laws) when you apply for employment and/or participate in our recruitment processes (“Candidate Personal Data”). This data includes contact, demographic, communications, educational, professional, employment, social media/website, network/device, recruiting system usage/interaction, security and preference information. Rivian may use your Candidate Personal Data for the purposes of (i) tracking interactions with our recruiting system; (ii) carrying out, analyzing and improving our application and recruitment process, including assessing you and your application and conducting employment, background and reference checks; (iii) establishing an employment relationship or entering into an employment contract with you; (iv) complying with our legal, regulatory and corporate governance obligations; (v) recordkeeping; (vi) ensuring network and information security and preventing fraud; and (vii) as otherwise required or permitted by applicable law. 

Rivian may share your Candidate Personal Data with (i) internal personnel who have a need to know such information in order to perform their duties, including individuals on our People Team, Finance, Legal, and the team(s) with the position(s) for which you are applying; (ii) Rivian affiliates; and (iii) Rivian’s service providers, including providers of background checks, staffing services, and cloud services. 

Rivian may transfer or store internationally your Candidate Personal Data, including to or in the United States, Canada, the United Kingdom, and the European Union and in the cloud, and this data may be subject to the laws and accessible to the courts, law enforcement and national security authorities of such jurisdictions.  

Please note that we are currently not accepting applications from third party application services.

Qualifications:

Basic Qualifications:
B.S. in Computer Engineering, Electrical Engineering, Computer Science, or related field with a focus on ML compilers, embedded systems, or hardware-aware AI.
Hands-on experience with quantized model deployment, ML design stacks, and code generation for embedded or heterogeneous compute systems.
Understanding of computer vision models (e.g., object detection, segmentation) and their optimization for edge inference.
Experience in deep learning frameworks (e.g., PyTorch, TensorFlow) and their low-level IRs or export formats (e.g., ONNX).
Solid programming skills in C++, Python
Familiarity with CUDA/OpenCL (or other accelerator programming models).

Preferred Qualifications:
Prior experience working with hardware-software co-design, especially for autonomous or robotics platforms.
Knowledge of numerical precision trade-offs, quantization-aware training (QAT), and dynamic/static quantization flows.
Familiarity with embedded real-time constraints and hardware profiling/debugging tools.
Familiarity with rearchitecting models to best suit hardware capabilities
Publication record in top-tier ML/Systems conferences (e.g., MLSys, NeurIPS, DAC,
ICCAD).

Education:UNAVAILABLEEmployment Type: FULL_TIME

What Rivian employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Rivian logo

About Rivian

Sourced by ZipRecruiter

Rivian is a pioneering automotive industry player headquartered in Irvine, California. Established in 2009, the company has made notable advancements in developing sustainable transportation solutions. It is widely recognized for its electric adventure vehicles: the R1T pickup and the R1S SUV. Rivian is dedicated to creating a positive shift in societal mobility and emphasizes sustainability, innovation, and adventure as part of its core values. Their mission is to keep the world adventurous forever - a testament to their commitment in transitioning the world to sustainable transportation. Rivian's achievements are numerous, with one of the most notable being securing a significant multi-billion dollar investment from Amazon for the production of electric delivery vans.

Industry

Automobile dealers

Company size

10,000+ Employees

Headquarters location

Irvine, CA, US

Year founded

2009