... developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You ...
... developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You ...
... developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You ...
... developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You ...
... Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and ...
... Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and ...
... Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and ...
... Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and ...
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
Machine Learning Engineer
Palo Alto, CA · On-site
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
Machine Learning Engineer
Palo Alto, CA · On-site
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
Machine Learning Engineer
Washington, DC · On-site +1
Machine Learning Engineer We're seeking a skilled Machine Learning Engineer to build and deploy ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...
Machine Learning Engineer
Washington, DC · On-site +1
Machine Learning Engineer We're seeking a skilled Machine Learning Engineer to build and deploy ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...
Machine Learning Engineer, Robotics Santa Clara, CA XPENG is a leading smart technology company at ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
Machine Learning Engineer, Robotics Santa Clara, CA XPENG is a leading smart technology company at ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
Senior Machine Learning Engineer
New York, NY · On-site
$114K - $157K/yr
Sr. Machine Learning Engineer Location: New York, NY Sponsorship: Yes Relocation: Yes Industry ... model quantization, fixed point neural networks (CNN and RNN) * Excellent research and problem ...
Quick apply
Senior Machine Learning Engineer
New York, NY · On-site
$114K - $157K/yr
Sr. Machine Learning Engineer Location: New York, NY Sponsorship: Yes Relocation: Yes Industry ... model quantization, fixed point neural networks (CNN and RNN) * Excellent research and problem ...
Senior Machine Learning Engineer
San Jose, CA · On-site
$122K - $168K/yr
... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ... Electrical Engineering, Machine Learning, or related fields. • Must have prior experience ...
Senior Machine Learning Engineer
San Jose, CA · On-site
$122K - $168K/yr
... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ... Electrical Engineering, Machine Learning, or related fields. • Must have prior experience ...
Innovate quantization-aware-training recipes and algorithms that tackle complex optimization ... Degree or equivalent experience in Computer Science, Machine Learning, Robotics, Computer Vision ...
Innovate quantization-aware-training recipes and algorithms that tackle complex optimization ... Degree or equivalent experience in Computer Science, Machine Learning, Robotics, Computer Vision ...
As a Machine Learning Engineer, you will shape the technical direction of the company by automating ... quantization. Qualifications : Required : • 3 to 5 years of industry experience in full-stack ...
As a Machine Learning Engineer, you will shape the technical direction of the company by automating ... quantization. Qualifications : Required : • 3 to 5 years of industry experience in full-stack ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
Machine Learning Engineer
Washington, DC · On-site +1
We're seeking a skilled Machine Learning Engineer to build and deploy production ML systems for the ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...
Machine Learning Engineer
Washington, DC · On-site +1
We're seeking a skilled Machine Learning Engineer to build and deploy production ML systems for the ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...
... Level Machine Learning Engineer to develop and optimize machine learning models for edge AI ... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ...
... Level Machine Learning Engineer to develop and optimize machine learning models for edge AI ... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ...
Machine Learning Engineer
Cambridge, MA · On-site +1
Machine Learning Engineer - Computer Vision & Robotics Tycho.AI is redefining the future of ... CUDA kernel development and model optimization (quantization, pruning, distillation). * Experience ...
Quick apply
Machine Learning Engineer
Cambridge, MA · On-site +1
Machine Learning Engineer - Computer Vision & Robotics Tycho.AI is redefining the future of ... CUDA kernel development and model optimization (quantization, pruning, distillation). * Experience ...
Machine Learning Engineer - Edge
Dover, NH · On-site +1
$86K - $135K/yr
Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...
Machine Learning Engineer - Edge
Dover, NH · On-site +1
$86K - $135K/yr
Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...
Machine Learning Engineer - Edge
Dover, NH · On-site +1
$86K - $135K/yr
Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...
Machine Learning Engineer - Edge
Dover, NH · On-site +1
$86K - $135K/yr
Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...
Machine Learning Engineer - Edge
Lowell, MA · On-site +1
$86K - $135K/yr
Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...
Machine Learning Engineer - Edge
Lowell, MA · On-site +1
$86K - $135K/yr
Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...
Machine Learning Engineer Quantization information
See salary details
$31.5K - $46.2K
1% of jobs
$46.2K - $61K
1% of jobs
$61K - $75.7K
5% of jobs
$75.7K - $90.4K
6% of jobs
$102.6K is the 25th percentile. Wages below this are outliers.
$90.4K - $105.1K
14% of jobs
$105.1K - $119.9K
14% of jobs
The median wage is $127.2K / yr.
$119.9K - $134.6K
18% of jobs
$134.6K - $149.3K
14% of jobs
$152.3K is the 75th percentile. Wages above this are outliers.
$149.3K - $164K
12% of jobs
$164K - $178.8K
11% of jobs
$178.8K - $193.5K
5% of jobs
$31.5K
$128.8K
$193.5K
How much do machine learning engineer quantization jobs pay per year?
What are some common challenges Machine Learning Engineers face when implementing quantization techniques in production models?
What are the key skills and qualifications needed to thrive as a Machine Learning Engineer Quantization, and why are they important?
What does a Machine Learning Engineer Quantization do?
What is the difference between Machine Learning Engineer Quantization vs Data Scientist?
| Aspect | Machine Learning Engineer Quantization | Data Scientist |
|---|---|---|
| Required Credentials | Bachelor's or master's in CS, ML, or related; certifications in ML or AI | Bachelor's or master's in statistics, CS, or related; certifications in data analysis or statistics |
| Work Environment | Developing optimized ML models, deploying quantized models for efficiency | Analyzing data, building predictive models, interpreting results |
| Industry Usage | Tech companies, AI hardware firms, embedded systems | Finance, healthcare, marketing, research institutions |
Machine Learning Engineer Quantization focuses on optimizing ML models for deployment efficiency, often working closely with hardware and software teams. Data Scientists analyze data and build models for insights. While both roles require ML knowledge, quantization engineers specialize in model compression techniques, whereas data scientists focus on data analysis and interpretation.
- Banister Pipeline
- Full Time No Experience Machine Learning
- Staff Machine Learning Engineer
- Online Machine Learning
- Machine Learning Engineer Opt
- Machine Learning Operations Engineer
- Machine Learning Engineer Associate
- Freelance Google Machine Learning Engineer
- Principal Machine Learning Engineer
- Ai Machine Learning Engineer

Full-time
Posted 4 days ago
Qualcomm rating
9.6
Based on 5 frontline employees who took The Breakroom Quiz
5th of 189 rated software companies
Job description
Qualcomm Technologies, Inc.
Job Area:
Engineering Group, Engineering Group > Machine Learning Engineering
General Summary:
About the Role
Join the Qualcomm AI Hub team and help developers integrate machine learning into their products and experiences: https://aihub.qualcomm.com/.
In this role you will develop tools to help developers optimize and deploy machine learning models on edge and mobile hardware. AIMET is Qualcomm's open-source library for state-of-the-art model quantization, and compression techniques. You will develop and support cutting-edge model optimization workflows - pushing the boundary of what's possible on resource-constrained hardware. Applications range from quantizing large language models (LLMs) and generative AI models to compressing latency-critical vision, audio, and multimodal networks for deployment on Qualcomm Snapdragon and other edge SoCs.
For this role we are seeking a talented and motivated Staff Software Engineer with expertise in the optimizing and deploying ML models - especially for edge devices.
What You'll Do
- Design, develop, and maintain quantization algorithms and compression pipelines within the AIMET framework (PTQ, QAT, mixed-precision, AdaScale etc.)
- Implement advanced quantization techniques including weight-only quantization, activation quantization, KV-cache quantization, and sub-4-bit quantization for LLMs and generative AI models
- Build tooling to analyze, profile, and debug model accuracy degradation caused by quantization
- Integrate AIMET workflows with popular ML frameworks - PyTorch and ONNX
- Develop APIs and developer-facing tooling to make AIMET accessible and easy to use for external customers and design partners
- Integrate AIMET in AI Hub Workbench Quantize job to enable Quantization at large scale.
- Own end-to-end quantization and optimization of models published on Qualcomm AI Hub, ensuring they meet accuracy, latency, and power targets on Qualcomm hardware
- Quantize and validate a broad range of model families - vision transformers, LLMs, diffusion models, speech, and multimodal architectures - for deployment via AI Hub
- Develop and maintain automated quantization pipelines and evaluation harnesses to scale model onboarding across AI Hub's growing model catalog
Minimum Qualifications:
• Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
Preferred Qualifications:
- 3+ years of industry experience in machine learning, deep learning, or AI infrastructure
- Strong proficiency in Python, with hands-on experience in PyTorch, ONNX and/or TensorFlow
- Solid understanding of neural network architectures - CNNs, Transformers, LLMs, diffusion models, multimodal models
- Experience with model quantization techniques - PTQ, QAT, weight-only quantization, mixed-precision, sub-4-bit methods
- Hands-on experience quantizing LLMs (GPT, LLaMA, Mistral, Falcon, or similar families) for inference optimization
- Familiarity with AIMET, GPTQ, AWQ, SmoothQuant, or similar quantization frameworks is a strong plus
- Experience working with ONNX, TFLite/LiteRT, or other model interchange formats
- Understanding of hardware constraints: memory bandwidth, compute precision (INT4/INT8/FP16/BF16), and NPU/DSP execution
- Experience collaborating across teams or BUs to drive technical alignment and model delivery
- Proficiency with git and software development best practices
- Strong written and verbal communication skills - ability to write clean APIs, documentation, and engage directly with external developers
- Experience with C++ for performance-critical components is a bonus
- Familiarity with ARM processors and mobile SoC architecture (Snapdragon) is a plus
- Experience with automated evaluation pipelines and model benchmarking at scale is a plus
Level of Responsibility
- Works independently with minimal supervision
- Provides technical guidance and mentorship to other team members
- Decision-making is significant and affects work beyond the immediate team
- Requires strong communication skills to convey complex quantization concepts to varied audiences - from hardware engineers and BU partners to external researchers and application developers
- Has meaningful influence on the AIMET product roadmap, AI Hub model catalog, and cross-BU quantization strategy
- Tasks are open-ended; planning, prioritization, and problem-solving are core to the role
Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.
EEO Employer: Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
Pay range and Other Compensation & Benefits:
$158,400.00 - $237,600.00
The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer - and you can review more details about our US benefits at this link.
If you would like more information about this role, please contact Qualcomm Careers.
About Qualcomm
Sourced by ZipRecruiter
Qualcomm is enabling a world where everyone and everything can be intelligently connected. You interact with products and technologies made possible by Qualcomm every day, including 5G-enabled smartphones that double as pro-level cameras and gaming devices, smarter vehicles and cities, and the technology behind the smart, connected factories that manufactured your latest purchase. Our powerful connectivity solutions keep you connected—even in remote areas. Qualcomm 5G and AI innovations are the power behind the connected intelligent edge. You’ll find our technologies behind and inside the innovations that deliver significant value across multiple industries and to billions of people every day.
Industry
Technology, communication and media
Company size
10,000+ Employees
Headquarters location
San Diego, CA, US
Year founded
1985