... Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and ...
... Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and ...
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
Machine Learning Engineer
Palo Alto, CA · On-site
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
Machine Learning Engineer
Palo Alto, CA · On-site
As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine ... Hands on experience with quantization techniques (AWQ, GPTQ, FP8/GGUF)
Machine Learning Engineer, Robotics Santa Clara, CA XPENG is a leading smart technology company at ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
Machine Learning Engineer, Robotics Santa Clara, CA XPENG is a leading smart technology company at ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
Senior Machine Learning Engineer
San Jose, CA · On-site
$122K - $168K/yr
... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ... Electrical Engineering, Machine Learning, or related fields. • Must have prior experience ...
Senior Machine Learning Engineer
San Jose, CA · On-site
$122K - $168K/yr
... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ... Electrical Engineering, Machine Learning, or related fields. • Must have prior experience ...
Innovate quantization-aware-training recipes and algorithms that tackle complex optimization ... Degree or equivalent experience in Computer Science, Machine Learning, Robotics, Computer Vision ...
Innovate quantization-aware-training recipes and algorithms that tackle complex optimization ... Degree or equivalent experience in Computer Science, Machine Learning, Robotics, Computer Vision ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated machine learning engineers of all levels who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
... Level Machine Learning Engineer to develop and optimize machine learning models for edge AI ... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ...
... Level Machine Learning Engineer to develop and optimize machine learning models for edge AI ... quantization, pruning, and knowledge distillation. • Collaborate with cross-functional teams to ...
Machine Learning Engineer
Sunnyvale, CA · Remote
$70 - $80/hr
Health insurance Machine Learning Engineer 100% Remote We are seeking a highly skilled Machine Learning Engineer to design, develop, deploy, and maintain scalable machine learning solutions that ...
Quick apply
Machine Learning Engineer
Sunnyvale, CA · Remote
$70 - $80/hr
Health insurance Machine Learning Engineer 100% Remote We are seeking a highly skilled Machine Learning Engineer to design, develop, deploy, and maintain scalable machine learning solutions that ...
Machine Learning Engineer
San Francisco, CA · On-site +1
$172K - $384K/yr
They're now looking for a Machine Learning Engineer to help build the next generation of AI-powered tools that generate structured visuals from scientific inputs . If you're excited by real-world ...
Machine Learning Engineer
San Francisco, CA · On-site +1
$172K - $384K/yr
They're now looking for a Machine Learning Engineer to help build the next generation of AI-powered tools that generate structured visuals from scientific inputs . If you're excited by real-world ...
With this mission, we are looking for passionated and seasoned machine learning engineers who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated and seasoned machine learning engineers who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated and seasoned machine learning engineers who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
With this mission, we are looking for passionated and seasoned machine learning engineers who will ... Work on efficient foundation models (e.g. small LLMs, weight sharing, model quantization, etc ...
Machine Learning Engineer We are looking for talented Machine Learning Engineers to join Prescient Design, a division devoted to developing structural and machine learning based methods for molecular ...
Machine Learning Engineer We are looking for talented Machine Learning Engineers to join Prescient Design, a division devoted to developing structural and machine learning based methods for molecular ...
About the job Machine Learning Engineer Glint Tech Solutions is Hiring an experienced Machine Learning Engineer to join our client's high-performing team, working on cutting-edge ML infrastructure ...
About the job Machine Learning Engineer Glint Tech Solutions is Hiring an experienced Machine Learning Engineer to join our client's high-performing team, working on cutting-edge ML infrastructure ...
We are looking for a highly motivated and skilled Machine Learning Integration Engineer to join our ... Strong knowledge of model compression techniques such as pruning, distillation, quantization and ...
We are looking for a highly motivated and skilled Machine Learning Integration Engineer to join our ... Strong knowledge of model compression techniques such as pruning, distillation, quantization and ...
Machine Learning Engineer Location: Fremont, CA Duration: 12+ Months Tesla/ $65 About the Role Our direct client is seeking a highly skilled Machine Learning Engineer to join their Software Machine ...
Machine Learning Engineer Location: Fremont, CA Duration: 12+ Months Tesla/ $65 About the Role Our direct client is seeking a highly skilled Machine Learning Engineer to join their Software Machine ...
Poesis Machine Learning Engineer At Poesis, machine learning and artificial intelligence open the door to improved alpha discovery, higher quality decision-making and intelligent risk management. We ...
Poesis Machine Learning Engineer At Poesis, machine learning and artificial intelligence open the door to improved alpha discovery, higher quality decision-making and intelligent risk management. We ...
Machine Learning Engineer LeanData helps the world's fastest-growing companies automate, simplify, and accelerate revenue. We are looking for a curious and innovative Machine Learning Engineer to ...
Machine Learning Engineer LeanData helps the world's fastest-growing companies automate, simplify, and accelerate revenue. We are looking for a curious and innovative Machine Learning Engineer to ...
Machine Learning Engineer
$149K - $250K/yr
Machine Learning Engineer Location: San Francisco, CA Salary: $149,998.00 - $250,000.00 Responsibilities * Build, maintain, and improve efficient and reliable data mining and machine learning models.
Machine Learning Engineer
$149K - $250K/yr
Machine Learning Engineer Location: San Francisco, CA Salary: $149,998.00 - $250,000.00 Responsibilities * Build, maintain, and improve efficient and reliable data mining and machine learning models.
Machine Learning Engineer Quantization information
See Sunnyvale, CA salary details
$37K - $54.3K
1% of jobs
$54.3K - $71.5K
1% of jobs
$71.5K - $88.8K
5% of jobs
$88.8K - $106.1K
6% of jobs
$120.4K is the 25th percentile. Wages below this are outliers.
$106.1K - $123.4K
14% of jobs
$123.4K - $140.7K
14% of jobs
The median wage is $149.3K / yr.
$140.7K - $158K
18% of jobs
$158K - $175.2K
14% of jobs
$178.8K is the 75th percentile. Wages above this are outliers.
$175.2K - $192.5K
12% of jobs
$192.5K - $209.8K
11% of jobs
$209.8K - $227.1K
5% of jobs
$37K
$151.1K
$227.1K
How much do machine learning engineer quantization jobs pay per year?
What are some common challenges Machine Learning Engineers face when implementing quantization techniques in production models?
What are the key skills and qualifications needed to thrive as a Machine Learning Engineer Quantization, and why are they important?
What does a Machine Learning Engineer Quantization do?
What is the difference between Machine Learning Engineer Quantization vs Data Scientist?
| Aspect | Machine Learning Engineer Quantization | Data Scientist |
|---|---|---|
| Required Credentials | Bachelor's or master's in CS, ML, or related; certifications in ML or AI | Bachelor's or master's in statistics, CS, or related; certifications in data analysis or statistics |
| Work Environment | Developing optimized ML models, deploying quantized models for efficiency | Analyzing data, building predictive models, interpreting results |
| Industry Usage | Tech companies, AI hardware firms, embedded systems | Finance, healthcare, marketing, research institutions |
Machine Learning Engineer Quantization focuses on optimizing ML models for deployment efficiency, often working closely with hardware and software teams. Data Scientists analyze data and build models for insights. While both roles require ML knowledge, quantization engineers specialize in model compression techniques, whereas data scientists focus on data analysis and interpretation.
- Machine Learning Engineer Hybrid
- Contract Apple Machine Learning Engineer
- Reinforcement Learning Engineer
- Physics Based Machine Learning
- Junior Machine Learning
- Remote Google Machine Learning Engineer
- Online Machine Learning
- Weekend Machine Learning
- Seasonal Medical Imaging Machine Learning
- Machine Learning Engineer Two
Staff Machine Learning Engineer - Model Optimization & Quantization
QualcommSanta Clara, CA • On-site
Full-time
This job post has expired today. Applications are no longer accepted.
Qualcomm rating
9.6
Based on 5 frontline employees who took The Breakroom Quiz
5th of 190 rated software companies
Job description
Qualcomm Technologies, Inc. is seeking a Staff Machine Learning Engineer to join their AI Hub team. The role involves developing tools for optimizing and deploying machine learning models on edge and mobile hardware, focusing on model quantization and compression techniques.
Responsibilities:
• Design, develop, and maintain quantization algorithms and compression pipelines within the AIMET framework (PTQ, QAT, mixed-precision, AdaScale etc.)
• Implement advanced quantization techniques including weight-only quantization, activation quantization, KV-cache quantization, and sub-4-bit quantization for LLMs and generative AI models
• Build tooling to analyze, profile, and debug model accuracy degradation caused by quantization
• Integrate AIMET workflows with popular ML frameworks — PyTorch and ONNX
• Develop APIs and developer-facing tooling to make AIMET accessible and easy to use for external customers and design partners
• Integrate AIMET in AI Hub Workbench Quantize job to enable Quantization at large scale.
• Own end-to-end quantization and optimization of models published on Qualcomm AI Hub, ensuring they meet accuracy, latency, and power targets on Qualcomm hardware
• Quantize and validate a broad range of model families — vision transformers, LLMs, diffusion models, speech, and multimodal architectures — for deployment via AI Hub
• Develop and maintain automated quantization pipelines and evaluation harnesses to scale model onboarding across AI Hub's growing model catalog
Qualifications:
Required:
• Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
• OR Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
• OR PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
Preferred:
• 3+ years of industry experience in machine learning, deep learning, or AI infrastructure
• Strong proficiency in Python, with hands-on experience in PyTorch, ONNX and/or TensorFlow
• Solid understanding of neural network architectures — CNNs, Transformers, LLMs, diffusion models, multimodal models
• Experience with model quantization techniques — PTQ, QAT, weight-only quantization, mixed-precision, sub-4-bit methods
• Hands-on experience quantizing LLMs (GPT, LLaMA, Mistral, Falcon, or similar families) for inference optimization
• Familiarity with AIMET, GPTQ, AWQ, SmoothQuant, or similar quantization frameworks is a strong plus
• Experience working with ONNX, TFLite/LiteRT, or other model interchange formats
• Understanding of hardware constraints: memory bandwidth, compute precision (INT4/INT8/FP16/BF16), and NPU/DSP execution
• Experience collaborating across teams or BUs to drive technical alignment and model delivery
• Proficiency with git and software development best practices
• Strong written and verbal communication skills — ability to write clean APIs, documentation, and engage directly with external developers
• Experience with C++ for performance-critical components is a bonus
• Familiarity with ARM processors and mobile SoC architecture (Snapdragon) is a plus
• Experience with automated evaluation pipelines and model benchmarking at scale is a plus
Company:
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices. Founded in 1985, the company is headquartered in San Diego, USA, with a team of 10001+ employees. The company is currently Late Stage.
About Qualcomm
Sourced by ZipRecruiter
Qualcomm is enabling a world where everyone and everything can be intelligently connected. You interact with products and technologies made possible by Qualcomm every day, including 5G-enabled smartphones that double as pro-level cameras and gaming devices, smarter vehicles and cities, and the technology behind the smart, connected factories that manufactured your latest purchase. Our powerful connectivity solutions keep you connected—even in remote areas. Qualcomm 5G and AI innovations are the power behind the connected intelligent edge. You’ll find our technologies behind and inside the innovations that deliver significant value across multiple industries and to billions of people every day.
Industry
Technology, communication and media
Company size
10,000+ Employees
Headquarters location
San Diego, CA, US
Year founded
1985