1

Deep Learning Quantization Jobs in Oregon (NOW HIRING)

OR

$466K - $750K/yr

We are looking for an experienced Machine Learning Engineer with deep expertise in training and ... KV cache, batching, quantization, and long-context handling. Scale model training and inference ...

... of learning and excellence Minimum Qualifications * Bachelor's degree in Computer Science ... Deep expertise in LLM-specific infrastructure such as inference optimization (quantization, ONNX ...

Machine Learning/Artificial Intelligence powers innovation in all areas of the business, from ... Deep experience with distributed training at scale (FSDP, parallelism strategies, checkpointing) or ...

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

To excel as a Deep Learning Quantization Engineer, you need a strong background in machine learning, applied mathematics, and computer science, usually supported by an advanced degree in a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), quantization toolkits, and hardware acceleration platforms is crucial. Analytical thinking, problem-solving, and clear technical communication are standout soft skills in this role. These abilities are essential for efficiently optimizing models for deployment on resource-constrained hardware while maintaining accuracy and performance.

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

AspectDeep Learning QuantizationMachine Learning Engineer
Required CredentialsAdvanced degrees in AI, Computer Science, or related fields; knowledge of neural networksBachelor's or Master's in CS, Data Science, or related fields; programming skills
Work EnvironmentResearch labs, AI development teams, hardware optimization settingsSoftware development teams, data-driven projects, product-focused environments
Industry UsageAI hardware optimization, model deployment, edge computingModel development, data analysis, software solutions across industries

Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

What is deep learning quantization?

Deep learning quantization is the process of reducing the precision of the numbers used to represent a neural network's parameters, activations, or both. By converting the typically used 32-bit floating-point values to lower bit-width formats such as 16-bit or 8-bit integers, quantization significantly reduces the memory footprint and computational requirements of deep learning models. This technique helps deploy models efficiently on edge devices and mobile hardware while maintaining acceptable accuracy levels. Quantization is widely used in model optimization for faster inference and lower power consumption.

What are some common challenges faced when implementing deep learning quantization in production environments?

One of the main challenges in implementing deep learning quantization is balancing model accuracy with computational efficiency, as quantization can sometimes lead to a drop in model performance. Additionally, ensuring hardware compatibility and optimizing for different devices (such as CPUs, GPUs, or edge devices) can require extensive testing and tuning. Collaboration with data scientists, software engineers, and hardware specialists is often essential to successfully deploy quantized models at scale. Staying updated with the latest quantization techniques and frameworks is also important for overcoming these challenges.
What are popular job titles related to Deep Learning Quantization jobs in Oregon? For Deep Learning Quantization jobs in Oregon, the most frequently searched job titles are:
What job categories do people searching Deep Learning Quantization jobs in Oregon look for? The top searched job categories for Deep Learning Quantization jobs in Oregon are:
Engineering Manager, Machine Learning (Caper)

Engineering Manager, Machine Learning (Caper)

Instacart

OR • Remote

Other

Posted 24 days ago


Instacart rating

7.0

Company rating: 7.0 out of 10

Based on 30 frontline employees who took The Breakroom Quiz

32nd of 62 rated delivery companies


Job description

Overview

Caper Carts are AI-powered, intelligent shopping carts developed by Instacart that let customers scan, weigh, and pay for items directly on the cart-eliminating checkout lines. Equipped with cameras and sensors, these carts automatically recognize items, offer personalized promotions, and feature a touchscreen for real-time, interactive shopping. This machine learning team builds the brain behind the cart.

We're hiring an Engineering Manager, Machine Learning and Computer Vision to lead a team of talented CV, ML and AI infrastructure engineers who power perception, multimodal understanding, and edge inference for Caper Carts. You will own the roadmap for how our carts see and reason about what's in the basket, and you'll build the platforms and models that make checkout seamless in dynamic, real-world retail environments. Your direct team will be ~10 engineers within a broader organization of ~30 spanning Android and hardware.

This is a high-impact role at the frontier of physical AI-bridging edge devices in stores with cloud-scale data and training systems. You'll partner closely with Android, hardware, product, and operations to deliver measurable improvements in recognition accuracy, latency, and reliability. The role is remote across Canada; West Coast time zones are ideal, but we're open to great talent anywhere in the country. Learn more about our work at Connecting stores from edge to cloud: reinventing retail with physical AI.

About the Job
  • Lead and grow a team of ~10 ML, CV and AI infrastructure engineers building the perception and reasoning systems that power Caper Carts in live retail environments.
  • Define the technical vision, roadmap, and success metrics for cart perception and multimodal understanding; prioritize work that drives measurable gains in item recognition accuracy, checkout speed, and system reliability.
  • Architect scalable training, data, and inference platforms on GCP using Ray, Kubernetes, and modern MLOps practices to enable rapid experimentation and safe, repeatable deployments.
  • Deliver production-grade CV/VLM models for multi-camera item detection, weighing, and basket reasoning; optimize on-device inference for low-latency, high-availability operation at the edge.
  • Build the data flywheel end-to-end-instrumentation, labeling, evaluation, offline/online testing, and monitoring-to continuously improve performance across diverse store conditions.
  • Collaborate cross-functionally with Android, hardware, product, design, operations, and retailer partners; communicate risks, tradeoffs, and timelines clearly in a fast-paced, ever-evolving environment.
About YouMinimum Qualifications
  • 8+ years of experience building and deploying machine learning systems, with a strong focus on computer vision in production environments.
  • 2+ years of experience managing teams of 6+ ML/CV/AI engineers, including hiring, performance management, and career development.
  • Hands-on expertise with computer vision, deep learning (e.g., PyTorch), model training/evaluation, and MLOps practices for reliable CI/CD of ML services.
  • Proven experience architecting and operating ML infrastructure on GCP (e.g., GKE, Vertex AI, BigQuery) and distributed training/inference with Ray; containerization with Docker and orchestration with Kubernetes.
  • Experience delivering real-time edge inference, including model optimization (e.g., TensorRT, ONNX, quantization) and monitoring for latency, throughput, and accuracy.
  • Proficiency in Python and SQL, with a track record of shipping end-to-end CV systems including data pipelines, experimentation, deployment, and post-launch iteration.
  • Bachelor's degree in Computer Science, Electrical/Computer Engineering, or a related technical field, or equivalent practical experience.
Preferred Qualifications
  • Experience integrating on-device ML with Android applications and collaborating closely with Android teams on SDKs and APIs.
  • Background with multimodal vision-language models (VLMs) and large language models (LLMs) for perception, retrieval, or instruction-based reasoning.
  • Experience with sensors and hardware integration (e.g., multi-camera setups, weight sensors), calibration, and dataset generation for robotics or retail environments.
  • Demonstrated success leading cross-functional programs across 3+ partner teams and delivering multi-quarter roadmaps.
  • Graduate degree (MS/PhD) in a relevant field with research or applied focus in computer vision, machine learning, or robotics.

#LI-Remote


What Instacart employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Instacart logo

About Instacart

Sourced by ZipRecruiter

Instacart, based in San Francisco, CA, US, operates within the retail industry, specifically grocery delivery and pick-up service. It is recognized as a pioneer in this field, delivering fresh groceries from local stores directly to customers' doors. The company, which launched its services in 2012, continues to pioneer change in the online grocery shopping sector through its commitment to cutting-edge technology, new business ideas, and dedicated service.

Industry

Technology, communication and media

Company size

10,000+ Employees

Headquarters location

San Francisco, CA, US

Year founded

2012