1

Ml Inference Jobs in Oregon (NOW HIRING)

Senior Software Engineer II, (ML/AI Platform)

OR · Remote

$122K - $161K/yr

Overview At Instacart, the ML/AI Platform team is a critical part of enabling the business across ... You'll take ownership of defining the platform to enable AI model fine-tuning and batch inference ...

OR · On-site

Experience with AI/ML training workflows and distributed job orchestration using tools like Ray. * Deep expertise of transformer networks and experience deploying NVIDIA inference technologies ...

OR

$466K - $750K/yr

By exploring the frontiers of AI/ML and intersecting fields, the Machine Learning and Inference Research team turns these opportunities into tangible benefits for our members and our business. The ...

OR

$466K - $750K/yr

We are looking for a Senior ML Engineer to bridge the gap between cutting-edge AI research and ... Optimization & Inference: Own the performance side of GenAI. You will optimize model latency and ...

OR · On-site

$35 - $43/hr

Develop, evaluate, and iterate on AI/ML models (including LLMs) to solve a core business problem ... Solid grasp of probability and statistics concepts (e.g., distributions, Bayesian inference ...

OR

$466K - $750K/yr

By exploring the frontiers of AI/ML and intersecting fields, the Machine Learning and Inference Research team turns these opportunities into tangible benefits for our members and our business. The ...

OR

$466K - $750K/yr

We are looking for a Senior ML Engineer to bridge the gap between cutting-edge AI research and ... Optimization & Inference: Own the performance side of GenAI. You will optimize model latency and ...

OR

$104K - $143K/yr

Familiarity with streaming inference, real-time voice pipelines, or media systems. * Experience working closely with infrastructure or platform teams on production ML deployment, observability, and ...

OR · On-site

... inference at scale. * Diagnose and resolve intricate system issues and performance bottlenecks ... Commit to continuous professional growth through formal AI/ML certifications and special projects ...

OR

$300K - $537K/yr

The Role We are looking for a Machine Learning Scientist to join our team and bring deep ML and causal inference rigor to some of the hardest quantitative problems in subscription pricing. You will ...

Lead workshops, demos, and proof-of-concepts to showcase NVIDIA's AI/ML capabilities. * Guide customers on standard processes for scalable AI model deployment and inference optimization. What we need ...

Senior Data Engineer

OR · Remote

$105K - $143K/yr

About the Role As a Senior Data Engineer focused on AI/ML , you'll architect, build, and operate ... time inference features. This is a hands-on role where you will own large data sub-systems ...

OR · On-site

Mentor ML engineers to build expertise in ranking, causal inference, and scalable serving systems. About You Minimum Qualifications * 5+ years applying ML at scale (3+ years in technical leadership ...

OR

$300K - $537K/yr

The Role We are looking for a Machine Learning Scientist to join our team and bring deep ML and causal inference rigor to some of the hardest quantitative problems in subscription pricing. You will ...

next page

Showing results 1-20

Ml Inference information

What is ML inference?

ML inference refers to the process of using a trained machine learning model to make predictions or decisions based on new data. After a model has been trained on historical data, inference is the phase where that model is deployed and used in real-world applications, such as recognizing speech, detecting objects in images, or recommending products. The focus in ML inference is on speed, efficiency, and scalability to ensure quick predictions, often in real time. This process is critical for practical applications like mobile apps, web services, and embedded systems. Optimizing inference involves reducing latency, memory usage, and computational requirements.

What is the difference between Ml Inference vs Data Scientist?

AspectML InferenceData Scientist
Required CredentialsKnowledge of machine learning models, programming skillsDegree in data science, statistics, or related fields
Work EnvironmentDeploying models in production, real-time data processingData analysis, model development, research
Industry UsageAI product deployment, software companiesResearch institutions, tech firms, consulting

ML Inference focuses on deploying trained models to make predictions on new data, often in real-time. Data Scientists develop and analyze models, working primarily in research and development. While both roles require understanding of machine learning, ML Inference emphasizes deployment and operationalization, whereas Data Scientists focus on model creation and analysis.

What are some common challenges faced by ML Inference Engineers when deploying models to production?

ML Inference Engineers often encounter challenges such as optimizing model latency and throughput to meet production requirements, ensuring compatibility with diverse hardware environments, and managing model versioning and updates without disrupting service. Additionally, balancing resource utilization and inference accuracy while monitoring real-time performance metrics is crucial. Collaboration with data scientists, DevOps, and software engineers is typically essential to streamline deployment and maintain robust, scalable inference pipelines.

What are the key skills and qualifications needed to thrive in ML Inference, and why are they important?

To thrive in ML Inference, you need a solid background in machine learning principles, programming (Python or C++), and experience with deploying models at scale, often supported by a degree in computer science or a related field. Familiarity with frameworks and tools such as TensorFlow, PyTorch, ONNX, and cloud platforms like AWS SageMaker or Google AI Platform is typically required. Strong problem-solving skills, attention to detail, and effective communication are crucial soft skills for collaborating with multidisciplinary teams and optimizing model performance. These skills ensure efficient, scalable, and reliable deployment of machine learning solutions in real-world applications.
What are popular job titles related to Ml Inference jobs in Oregon? For Ml Inference jobs in Oregon, the most frequently searched job titles are:
What job categories do people searching Ml Inference jobs in Oregon look for? The top searched job categories for Ml Inference jobs in Oregon are:
What cities in Oregon are hiring for Ml Inference jobs? Cities in Oregon with the most Ml Inference job openings:
Engineering Manager, Machine Learning (Caper)

Engineering Manager, Machine Learning (Caper)

Instacart

OR • Remote

Other

Posted 13 days ago


Instacart rating

6.7

Company rating: 6.7 out of 10

Based on 29 frontline employees who took The Breakroom Quiz


Job description

Overview

Caper Carts are AI-powered, intelligent shopping carts developed by Instacart that let customers scan, weigh, and pay for items directly on the cart-eliminating checkout lines. Equipped with cameras and sensors, these carts automatically recognize items, offer personalized promotions, and feature a touchscreen for real-time, interactive shopping. This machine learning team builds the brain behind the cart.

We're hiring an Engineering Manager, Machine Learning and Computer Vision to lead a team of talented CV, ML and AI infrastructure engineers who power perception, multimodal understanding, and edge inference for Caper Carts. You will own the roadmap for how our carts see and reason about what's in the basket, and you'll build the platforms and models that make checkout seamless in dynamic, real-world retail environments. Your direct team will be ~10 engineers within a broader organization of ~30 spanning Android and hardware.

This is a high-impact role at the frontier of physical AI-bridging edge devices in stores with cloud-scale data and training systems. You'll partner closely with Android, hardware, product, and operations to deliver measurable improvements in recognition accuracy, latency, and reliability. The role is remote across Canada; West Coast time zones are ideal, but we're open to great talent anywhere in the country. Learn more about our work at Connecting stores from edge to cloud: reinventing retail with physical AI.

About the Job
  • Lead and grow a team of ~10 ML, CV and AI infrastructure engineers building the perception and reasoning systems that power Caper Carts in live retail environments.
  • Define the technical vision, roadmap, and success metrics for cart perception and multimodal understanding; prioritize work that drives measurable gains in item recognition accuracy, checkout speed, and system reliability.
  • Architect scalable training, data, and inference platforms on GCP using Ray, Kubernetes, and modern MLOps practices to enable rapid experimentation and safe, repeatable deployments.
  • Deliver production-grade CV/VLM models for multi-camera item detection, weighing, and basket reasoning; optimize on-device inference for low-latency, high-availability operation at the edge.
  • Build the data flywheel end-to-end-instrumentation, labeling, evaluation, offline/online testing, and monitoring-to continuously improve performance across diverse store conditions.
  • Collaborate cross-functionally with Android, hardware, product, design, operations, and retailer partners; communicate risks, tradeoffs, and timelines clearly in a fast-paced, ever-evolving environment.
About YouMinimum Qualifications
  • 8+ years of experience building and deploying machine learning systems, with a strong focus on computer vision in production environments.
  • 2+ years of experience managing teams of 6+ ML/CV/AI engineers, including hiring, performance management, and career development.
  • Hands-on expertise with computer vision, deep learning (e.g., PyTorch), model training/evaluation, and MLOps practices for reliable CI/CD of ML services.
  • Proven experience architecting and operating ML infrastructure on GCP (e.g., GKE, Vertex AI, BigQuery) and distributed training/inference with Ray; containerization with Docker and orchestration with Kubernetes.
  • Experience delivering real-time edge inference, including model optimization (e.g., TensorRT, ONNX, quantization) and monitoring for latency, throughput, and accuracy.
  • Proficiency in Python and SQL, with a track record of shipping end-to-end CV systems including data pipelines, experimentation, deployment, and post-launch iteration.
  • Bachelor's degree in Computer Science, Electrical/Computer Engineering, or a related technical field, or equivalent practical experience.
Preferred Qualifications
  • Experience integrating on-device ML with Android applications and collaborating closely with Android teams on SDKs and APIs.
  • Background with multimodal vision-language models (VLMs) and large language models (LLMs) for perception, retrieval, or instruction-based reasoning.
  • Experience with sensors and hardware integration (e.g., multi-camera setups, weight sensors), calibration, and dataset generation for robotics or retail environments.
  • Demonstrated success leading cross-functional programs across 3+ partner teams and delivering multi-quarter roadmaps.
  • Graduate degree (MS/PhD) in a relevant field with research or applied focus in computer vision, machine learning, or robotics.

#LI-Remote


What Instacart employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Instacart logo

About Instacart

Sourced by ZipRecruiter

Instacart, based in San Francisco, CA, US, operates within the retail industry, specifically grocery delivery and pick-up service. It is recognized as a pioneer in this field, delivering fresh groceries from local stores directly to customers' doors. The company, which launched its services in 2012, continues to pioneer change in the online grocery shopping sector through its commitment to cutting-edge technology, new business ideas, and dedicated service.

Industry

Technology, communication and media

Company size

10,000+ Employees

Headquarters location

San Francisco, CA, US

Year founded

2012