Ml Inference Jobs (NOW HIRING)

Manager, Software Engineering, ML Inference

Los Angeles, CA · On-site

Manager, Software Engineering, ML Inference

Los Angeles, CA · On-site

ML Inference Platform Manager: Scale ML Infra

Palo Alto, CA · On-site

$229 - $343/hr

We're looking for a Manager, Software Engineering, ML Inference to join Snap Inc.!What you'll do:Lead and mentor a team of ML Infrastructure engineers responsible for building and scaling the systems ...

New

ML Inference Platform Manager: Scale ML Infra

Palo Alto, CA · On-site

$229 - $343/hr

New

Xforia, Inc.

ML engineer

Optimize trained models for inference, ensuring they are ready for deployment. * Work closely with ... Serve as domain experts in ML inference, working closely with both internal teams and external ...

Xforia, Inc.

ML engineer

Anyscale, Inc

Distributed LLM Inference Engineer

San Francisco, CA · On-site

$170K - $245K/yr

Familiarity with running ML inference at large scale with high throughput and low latency ... Familiarity with deep learning and deep learning frameworks (e.g. PyTorch) * Solid understanding of ...

Anyscale, Inc

Distributed LLM Inference Engineer

San Francisco, CA · On-site

$170K - $245K/yr

Staff Backend Engineer, ML Inference Systems

$244K - $317K/yr

Partner with ML engineers to ensure online serving infrastructure scales with growing model complexity and inference volumes, without compromising latency or throughput * Ensure the reliability ...

Staff Backend Engineer, ML Inference Systems

$244K - $317K/yr

Senior Full-Stack Engineer - Web Platforms for ML Inference

Cupertino, CA · On-site

Within AIML, the Annotation AI Services organization builds the foundational systems that enable large-scale model inference, data pipelines, and ML platform capabilities for annotation platforms. We ...

Senior Full-Stack Engineer - Web Platforms for ML Inference

Cupertino, CA · On-site

CYNET SYSTEMS

Senior ML Engineer (GCP) - Remote / Telecommute

New York, NY · Remote

$65 - $70/hr

Strong foundation in ML inference, deployment, and quality testing. * Demonstrated ability to ramp up quickly on new and unfamiliar tech stacks. * End-to-end problem-solving mindset. * Core ML ...

Quick apply

CYNET SYSTEMS

Senior ML Engineer (GCP) - Remote / Telecommute

New York, NY · Remote

$65 - $70/hr

Senior Backend Engineer, ML Inference Systems

$178K - $268K/yr

Senior Backend Engineer, ML Inference Systems

$178K - $268K/yr

Staff Backend Engineer, ML Inference Systems

Mountain View, CA · On-site

$244K - $317K/yr

Staff Backend Engineer, ML Inference Systems

Mountain View, CA · On-site

$244K - $317K/yr

Reveille Technologies

ML Ops Engineer

Jersey City, NJ · On-site

ML Ops Engineer Location: Iselin NJ Experience Required Experience building production Al/ML systems at scale Deploying real-time ML inference pipelines processing millions of records at high ...

Reveille Technologies

ML Ops Engineer

Jersey City, NJ · On-site

ML Ops Engineer Location: Iselin NJ Experience Required Experience building production Al/ML systems at scale Deploying real-time ML inference pipelines processing millions of records at high ...

Third Way Health

AI / ML Engineer

Cambridge, MA · On-site

Develop predictive, real-time analytics systems that combine streaming data, ML inference, and event-driven triggers to surface insights and automate actions at scale. * Implement and maintain end-to ...

Third Way Health

AI / ML Engineer

Cambridge, MA · On-site

Software Engineer - GenAI inference

Solid understanding of ML inference internals: attention, MLPs, recurrent modules, quantization, sparse operations, etc. * Hands-on experience with CUDA, GPU programming, and key libraries (cuBLAS ...

Software Engineer - GenAI inference

Software Engineer - GenAI inference

San Francisco, CA · On-site

... ML inference internals: attention, MLPs, recurrent modules, quantization, sparse operations, etc ... • Hands-on experience with CUDA, GPU programming, and key libraries (cuBLAS, cuDNN, NCCL, etc ...

Software Engineer - GenAI inference

San Francisco, CA · On-site

Senior Full-Stack Engineer - Web Platforms for ML Inference

$184K - $324K/yr

Discover and configure inference services Interact with ML pipelines and workflows Monitor usage, health, and operational signals Establish best practices around testing, maintainability ...

Senior Full-Stack Engineer - Web Platforms for ML Inference

$184K - $324K/yr

Discover and configure inference services Interact with ML pipelines and workflows Monitor usage, health, and operational signals Establish best practices around testing, maintainability ...

Staff Software Engineer - GenAI inference

Deep understanding of ML inference internals: attention, MLPs, recurrent modules, quantization, sparse operations, etc. * Hands-on experience with CUDA, GPU programming, and key libraries (cuBLAS ...

Staff Software Engineer - GenAI inference

Software Development Engineer AI/ML, Inference Serving, AWS Neuron

Cupertino, CA · On-site

Collaborating with research teams on new ML serving capabilities * Driving technical decisions that shape the future of Neuron's inference stack About the team The Neuron Serving team is at the ...

Software Development Engineer AI/ML, Inference Serving, AWS Neuron

Cupertino, CA · On-site

Staff Software Engineer - GenAI inference

San Francisco, CA · On-site

... of ML inference internals: attention, MLPs, recurrent modules, quantization, sparse operations, etc. • Hands-on experience with CUDA, GPU programming, and key libraries (cuBLAS, cuDNN, NCCL, etc ...

Staff Software Engineer - GenAI inference

San Francisco, CA · On-site

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML ...

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Software Development Engineer AI/ML, Inference Serving, AWS Neuron

Software Development Engineer AI/ML, Inference Serving, AWS Neuron