Deep Learning Quantization Jobs in New York (NOW HIRING)

Lead Machine Learning Engineer-MLOps

$112K - $147K/yr

Implement quantization techniques and deploy large language models (LLMs) to maximize efficiency ... Deep knowledge and passion for data science fundamentals, training and deploying models

Nvidia

Senior Scientist, Synthetic Data and Privacy

Publish original research at top machine learning and AI conferences to maintain NVIDIA's technical ... Deep technical understanding of LLMs and inference optimization (quantization, distillation ...

Invoca

Senior ML Engineer

New York, NY · On-site +1

$114K - $157K/yr

Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy ... models via quantization, batching, and throughput tuning * Proficiency with inference ...

Distributed Spectrum Inc

Machine Learning Research, RF Foundation Models Specialist

New York, NY · On-site

$200K - $300K/yr

Deep mathematical and modeling fundamentals * Strong hands-on experience with modern ML frameworks ... Experience building for constrained inference (quantization, kernel-level optimizations, or similar)

Sr Gen AI Engineer - NY/NJ

Manhattan, NY · On-site

The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...

AI Researcher

New York, NY · On-site

$175K - $250K/yr

... data analysis, vector quantization, decision tree methods, EM methods, Bayesian methods ... Demonstration of deep knowledge of large language models and deep neural networks for practical ...

AI Researcher - Vatic Labs

Manhattan, NY · On-site

$175K - $250K/yr

Generative AI - Group Manager - Senior Vice President

Jersey City, NJ · On-site

We're looking for someone who combines deep technical expertise in generative AI with a proven ... learning). * Model Optimization: Expertise in model compression and quantization methods (AWQ, GPTQ ...

Software Engineer - Model Products

New York, NY · On-site

$180K - $360K/yr

... quantization, batching, and KV‑cache reuse. * Instrument deep observability (metrics, traces ... Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Engineering Manager - Model Performance

New York, NY · On-site

$260K - $380K/yr

... such as quantization, speculative decoding, or continuous batching. * Deep knowledge of GPU ... Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Normal Computing

Research Engineer, Inference

New York, NY · On-site

$250K - $325K/yr

Backed by $85M+ from the world's leading deep-tech investors and built by scientists, engineers ... Experience with inference optimization: quantization, sparsity, kernel fusion, or memory-efficient ...

Applied AI/ML Lead - Vice President - Payments

Master's or PhD in Computer Science, Machine Learning, Statistics, Mathematics, Operations Research ... Distillation / compression (teacher-student, quantization-aware approaches, latency/cost-driven ...

JP Morgan Chase

Applied AI/ML Lead - Vice President - Payments

Manhattan, NY · On-site

Master's or PhD in Computer Science, Machine Learning, Statistics, Mathematics, Operations Research ... Distillation / compression (teacher-student, quantization-aware approaches, latency/cost-driven ...

JPMorgan Chase & Co

Applied AI/ML Lead - Vice President - Payments

Manhattan, NY · On-site

Master's or PhD in Computer Science, Machine Learning, Statistics, Mathematics, Operations Research ... Distillation / compression (teacher-student, quantization-aware approaches, latency/cost-driven ...

Senior AI Engineer - Vice President

Jersey City, NJ · On-site

$142K - $213K/yr

... learning initiatives. * Assess and manage risks in business decisions, safeguarding the firm ... Expert Python proficiency for AI/ML development, data engineering, and backend services; deep ...

JPMorgan Chase & Co.

Applied AI/ML Lead - Vice President - Payments

Manhattan, NY · On-site

$164K - $260K/yr

Master's or PhD in Computer Science, Machine Learning, Statistics, Mathematics, Operations Research ... Distillation / compression (teacher-student, quantization-aware approaches, latency/cost-driven ...

Engineering

New York, NY · On-site

$260K - $380K/yr

Deep personal background in GPU kernel engineering. You have written and shipped production CUDA ... Background in LLM inference kernels: attention variants, GEMMs, quantization (FP8/FP4), MoE routing
