1

Machine Learning Engineer Quantization Jobs (NOW HIRING)

... search, machine learning systems and quantization methods, and determine what translates to ... engineers, traders, and business operations professionals are united by our uniquely collaborative ...

Machine Learning Engineer - Edge

Dover, NH · On-site +1

$86K - $135K/yr

Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...

Machine Learning Engineer - Edge

Dover, NH · On-site

$86K - $135K/yr

Machine Learning Engineer - Edge *Please consider before applying: This is a hybrid role, and ... Apply techniques such as model compression, quantization, pruning, and distillation to improve ...

Machine Learning Engineer Company: HeyMilo AI Location: New York, NY, USA Contract Details: Full-time HeyMilo AI is a fast-growing startup based in New York City that specializes in developing ...

Senior Machine Learning Engineer

Austin, TX · On-site

$103K - $142K/yr

Senior Machine Learning Engineer We are seeking a Senior Machine Learning Engineer to support our ... Implement model optimization techniques such as quantization, pruning, distillation, and hardware ...

We're looking for researchers and experienced engineers from any background. Trading experience is ... search, machine learning systems and quantization methods, and determine what translates to ...

We are looking for a Machine Learning Engineer to help us create artificial intelligence products. Machine Learning Engineer responsibilities include creating machine learning models and retraining ...

Machine Learning Engineer 100% Remote We are seeking a highly skilled Machine Learning Engineer to design, develop, deploy, and maintain scalable machine learning solutions that drive business value.

Machine Learning Engineer 100% Remote We are seeking a highly skilled Machine Learning Engineer to design, develop, deploy, and maintain scalable machine learning solutions that drive business value.

Senior Machine Learning Engineer

Boston, MA · On-site +1

$174K - $287K/yr

As a Senior Principal Machine Learning Engineer focused on model optimization algorithms, you will ... Benchmark, profile, and evaluate different parallelizations, quantization and sparsification ...

Health insurance Machine Learning Engineer 100% Remote We are seeking a highly skilled Machine Learning Engineer to design, develop, deploy, and maintain scalable machine learning solutions that ...

Senior Machine Learning Engineer

Boston, MA · On-site +1

$174K - $287K/yr

As a Senior Principal Machine Learning Engineer focused on model optimization algorithms, you will ... Benchmark, profile, and evaluate different parallelizations, quantization and sparsification ...

Health insurance Machine Learning Engineer 100% Remote We are seeking a highly skilled Machine Learning Engineer to design, develop, deploy, and maintain scalable machine learning solutions that ...

Machine Learning Engineer Location: Fort Meade, MD Required Clearance : TS/SCI w/ Full-Scope Poly Salary: Competitive We are seeking a highly skilled and motivated Machine Learning Engineer to join ...

Machine Learning Engineer Position Overview Paylocity is growing its Machine Learning Engineering organization! Our machine learning engineering team is responsible for developing infrastructure and ...

Principal Machine Learning Engineer

Boston, MA · On-site +1

$189K - $312K/yr

As a Machine Learning Engineer focused on model optimization algorithms, you will work closely with ... Design, implement, and optimize model compression pipelines using techniques such as quantization ...

Machine Learning Engineer Position Overview Paylocity is growing its Machine Learning Engineering organization! Our machine learning engineering team is responsible for developing infrastructure and ...

next page

Showing results 1-20

Machine Learning Engineer Quantization information

See salary details

$31.5K

$128.8K

$193.5K

How much do machine learning engineer quantization jobs pay per year?

As of Jun 13, 2026, the average yearly pay for machine learning engineer quantization in the United States is $128,769.00, according to ZipRecruiter salary data. Most workers in this role earn between $101,500.00 and $155,000.00 per year, depending on experience, location, and employer.

What are some common challenges Machine Learning Engineers face when implementing quantization techniques in production models?

Machine Learning Engineers working on quantization often encounter challenges such as balancing reduced model size and computational efficiency with maintaining acceptable accuracy levels. Adapting quantization methods to different hardware platforms can also require significant testing and optimization. Additionally, engineers must frequently address compatibility issues with existing deployment pipelines and ensure that quantization-aware training is properly integrated to minimize performance degradation. Collaboration with hardware and software teams is essential to streamline deployment and achieve optimal results.

What are the key skills and qualifications needed to thrive as a Machine Learning Engineer Quantization, and why are they important?

To thrive as a Machine Learning Engineer Quantization, you need a solid background in machine learning, deep learning, and computer science, typically supported by a degree in a related field. Familiarity with quantization techniques, frameworks such as TensorFlow Lite or PyTorch, and experience with hardware accelerators are crucial. Strong problem-solving skills, attention to detail, and effective collaboration set top performers apart. These capabilities are vital for efficiently deploying high-performing models on resource-constrained devices and ensuring scalable, real-world AI solutions.

What does a Machine Learning Engineer Quantization do?

A Machine Learning Engineer specializing in quantization focuses on optimizing machine learning models by reducing their size and computational requirements without significantly sacrificing accuracy. This involves converting model parameters and computations from high-precision formats (like 32-bit floating point) to lower-precision formats (such as 8-bit integers). Quantization enables faster inference, lower memory usage, and allows models to run efficiently on edge devices and mobile platforms. These engineers work closely with data scientists and hardware teams to implement, test, and validate quantized models in production environments.

What is the difference between Machine Learning Engineer Quantization vs Data Scientist?

AspectMachine Learning Engineer QuantizationData Scientist
Required CredentialsBachelor's or master's in CS, ML, or related; certifications in ML or AIBachelor's or master's in statistics, CS, or related; certifications in data analysis or statistics
Work EnvironmentDeveloping optimized ML models, deploying quantized models for efficiencyAnalyzing data, building predictive models, interpreting results
Industry UsageTech companies, AI hardware firms, embedded systemsFinance, healthcare, marketing, research institutions

Machine Learning Engineer Quantization focuses on optimizing ML models for deployment efficiency, often working closely with hardware and software teams. Data Scientists analyze data and build models for insights. While both roles require ML knowledge, quantization engineers specialize in model compression techniques, whereas data scientists focus on data analysis and interpretation.

More about Machine Learning Engineer Quantization jobs
What cities are hiring for Machine Learning Engineer Quantization jobs? Cities with the most Machine Learning Engineer Quantization job openings:
What states have the most Machine Learning Engineer Quantization jobs? States with the most job openings for Machine Learning Engineer Quantization jobs include:
Infographic showing various Machine Learning Engineer Quantization job openings in the United States as of June 2026, with employment types broken down into 100% Part Time. Highlights an 87% Physical, 5% Hybrid, and 8% Remote job distribution, with an average salary of $128,769 per year, or $61.9 per hour.

Hardware Machine Learning Engineer

IMC Inc

Chicago, IL

$200K - $225K/yr

Other

PTO

Posted 29 days ago


Job description

Hardware Machine Learning Engineer

Chicago, United States; New York, United States

We are deploying machine learning directly onto custom hardware – and we want you to help drive it from the ground up. This is an initiative where you'll have the rare opportunity to architect solutions from scratch, influence technical research direction, and see your work drive real impact in one of the most demanding computing environments in the world.

We build the hardware, the software, and the infrastructure, so when you hit a bottleneck, you can fix it - there's no vendor to wait on and no abstraction layer you're not allowed to touch. If you've ever wanted to push the boundaries of what's computationally possible, this role is for you. We're looking for researchers and experienced engineers from any background. Trading experience is a bonus, not a prerequisite.

Your Core Responsibilities
  • Architect and co-design ML models with traders, quant researchers, and software engineers, treating hardware constraints (latency budgets, resource limits, numerical precision) as first-class design inputs
  • Shape our custom hardware roadmap by translating ML model requirements into concrete architectural decisions
  • Work hands-on with hardware engineers to implement, verify, and deploy ML inference solutions from proof-of-concept through production
  • Track and evaluate emerging research in neural architecture search, machine learning systems and quantization methods, and determine what translates to measurable improvements in our systems
Your Skills and Experience
  • Solid understanding of hardware constraints and design trade-offs (e.g., pipelining, resource utilization, fixed-point arithmetic) that shape how ML models can be efficiently mapped onto FPGAs or custom ASICs
  • Experience with hardware fundamentals, whether through VHDL /SystemVerilog development, HLS tools, or ML-to-hardware frameworks like hls4ml, FINN, or Vitis AI
  • Understanding of machine learning fundamentals – neural network architectures, inference optimization, quantization techniques, ML frameworks such as PyTorch/TensorFlow
  • Proficiency in Python, C++, or similar languages for tooling, testing, and simulation
  • Strong communication skills and ability to work collaboratively across disciplines with both technical and non-technical teams
Nice to Have
  • Exposure to ML compiler infrastructure such as MLIR, TVM, XLA, or similar tools for lowering and optimizing models for hardware targets
  • Background in latency-sensitive or resource-constrained systems including high-frequency trading, particle physics data acquisition, real-time signal processing, or similar domains
  • Familiarity with functional verification methodologies (for example SystemVerilog, UVM, Cocotb)
  • Advanced degree (MS or PhD ) in EE, CS, Physics, or related field, or equivalent depth through industry or research experience

The Base Salary range for the role is included below. Base salary is only one component of total compensation; all full-time, permanent positions are eligible for a discretionary bonus and benefits, including paid leave and insurance. Please visit Benefits - US | IMC Trading for more comprehensive information.

Salary Range

$200,000 - $225,000 USD

IMC is a global trading firm powered by a cutting-edge research environment and a world-class technology backbone. Since 1989, we've been a stabilizing force in financial markets, providing essential liquidity upon which market participants depend. Across our offices in the US, Europe, Asia Pacific, and India, our talented quant researchers, engineers, traders, and business operations professionals are united by our uniquely collaborative, high-performance culture, and our commitment to giving back. From entering dynamic new markets to embracing disruptive technologies, and from developing an innovative research environment to diversifying our trading strategies, we dare to continuously innovate and collaborate to succeed.