1

Machine Learning Engineer Quantization Jobs in Raleigh, NC

As a Machine Learning Engineer, you will help build and operate production systems that power fraud detection and risk-related products. You'll work closely with data scientists and engineers to ...

The Machine Learning Engineer will develop software and machine learning algorithms to address real-world customer issues and will have opportunities to present their work to high-level customers.

Machine Learning Engineer About CoVar CoVar is a small AI/ML R&D software company in Durham, NC, that uses artificial intelligence to solve problems that matter. We develop AI/ML tools to help the ...

We are seeking a talented and innovative Machine Learning Engineer to join our dynamic team. In this role, you will be responsible for designing and developing machine learning prototypes, as well as ...

Machine Learning Engineer

Raleigh, NC · On-site

$96K - $137K/yr

We are seeking a talented and innovative Machine Learning Engineer to join our dynamic team. In this role, you will be responsible for designing and developing machine learning prototypes, as well as ...

We are seeking a talented and innovative Machine Learning Engineer to join our dynamic team. In this role, you will be responsible for designing and developing machine learning prototypes, as well as ...

We are seeking a talented and innovative Machine Learning Engineer to join our dynamic team. In this role, you will be responsible for designing and developing machine learning prototypes, as well as ...

Sr. Machine Learning Engineer Duration: 12 -24 Months Location: Merrimack, NH/ Smithfield, RI/ Westlake, TX/ Durham, NC/ Covington, KY/ Jersey City, NJ/ Boston, MA Candidate should be local or ...

... machine learning, Bayesian models, etc. • B.S., preferably M.S. or Ph.D in engineering, math, computer science, or related field • Excellent technical communication skills • Ability to work in ...

Sr Machine Learning Engineer

Raleigh, NC · On-site

$101K - $139K/yr

RIT Solutions, Inc. is seeking a Senior Machine Learning Engineer to work on production machine learning at scale. The role involves deploying LLM/GenAI/RAG systems and requires expertise in cloud ...

New

Must-Have Skills 3+ years of ML engineering experience -- model training, fine-tuning, or post-training pipelines in research or production Strong Python and deep learning proficiency (PyTorch ...

Must-Have Skills 3+ years of ML engineering experience -- model training, fine-tuning, or post-training pipelines in research or production Strong Python and deep learning proficiency (PyTorch ...

We are hiring a Machine Learning Engineer to own the full agentic stack - from LLM orchestration, tool use, memory, and production deployment on devices. Location: Chicago, IL is highly preferred ...

We are seeking a Machine Learning Engineer Lead to design, build, and operate scalable AI/ML systems and agentic architectures that support next-generation legal research and analytics products. This ...

We are seeking a Machine Learning Engineer Lead to design, build, and operate scalable AI/ML systems and agentic architectures that support next-generation legal research and analytics products. This ...

We are seeking a Machine Learning Engineer Lead to design, build, and operate scalable AI/ML systems and agentic architectures that support next-generation legal research and analytics products. This ...

next page

Showing results 1-20

Machine Learning Engineer Quantization information

See Raleigh, NC salary details

$30.6K

$125.2K

$188.1K

How much do machine learning engineer quantization jobs pay per year?

As of Jun 20, 2026, the average yearly pay for machine learning engineer quantization in Raleigh, NC is $125,174.00, according to ZipRecruiter salary data. Most workers in this role earn between $98,700.00 and $150,700.00 per year, depending on experience, location, and employer.

What are some common challenges Machine Learning Engineers face when implementing quantization techniques in production models?

Machine Learning Engineers working on quantization often encounter challenges such as balancing reduced model size and computational efficiency with maintaining acceptable accuracy levels. Adapting quantization methods to different hardware platforms can also require significant testing and optimization. Additionally, engineers must frequently address compatibility issues with existing deployment pipelines and ensure that quantization-aware training is properly integrated to minimize performance degradation. Collaboration with hardware and software teams is essential to streamline deployment and achieve optimal results.

What are the key skills and qualifications needed to thrive as a Machine Learning Engineer Quantization, and why are they important?

To thrive as a Machine Learning Engineer Quantization, you need a solid background in machine learning, deep learning, and computer science, typically supported by a degree in a related field. Familiarity with quantization techniques, frameworks such as TensorFlow Lite or PyTorch, and experience with hardware accelerators are crucial. Strong problem-solving skills, attention to detail, and effective collaboration set top performers apart. These capabilities are vital for efficiently deploying high-performing models on resource-constrained devices and ensuring scalable, real-world AI solutions.

What does a Machine Learning Engineer Quantization do?

A Machine Learning Engineer specializing in quantization focuses on optimizing machine learning models by reducing their size and computational requirements without significantly sacrificing accuracy. This involves converting model parameters and computations from high-precision formats (like 32-bit floating point) to lower-precision formats (such as 8-bit integers). Quantization enables faster inference, lower memory usage, and allows models to run efficiently on edge devices and mobile platforms. These engineers work closely with data scientists and hardware teams to implement, test, and validate quantized models in production environments.

What is the difference between Machine Learning Engineer Quantization vs Data Scientist?

AspectMachine Learning Engineer QuantizationData Scientist
Required CredentialsBachelor's or master's in CS, ML, or related; certifications in ML or AIBachelor's or master's in statistics, CS, or related; certifications in data analysis or statistics
Work EnvironmentDeveloping optimized ML models, deploying quantized models for efficiencyAnalyzing data, building predictive models, interpreting results
Industry UsageTech companies, AI hardware firms, embedded systemsFinance, healthcare, marketing, research institutions

Machine Learning Engineer Quantization focuses on optimizing ML models for deployment efficiency, often working closely with hardware and software teams. Data Scientists analyze data and build models for insights. While both roles require ML knowledge, quantization engineers specialize in model compression techniques, whereas data scientists focus on data analysis and interpretation.

What are popular job titles related to Machine Learning Engineer Quantization jobs in Raleigh, NC? For Machine Learning Engineer Quantization jobs in Raleigh, NC, the most frequently searched job titles are:
What cities near Raleigh, NC are hiring for Machine Learning Engineer Quantization jobs? Cities near Raleigh, NC with the most Machine Learning Engineer Quantization job openings:

Other

Posted yesterday


Job description

The Risk & Fraud team helps customers take a proactive stance against fraud while managing the risks inherent to their business. We build and enhance products that evolve with the ever-changing fraud landscape, delivering tangible value to customers. Our solutions allow financial institutions to focus more of their time and energy on serving their customers and communities.

As a Machine Learning Engineer, you will help build and operate production systems that power fraud detection and risk-related products. You’ll work closely with data scientists and engineers to bring models into production, ensuring they are reliable, scalable, and maintainable.

You’ll gain hands-on experience working across model development, evaluation, deployment, monitoring, and ongoing improvements. This is an applied engineering role — the software you build will solve real-world problems and must be production-ready, reliable, and testable.

A Typical Day

Your Key Responsibilities

  • Build and maintain systems and pipelines that support training, evaluation, and inference for machine learning models

  • Contribute to deploying machine learning models into production environments and ensuring they run reliably at scale

  • Write clean, maintainable, and well-tested code following production engineering best practices

  • Support monitoring and troubleshooting production ML systems, including data pipelines and model performance

  • Collaborate with data scientists and engineers to productionalize models and integrate them into scalable applications

  • Help improve the reliability, scalability, and performance of ML systems over time

  • Contribute to improving tooling and infrastructure that supports the ML development lifecycle

You Are More Likely to Excel If You:

  • Enjoy autonomy in your work and take ownership of team goals while balancing speed with long-term impact

  • Have empathy for end users and measure success through both customer value and technical quality

  • Are enthusiastic about machine learning, engineering excellence, and continuous professional development

Bring Your Passion, Do What You Love. Here’s What We’re Looking For

Must-Haves

  • Bachelor’s degree in a relevant field with 2+ years of related experience, or equivalent practical experience

  • Proficiency in Python

  • Experience writing clean, maintainable code and using version control tools such as Git

  • Experience with machine learning frameworks such as PyTorch, TensorFlow, or scikit-learn

Nice to Have

  • Experience building end-to-end ML systems, including data pipelines, model training, deployment, and monitoring

  • Experience deploying or integrating machine learning models into applications

  • Experience building APIs, backend services, or working with distributed systems

  • Familiarity with cloud platforms such as AWS, GCP, or Azure

  • Exposure to MLOps concepts such as CI/CD and model monitoring

  • Experience working with large datasets or data processing frameworks

  • Experience with additional programming languages such as TypeScript