1

Machine Learning Engineer Quantization Jobs in Tennessee

Collaborate closely with the MLOps, product teams, business stakeholders, machine learning ... Model quantization for LLMs (GPTQ, AWQ, bitsandbytes); GPU memory optimization techniques (tensor ...

Senior AI/ML Engineer

Nashville, TN · Remote

$90 - $100/hr

Remote Our client seeks a Senior AI/ML Engineer to design and deliver cloud-native machine learning solutions on AWS. The role includes LLM orchestration, RAG pipelines, vector database integration ...

... machine learning models and large language models. • Conduct research to provide technical ... & DevOps teams, Data scientists, Machine Learning & GenAI Engineers, and Business teams to pilot ...

AI Solutions Architect

Nashville, TN · On-site

$60.75 - $80.25/hr

Certifications in artificial intelligence, machine learning, or cloud platforms, such as AWS Certified Machine Learning - Specialty, Google Cloud Professional Machine Learning Engineer, Microsoft ...

Senior AI Engineer - SFL Scientific

Nashville, TN · On-site

$100K - $138K/yr

Work You'll Do As a Senior AI Engineer, you'll work cross-functionally with data scientists, machine learning engineers, project managers, and industry experts to develop robust AI infrastructure and ...

Generative AI Engineer III

Nashville, TN · On-site

$55.50 - $74.50/hr

As a Generative AI Engineer III, you will design and deploy machine learning and AI solutions while collaborating with various stakeholders to translate requirements into technical solutions.

1. Programming languages 2. Data modeling & engineering 3. Big data analysis 4. Machine learning models 5. Mathematics and statistics.

AI Engineers candidates should hold a Bachelor's or Master's Degree in Computer Science, Mathematics, Statistics, Robotics, Artificial Intelligence, Machine Learning, Data Science, or related ...

next page

Showing results 1-20

Machine Learning Engineer Quantization information

What are some common challenges Machine Learning Engineers face when implementing quantization techniques in production models?

Machine Learning Engineers working on quantization often encounter challenges such as balancing reduced model size and computational efficiency with maintaining acceptable accuracy levels. Adapting quantization methods to different hardware platforms can also require significant testing and optimization. Additionally, engineers must frequently address compatibility issues with existing deployment pipelines and ensure that quantization-aware training is properly integrated to minimize performance degradation. Collaboration with hardware and software teams is essential to streamline deployment and achieve optimal results.

What are the key skills and qualifications needed to thrive as a Machine Learning Engineer Quantization, and why are they important?

To thrive as a Machine Learning Engineer Quantization, you need a solid background in machine learning, deep learning, and computer science, typically supported by a degree in a related field. Familiarity with quantization techniques, frameworks such as TensorFlow Lite or PyTorch, and experience with hardware accelerators are crucial. Strong problem-solving skills, attention to detail, and effective collaboration set top performers apart. These capabilities are vital for efficiently deploying high-performing models on resource-constrained devices and ensuring scalable, real-world AI solutions.

What does a Machine Learning Engineer Quantization do?

A Machine Learning Engineer specializing in quantization focuses on optimizing machine learning models by reducing their size and computational requirements without significantly sacrificing accuracy. This involves converting model parameters and computations from high-precision formats (like 32-bit floating point) to lower-precision formats (such as 8-bit integers). Quantization enables faster inference, lower memory usage, and allows models to run efficiently on edge devices and mobile platforms. These engineers work closely with data scientists and hardware teams to implement, test, and validate quantized models in production environments.

What is the difference between Machine Learning Engineer Quantization vs Data Scientist?

AspectMachine Learning Engineer QuantizationData Scientist
Required CredentialsBachelor's or master's in CS, ML, or related; certifications in ML or AIBachelor's or master's in statistics, CS, or related; certifications in data analysis or statistics
Work EnvironmentDeveloping optimized ML models, deploying quantized models for efficiencyAnalyzing data, building predictive models, interpreting results
Industry UsageTech companies, AI hardware firms, embedded systemsFinance, healthcare, marketing, research institutions

Machine Learning Engineer Quantization focuses on optimizing ML models for deployment efficiency, often working closely with hardware and software teams. Data Scientists analyze data and build models for insights. While both roles require ML knowledge, quantization engineers specialize in model compression techniques, whereas data scientists focus on data analysis and interpretation.

What cities in Tennessee are hiring for Machine Learning Engineer Quantization jobs? Cities in Tennessee with the most Machine Learning Engineer Quantization job openings:
Senior Data Scientist / AI Engineer (3878)

Senior Data Scientist / AI Engineer (3878)

Navarro Inc.

Oak Ridge, TN • Remote

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 8 days ago


Job description

Navarro Research and Engineering is recruiting a Senior Data Scientist / AI Engineer (3878). This is a remote position. Citizenship is required.

Navarro Research & Engineering is an award-winning federal contractor dedicated to partnering with clients to advance clean energy and deliver effective solutions for complex challenges in the nuclear and environmental fields. Joining Navarro means being a part of an exceptional team committed to quality and safety while also looking for innovative strategies to create value for the client's success. Headquartered in Oak Ridge, Tennessee, Navarro has active programs in place across the nation for DOE/NNSA, NASA, and the Department of Defense.

We are seeking a Senior Data Scientist / AI Engineer to design, develop, deploy, and maintain machine learning and generative AI solutions within a government environment. This role will support both locally hosted AI systems and cloud-based AI services within Microsoft Azure Government, including Azure AI Foundry and related Azure AI services.

The ideal candidate has hands-on experience building production AI systems, deploying and operating open-source large language models (LLMs), implementing secure MLOps practices, and developing AI applications that meet government security and compliance requirements.

Key Responsibilities

AI/ML Solution Development

  • Design, build, train, evaluate, and deploy machine learning and generative AI solutions.
  • Develop and maintain predictive analytics, NLP, computer vision, and LLM-based applications.
  • Implement Retrieval-Augmented Generation (RAG), agentic workflows, and knowledge management solutions.
  • Evaluate commercial, open-source, and custom AI models for mission-specific use cases.

Local and On-Premises AI Infrastructure

  • Deploy and operate local/open-source models in secure environments.
  • Configure and optimize inference environments using GPUs and containerized deployments.
  • Manage model serving platforms and inference frameworks.
  • Implement monitoring, performance tuning, and lifecycle management for locally hosted models.
  • Support disconnected, restricted, or air-gapped operational environments.

Azure Government AI Platforms

  • Design and deploy AI solutions within Azure Government.
  • Build and manage solutions using Azure AI Foundry, Azure OpenAI, Azure Machine Learning, Azure Kubernetes Service (AKS), and related services.
  • Implement secure model deployment, monitoring, and governance controls.
  • Integrate AI services with enterprise systems and data platforms.

Data Engineering and Analytics

  • Develop data pipelines supporting AI and analytics workloads.
  • Perform data exploration, feature engineering, model evaluation, and performance analysis.
  • Work with structured, semi-structured, and unstructured data sources.
  • Ensure data quality, lineage, and governance standards are maintained.

MLOps and DevSecOps

  • Implement CI/CD pipelines for machine learning and AI workloads.
  • Develop automated testing, validation, and deployment processes.
  • Establish model monitoring, drift detection, and performance reporting.
  • Apply security controls and compliance requirements throughout the AI lifecycle.

Stakeholder Support

  • Collaborate with mission owners, analysts, engineers, cybersecurity personnel, and leadership.
  • Translate operational requirements into technical AI solutions.
  • Prepare technical documentation, architecture diagrams, and presentations.

Requirements

Education

  • Bachelor's degree in Data Science, Computer Science, Engineering, Mathematics, Statistics, or related field.
  • Master's degree preferred.

Professional Experience

  • 5+ years of experience in data science, machine learning, AI engineering, or related fields.
  • 2+ years of experience deploying and operating production AI/ML systems.
  • Experience supporting secure government, defense, or regulated environments preferred.

Technical Skills

Machine Learning & Data Science

  • Strong knowledge of supervised and unsupervised learning techniques.
  • Experience with model development, evaluation, and optimization.
  • Statistical analysis and experimental design experience.
  • Proficiency in Python and common ML frameworks.

Generative AI & LLMs

  • Experience deploying and operating open-source LLMs.
  • Experience with:
    • Llama family models
    • Mistral models
    • Hugging Face models
  • Knowledge of:
    • RAG architectures
    • Agent frameworks
    • Prompt engineering
    • Model evaluation methodologies
    • Fine-tuning approaches

Azure Government and Cloud AI

  • Experience with:
    • Azure AI Foundry
    • Azure Machine Learning
    • Azure OpenAI
    • Azure Kubernetes Service (AKS)
    • Azure Storage and Data Services
    • Azure Identity and Access Management
  • Experience deploying AI workloads in Azure Government environments preferred.

Local AI Infrastructure

  • Experience with:
    • Docker
    • Kubernetes
    • GPU-based inference systems
    • vLLM, Ollama, TGI, or similar inference platforms
    • Linux administration
  • Understanding of model quantization and performance optimization techniques.

Data Platforms

  • SQL and relational databases
  • Data warehousing concepts
  • ETL/ELT pipeline development
  • Vector databases and semantic search platforms

Software Engineering

  • Git-based development workflows
  • REST APIs and microservices
  • CI/CD pipelines
  • Infrastructure-as-Code concepts

Preferred Qualifications

  • Active security clearance or ability to obtain one.
  • Experience with NIST AI Risk Management Framework.
  • Experience with FedRAMP, RMF, or government cybersecurity compliance frameworks.
  • Experience supporting classified or controlled environments.
  • Azure certifications.
  • Experience with distributed GPU environments.
  • Experience implementing AI governance and responsible AI controls.

Desired Technologies

Candidates should have experience with several of the following:

Programming

  • Python
  • SQL
  • PowerShell
  • Bash

AI/ML Frameworks

  • PyTorch
  • TensorFlow
  • Scikit-learn
  • Hugging Face Transformers

LLM Ecosystem

  • LangChain
  • LlamaIndex
  • Semantic Kernel
  • OpenAI APIs
  • Azure OpenAI APIs

Infrastructure

  • Docker
  • Kubernetes
  • AKS
  • Linux
  • GitHub Actions
  • Azure DevOps

Databases

  • PostgreSQL
  • SQL Server
  • Vector databases
  • Azure Data Services

Security Requirements

  • U.S. citizenship required.
  • Ability to pass government background investigation.
  • Ability to comply with all applicable government security and information assurance requirements.

Success Criteria

Within the first 12 months, the selected candidate will:

  • Deploy and support production AI solutions in Azure Government.
  • Establish repeatable MLOps processes for AI model deployment and maintenance.
  • Deploy and manage secure local/open-source LLM environments.
  • Develop mission-focused AI applications leveraging RAG and agentic workflows.
  • Improve operational efficiency through automation and advanced analytics.

Due to the nature of the government contract requirements and/or clearances requirements, US citizenship is required.

Navarro is an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, race, religion, color, national origin, age, disability, veteran's status, or any classification protected by applicable state or local law.

EEO Employer/Vet/Disabled

Benefits

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k,)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation & Public Holidays)
  • Short Term & Long Term Disability