1

Ml Inference Jobs in Tennessee (NOW HIRING)

Lead ML Ops engineer

Nashville, TN · On-site

$99K - $130K/yr

... training, inference, evaluation, monitoring, retraining, and governance, including generative AI ... ML platform engineering, spanning MLOps, LLMOps, and AIOps. • Ability to define and enforce ...

Lead ML Ops engineer

Nashville, TN · On-site

$99K - $130K/yr

Oversee enterprisescale AI platforms supporting model training, inference, evaluation, monitoring ... Leadershiplevel expertise in AI/ML platform engineering, spanning MLOps, LLMOps, and AIOps.

Senior AI/ML Engineer

Nashville, TN · Remote

$90 - $100/hr

Senior AI/ML Engineer Anywhere Type: Contract-to-Hire Category: Development Industry: Government ... Hands-on experience with LLM orchestration, integration, and vLLM-based inference for document ...

AI Data Engineer - Senior Consultant

Hermitage, TN · Hybrid

$91K - $125K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

AI Engineer Senior Consultant

Nashville, TN · Hybrid

$100K - $138K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

Deliver governed data and features for ML/GenAI (curated datasets, feature pipelines/serving) supporting training and real-time inference, including consistency, caching, backfills, and latency SLOs.

AI Data Engineer - Senior Consultant

Memphis, TN · Hybrid

$101K - $139K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

AI Data Engineer - Senior Consultant

Nashville, TN · Hybrid

$100K - $138K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

... model inference services. You will learn and apply new techniques from open source packages and ... Work spans classical ML through LLM systems. You improve search and retrieval quality using real ...

Experience deploying ML models into production (batch or real-time inference) Background in research-driven or R&D-focused engineering environments Clearance Requirements Applicants must be a U.S.

Senior Machine Learning Engineer

Nashville, TN · On-site

$100K - $138K/yr

Experience deploying ML models into production (batch or real-time inference) Background in research-driven or R&D-focused engineering environments Clearance Requirements Applicants must be a U.S.

Experience deploying ML models into production (batch or real-time inference) Background in research-driven or R&D-focused engineering environments Clearance Requirements Applicants must be a U.S.

AI and Data Science Engineer III

Nashville, TN · On-site +1

$110K - $132K/yr

... science/ML, security, and platform engineering to deliver reliable, secure, and scalable AI ... inference, and LLM applications using Claude-, GPT/Codex-, and Gemini-class models, and more ...

next page

Showing results 1-20

Ml Inference information

What is ML inference?

ML inference refers to the process of using a trained machine learning model to make predictions or decisions based on new data. After a model has been trained on historical data, inference is the phase where that model is deployed and used in real-world applications, such as recognizing speech, detecting objects in images, or recommending products. The focus in ML inference is on speed, efficiency, and scalability to ensure quick predictions, often in real time. This process is critical for practical applications like mobile apps, web services, and embedded systems. Optimizing inference involves reducing latency, memory usage, and computational requirements.

What is the difference between Ml Inference vs Data Scientist?

AspectML InferenceData Scientist
Required CredentialsKnowledge of machine learning models, programming skillsDegree in data science, statistics, or related fields
Work EnvironmentDeploying models in production, real-time data processingData analysis, model development, research
Industry UsageAI product deployment, software companiesResearch institutions, tech firms, consulting

ML Inference focuses on deploying trained models to make predictions on new data, often in real-time. Data Scientists develop and analyze models, working primarily in research and development. While both roles require understanding of machine learning, ML Inference emphasizes deployment and operationalization, whereas Data Scientists focus on model creation and analysis.

What are some common challenges faced by ML Inference Engineers when deploying models to production?

ML Inference Engineers often encounter challenges such as optimizing model latency and throughput to meet production requirements, ensuring compatibility with diverse hardware environments, and managing model versioning and updates without disrupting service. Additionally, balancing resource utilization and inference accuracy while monitoring real-time performance metrics is crucial. Collaboration with data scientists, DevOps, and software engineers is typically essential to streamline deployment and maintain robust, scalable inference pipelines.

What are the key skills and qualifications needed to thrive in ML Inference, and why are they important?

To thrive in ML Inference, you need a solid background in machine learning principles, programming (Python or C++), and experience with deploying models at scale, often supported by a degree in computer science or a related field. Familiarity with frameworks and tools such as TensorFlow, PyTorch, ONNX, and cloud platforms like AWS SageMaker or Google AI Platform is typically required. Strong problem-solving skills, attention to detail, and effective communication are crucial soft skills for collaborating with multidisciplinary teams and optimizing model performance. These skills ensure efficient, scalable, and reliable deployment of machine learning solutions in real-world applications.
What are popular job titles related to Ml Inference jobs in Tennessee? For Ml Inference jobs in Tennessee, the most frequently searched job titles are:
What job categories do people searching Ml Inference jobs in Tennessee look for? The top searched job categories for Ml Inference jobs in Tennessee are:
What cities in Tennessee are hiring for Ml Inference jobs? Cities in Tennessee with the most Ml Inference job openings:

Lead ML Ops engineer

CLA (CliftonLarsonAllen)

Nashville, TN • On-site

$99K - $130K/yr

Full-time

Posted 18 days ago


Job description

Job Summary:
CLA (CliftonLarsonAllen) is a top 10 national professional services firm dedicated to creating opportunities for clients and communities. They are seeking an experienced Lead Machine Learning Operations Engineer to manage a team, oversee the machine-learning strategy, and ensure alignment with business goals.
Responsibilities:
• Define and execute an enterprise AI/ML platform strategy, encompassing MLOps, LLMOps, and AIOps, and build reusable frameworks and standards adopted across multiple projects and business units.
• Oversee enterprise‑scale AI platforms supporting model training, inference, evaluation, monitoring, retraining, and governance, including generative AI systems.
• Align AI and MLOps initiatives with business objectives, ensuring platforms and pipelines meet scalability, performance, security, regulatory, and cost requirements, including responsible and ethical AI considerations.
• Implement and enforce best practices for model and prompt versioning, monitoring, retraining, and automated workflows, ensuring consistent and reliable AI operations.
• Lead teams delivering shared AI infrastructure, tooling, and platforms, providing day‑to‑day leadership through coaching, development, and performance management.
• Ensure platform reliability and operational excellence by overseeing escalated issue resolution, maintaining high‑quality documentation, and driving continuous improvement.
• Track and evaluate industry trends in AI platforms, LLM ecosystems, and AI operations, translating insights into roadmap decisions and platform evolution.
Qualifications:
Required:
• 6 years of relevant experience required.
• Bachelor's degree is required. Combination of relevant experience, education, and training may be accepted in lieu of degree.
• Advanced proficiency in Python and architectural mastery of object‑oriented design across dynamically typed languages.
• Broad experience integrating and governing multi‑language systems, including Python, JavaScript/TypeScript, and enterprise platforms (e.g., .NET).
• Leadership‑level expertise in AI/ML platform engineering, spanning MLOps, LLMOps, and AIOps.
• Ability to define and enforce enterprise standards for AI model lifecycle management, monitoring, reliability, and cost control.
• Deep understanding of AI system observability, including drift detection, evaluation frameworks, and incident response.
• Strong experience with cloud architecture, security, compliance, and enterprise‑scale deployments.
• Proven ability to guide teams in technical decision‑making and platform strategy.
Preferred:
• Experience in MLOps, DevOps, or related fields, with a focus on enterprise-level solutions preferred.
• Supervisory experience preferred.
• Degree in computer science, data science, or related field preferred.
Company:
CLA exists to create opportunities for our clients, our people, and our communities through industry-focused wealth advisory, outsourcing, audit, tax, and consulting services. Founded in 1960, the company is headquartered in Minneapolis, USA, with a team of 5001-10000 employees. The company is currently Late Stage.