1

Ml Inference Jobs in Kentucky (NOW HIRING)

AI Data Engineer - Senior Consultant

Louisville, KY · Hybrid

$100K - $137K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

Understanding of AI/ML fundamentals (model training, inference, adversarial ML, secure data pipelines).Experience with ML platforms (TensorFlow, PyTorch, Scikit-learn) and MLOps tools (MLflow ...

Operationalize ML models using Python, PySpark, and Kafka for both batch and real-time inference. * Leverage advanced orchestration frameworks such as LangChain, LangGraph, CrewAI, and Vertex AI ...

Data Engineer (Remote)

Louisville, KY · On-site +1

$104K - $125K/yr

Enable data for AI/ML use cases by preparing feature-rich datasets, supporting feature engineering, and ensuring data consistency for model training and inference * Support deployment and ...

Data Engineer (Remote)

Louisville, KY · On-site +1

$104K - $125K/yr

Enable data for AI/ML use cases by preparing feature-rich datasets, supporting feature engineering, and ensuring data consistency for model training and inference * Support deployment and ...

Ml Inference information

What is ML inference?

ML inference refers to the process of using a trained machine learning model to make predictions or decisions based on new data. After a model has been trained on historical data, inference is the phase where that model is deployed and used in real-world applications, such as recognizing speech, detecting objects in images, or recommending products. The focus in ML inference is on speed, efficiency, and scalability to ensure quick predictions, often in real time. This process is critical for practical applications like mobile apps, web services, and embedded systems. Optimizing inference involves reducing latency, memory usage, and computational requirements.

What is the difference between Ml Inference vs Data Scientist?

AspectML InferenceData Scientist
Required CredentialsKnowledge of machine learning models, programming skillsDegree in data science, statistics, or related fields
Work EnvironmentDeploying models in production, real-time data processingData analysis, model development, research
Industry UsageAI product deployment, software companiesResearch institutions, tech firms, consulting

ML Inference focuses on deploying trained models to make predictions on new data, often in real-time. Data Scientists develop and analyze models, working primarily in research and development. While both roles require understanding of machine learning, ML Inference emphasizes deployment and operationalization, whereas Data Scientists focus on model creation and analysis.

Which 3 jobs will survive AI?

For ML Inference roles, jobs that require complex problem-solving, creativity, and emotional intelligence are more likely to persist, such as data scientists, AI ethics specialists, and machine learning engineers. These roles involve tasks that are difficult to automate and often require specialized skills, domain knowledge, and critical thinking. Continuous learning and expertise in AI tools and programming languages like Python or TensorFlow can also enhance job security in this field.

What engineers make $500,000?

Senior machine learning engineers with extensive experience, specialized skills in deep learning, and strong industry demand can earn $500,000 or more annually, especially in high-cost-of-living areas or within top tech companies. Achieving this level typically requires advanced degrees, certifications, and a proven track record of impactful projects.

What is a $900,000 AI job?

A $900,000 AI job typically refers to high-level roles in artificial intelligence, such as senior machine learning engineers or AI research directors, often requiring advanced skills in deep learning, data science, and experience with tools like TensorFlow or PyTorch. These positions usually involve leadership responsibilities, strategic planning, and may require multiple years of specialized experience or advanced degrees.

Is ML a high paying job?

Machine Learning (ML) inference roles are generally well-paid due to the specialized skills required, such as knowledge of algorithms, programming, and data analysis. Salaries vary based on experience, location, and industry, but they tend to be higher than average for tech positions. Advanced roles often require proficiency with tools like TensorFlow or PyTorch and may include certifications or advanced degrees.

What are some common challenges faced by ML Inference Engineers when deploying models to production?

ML Inference Engineers often encounter challenges such as optimizing model latency and throughput to meet production requirements, ensuring compatibility with diverse hardware environments, and managing model versioning and updates without disrupting service. Additionally, balancing resource utilization and inference accuracy while monitoring real-time performance metrics is crucial. Collaboration with data scientists, DevOps, and software engineers is typically essential to streamline deployment and maintain robust, scalable inference pipelines.

What are the key skills and qualifications needed to thrive in ML Inference, and why are they important?

To thrive in ML Inference, you need a solid background in machine learning principles, programming (Python or C++), and experience with deploying models at scale, often supported by a degree in computer science or a related field. Familiarity with frameworks and tools such as TensorFlow, PyTorch, ONNX, and cloud platforms like AWS SageMaker or Google AI Platform is typically required. Strong problem-solving skills, attention to detail, and effective communication are crucial soft skills for collaborating with multidisciplinary teams and optimizing model performance. These skills ensure efficient, scalable, and reliable deployment of machine learning solutions in real-world applications.
What are popular job titles related to Ml Inference jobs in Kentucky? For Ml Inference jobs in Kentucky, the most frequently searched job titles are:
What cities in Kentucky are hiring for Ml Inference jobs? Cities in Kentucky with the most Ml Inference job openings:
AI Data Engineer - Senior Consultant

AI Data Engineer - Senior Consultant

Deloitte

Louisville, KY • Hybrid

$100K - $137K/yr

Other

Posted 7 days ago


Deloitte rating

8.1

Company rating: 8.1 out of 10

Based on 86 frontline employees who took The Breakroom Quiz

58th of 139 rated financial services


Job description

AI Engineer Senior Consultant

Our Deloitte Human Capital team transforms technology platforms, drives innovation, and helps make a significant impact on our clients' success. We are hiring an AI Engineer to build and operate the data, features, and GenAI foundations that power Human Capital AI products and analytics. You will work with an AI Data Engineer (data ingestion, curation, governance, platform foundations) and a Lead AI Solutions Architect (end-to-end solution architecture, integration patterns, non-functional requirements), partnering closely with product, data science/ML, security, and platform engineering to deliver reliable, secure, and scalable AI solutions.

This role is hands-on and delivery-oriented: you will ship production pipelines and services that support model training, real-time inference, and LLM applications using Claude-, GPT/Codex-, and Gemini-class models, and more implemented with strong governance, observability, and cost/performance discipline.

Recruiting for this role ends on August 30, 2026

Work You'll Do:

As an AI Engineer Senior Consultant, you will design, build, and run the trusted, governed data + feature + retrieval layer used by AI/ML and GenAI solutions. You will deliver reproducible datasets and features, operationalize quality and lineage, and enable secure consumption patterns for both predictive ML and LLM-based experiences.

Key Responsibilities:

  • Partner with the Lead AI Solutions Architect and AI Data Engineer to translate Human Capital product needs into secure, scalable technical designs and delivered solutions (APIs, services, pipelines, containers/serverless) meeting availability, performance, and security expectations.
  • Build and operationalize LLM-enabled capabilities (e.g., copilots, HR knowledge assistants, summarization, policy Q&A) using Claude/GPT(Codex)/Gemini, including secure endpoints, tool/function calling, and reusable prompt/context patterns.
  • Implement LLM application patterns including RAG, document ingestion/chunking, embeddings, vector/hybrid search, and retrieval/evaluation telemetry.
  • Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills).
  • Implement safety, privacy, and access controls (PII handling, prompt-injection defenses, content filtering, policy-based access) with security and risk stakeholders.
  • Establish data/model reliability and cost-performance discipline (data quality, schema evolution, lineage/metadata, monitoring; right-sizing, query tuning, LLM token/cost telemetry).
  • Contribute to MLOps/LLMOps and production operations (versioning, reproducibility, CI/CD, automated testing, observability, incident response); support design reviews, deployment readiness, and runbooks.

The Team

HC Forward is a dedicated innovation partner accelerating the future of Human Capital by building market-aligned products, platforms, and services that apply AI, data, and engineering to modernize HR experiences and outcomes.

Required Qualifications:

  • Bachelor's degree in a STEM field (e.g., Computer Science, Engineering, Statistics, Data Science)
  • 4+ years building and delivering LLM/GenAI solutions with Claude/GPT(Codex)/Gemini-class models, including prompt/context design, tool/function calling, evaluation, and production integration.
  • 4+ years implementing RAG/retrieval (document processing, embeddings, vector/hybrid search) with enterprise governance controls.
  • 4+ years of modern data & AI engineering, including data modeling, batch/streaming pipelines, structured/unstructured processing, and feature engineering/serving fundamentals.
  • 4+ years building production, real-time inference services (API design, latency/performance, reliability patterns).
  • 4+ years leading platform/integration engineering across enterprise systems; strong API/integration experience (REST, GraphQL, event-driven, microservices, middleware).
  • 4+ years DevOps/DevSecOps experience (CI/CD, IaC such as Terraform/CloudFormation, Docker/Kubernetes, observability/monitoring).
  • 4+ years leading security/compliance efforts; familiarity with enterprise security controls (IAM, encryption, secrets, audit logging) and data/privacy (PII, retention, access controls); SOC 2/GDPR/HIPAA exposure a plus.
  • Ability to travel 0-25%, on average, based on client and project needs.
  • Limited immigration sponsorship may be available

Preferred Qualifications:

  • Advanced degree (MS/PhD) and/or relevant certifications (cloud and AI/ML).
  • 4+ years of experience with Human Capital platforms and integrations (e.g., Workday, SAP SuccessFactors, Oracle HCM, Salesforce) and HR data domains.
  • 4+ years of experience operationalizing LLMOps/MLOps capabilities (evaluation, monitoring, governance workflows, model/prompt/version management).
  • 4+ years of cloud experience on AWS/Azure/GCP (one or more), including managed data platforms and scalable compute patterns.
  • 4+ years of experience with structured problem solving, translating business needs into requirements, acceptance criteria, and shippable increments.
  • 4+ years of experience with stakeholder communication: ability to explain AI/GenAI trade-offs (quality vs. latency vs. cost vs. risk) and document decisions.
  • 4+ years of experience collaborating across product, data science/ML, data engineering, platform, and security.
  • 4+ years of experience with treat testing, monitoring, and operational readiness as core responsibilities.
  • 4+ years of experience with ethics and privacy awareness being able to recognize consent/PII/bias boundaries and escalate appropriately.

The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $113,100 to $208,300.

You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.

Possible Locations: Atlanta, Austin, Baltimore, Boston, Charlotte, Chicago, Cincinnati, Cleveland, Columbus, Costa Mesa, Dallas, Denver, Detroit, Hartford, Houston, Indianapolis, Jacksonville, Kansas City, Las Vegas, Los Angeles, McLean, Miami, Milwaukee, Nashville, New Orleans, New York, Philadelphia, Pittsburgh, Portland, Raleigh, Richmond, Sacramento, San Antonio, San Diego, San Francisco, San Jose, Seattle, St. Louis, Stamford, Tampa, Tempe

Information for applicants with a need for accommodation: https://www2.deloitte.com/us/en/pages/careers/articles/join-deloitte-assistance-for-disabled-applicants.html

For more information about Human Capital, visit our landing page at: https://www2.deloitte.com/us/en/pages/careers/articles/join-deloitte-human-capital-consulting-jobs.html

#HCFY26 #IIOFY26

Qualifications:

AI Engineer Senior Consultant

Our Deloitte Human Capital team transforms technology platforms, drives innovation, and helps make a significant impact on our clients' success. We are hiring an AI Engineer to build and operate the data, features, and GenAI foundations that power Human Capital AI products and analytics. You will work with an AI Data Engineer (data ingestion, curation, governance, platform foundations) and a Lead AI Solutions Architect (end-to-end solution architecture, integration patterns, non-functional requirements), partnering closely with product, data science/ML, security, and platform engineering to deliver reliable, secure, and scalable AI solutions.

This role is hands-on and delivery-oriented: you will ship production pipelines and services that support model training, real-time inference, and LLM applications using Claude-, GPT/Codex-, and Gemini-class models, and more implemented with strong governance, observability, and cost/performance discipline.

Recruiting for this role ends on August 30, 2026

Work You'll Do:

As an AI Engineer Senior Consultant, you will design, build, and run the trusted, governed data + feature + retrieval layer used by AI/ML and GenAI solutions. You will deliver reproducible datasets and features, operationalize quality and lineage, and enable secure consumption patterns for both predictive ML and LLM-based experiences.

Key Responsibilities:

  • Partner with the Lead AI Solutions Architect and AI Data Engineer to translate Human Capital product needs into secure, scalable technical designs and delivered solutions (APIs, services, pipelines, containers/serverless) meeting availability, performance, and security expectations.
  • Build and operationalize LLM-enabled capabilities (e.g., copilots, HR knowledge assistants, summarization, policy Q&A) using Claude/GPT(Codex)/Gemini, including secure endpoints, tool/function calling, and reusable prompt/context patterns.
  • Implement LLM application patterns including RAG, document ingestion/chunking, embeddings, vector/hybrid search, and retrieval/evaluation telemetry.
  • Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills).
  • Implement safety, privacy, and access controls (PII handling, prompt-injection defenses, content filtering, policy-based access) with security and risk stakeholders.
  • Establish data/model reliability and cost-performance discipline (data quality, schema evolution, lineage/metadata, monitoring; right-sizing, query tuning, LLM token/cost telemetry).
  • Contribute to MLOps/LLMOps and production operations (versioning, reproducibility, CI/CD, automated testing, observability, incident response); support design reviews, deployment readiness, and runbooks.

The Team

HC Forward is a dedicated innovation partner accelerating the future of Human Capital by building market-aligned products, platforms, and services that apply AI, data, and engineering to modernize HR experiences and outcomes.

Required Qualifications:

  • Bachelor's degree in a STEM field (e.g., Computer Science, Engineering, Statistics, Data Science)
  • 4+ years building and delivering LLM/GenAI solutions with Claude/GPT(Codex)/Gemini-class models, including prompt/context design, tool/function calling, evaluation, and production integration.
  • 4+ years implementing RAG/retrieval (document processing, embeddings, vector/hybrid search) with enterprise governance controls.
  • 4+ years of modern data & AI engineering, including data modeling, batch/streaming pipelines, structured/unstructured processing, and feature engineering/serving fundamentals.
  • 4+ years building production, real-time inference services (API design, latency/performance, reliability patterns).
  • 4+ years leading platform/integration engineering across enterprise systems; strong API/integration experience (REST, GraphQL, event-driven, microservices, middleware).
  • 4+ years DevOps/DevSecOps experience (CI/CD, IaC such as Terraform/CloudFormation, Docker/Kubernetes, observability/monitoring).
  • 4+ years leading security/compliance efforts; familiarity with enterprise security controls (IAM, encryption, secrets, audit logging) and data/privacy (PII, retention, access controls); SOC 2/GDPR/HIPAA exposure a plus.
  • Ability to travel 0-25%, on average, based on client and project needs.
  • Limited immigration sponsorship may be available

Preferred Qualifications:

  • Advanced degree (MS/PhD) and/or relevant certifications (cloud and AI/ML).
  • 4+ years of experience with Human Capital platforms and integrations (e.g., Workday, SAP SuccessFactors, Oracle HCM, Salesforce) and HR data domains.
  • 4+ years of experience operationalizing LLMOps/MLOps capabilities (evaluation, monitoring, governance workflows, model/prompt/version management).
  • 4+ years of cloud experience on AWS/Azure/GCP (one or more), including managed data platforms and scalable compute patterns.
  • 4+ years of experience with structured problem solving, translating business needs into requirements, acceptance criteria, and shippable increments.
  • 4+ years of experience with stakeholder communication: ability to explain AI/GenAI trade-offs (quality vs. latency vs. cost vs. risk) and document decisions.
  • 4+ years of experience collaborating across product, data science/ML, data engineering, platform, and security.
  • 4+ years of experience with treat testing, monitoring, and operational readiness as core responsibilities.
  • 4+ years of experience with ethics and privacy awareness being able to recognize consent/PII/bias boundaries and escalate appropriately.

The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $113,100 to $208,300.

You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.

Possible Locations: Atlanta, Austin, Baltimore, Boston, Charlotte, Chicago, Cincinnati, Cleveland, Columbus, Costa Mesa, Dall...


What Deloitte employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom