1

Ml Inference Jobs in Florida (NOW HIRING)

Support model serving and inference infrastructure for a range of ML use cases, including traditional ML, computer vision, speech/audio, and LLM-based systems * Build and maintain CI/CD workflows for ...

Support model serving and inference infrastructure for a range of ML use cases, including traditional ML, computer vision, speech/audio, and LLM-based systems * Build and maintain CI/CD workflows for ...

Senior AI/ML Engineer

Tallahassee, FL ยท Remote

$90 - $100/hr

Senior AI/ML Engineer Anywhere Type: Contract-to-Hire Category: Development Industry: Government ... Hands-on experience with LLM orchestration, integration, and vLLM-based inference for document ...

Lead ML Ops engineer

Naples, FL

$96K - $127K/yr

Oversee enterprisescale AI platforms supporting model training, inference, evaluation, monitoring ... Leadershiplevel expertise in AI/ML platform engineering, spanning MLOps, LLMOps, and AIOps.

Lead ML Ops Engineer

Orlando, FL ยท On-site

$95K - $126K/yr

Oversee enterprisescale AI platforms supporting model training, inference, evaluation, monitoring ... Leadershiplevel expertise in AI/ML platform engineering, spanning MLOps, LLMOps, and AIOps.

Lead ML Ops Engineer

Naples, FL ยท On-site

$96K - $127K/yr

Oversee enterprisescale AI platforms supporting model training, inference, evaluation, monitoring ... Leadershiplevel expertise in AI/ML platform engineering, spanning MLOps, LLMOps, and AIOps.

Lead ML Ops engineer

Orlando, FL

$95K - $126K/yr

Oversee enterprisescale AI platforms supporting model training, inference, evaluation, monitoring ... Leadershiplevel expertise in AI/ML platform engineering, spanning MLOps, LLMOps, and AIOps.

Apply causal inference methods to understand the impact of potential product changes. * Define and build new ML features using text and multimodal embeddings and GenAI. * Validate offline learnings ...

Data Engineer (AI-focused)

Miami, FL ยท On-site

$90K - $110K/yr

Build and maintain scalable data pipelines for AI/ML use cases * Design data architectures for structured and unstructured data * Prepare datasets for training, fine-tuning, and inference * Ensure ...

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

AI Data Engineer - Senior Consultant

Tallahassee, FL ยท Hybrid

$99K - $136K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

AI Data Engineer - Senior Consultant

Tampa, FL ยท Hybrid

$98K - $135K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

AI Engineer Senior Consultant

Jacksonville, FL ยท Hybrid

$96K - $133K/yr

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). * Implement safety, privacy, and ...

Deliver governed data and features for ML/GenAI (curated datasets, feature pipelines/serving) supporting training and real-time inference, including consistency, caching, backfills, and latency SLOs.

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

Deliver governed data and features for ML/GenAI (curated datasets, feature pipelines/serving) supporting training and real-time inference, including consistency, caching, backfills, and latency SLOs.

Deliver governed datasets and feature engineering/serving for ML training and real-time inference (online/offline consistency, caching, latency SLOs, backfills). A successful candidate would possess ...

next page

Showing results 1-20

Ml Inference information

What is ML inference?

ML inference refers to the process of using a trained machine learning model to make predictions or decisions based on new data. After a model has been trained on historical data, inference is the phase where that model is deployed and used in real-world applications, such as recognizing speech, detecting objects in images, or recommending products. The focus in ML inference is on speed, efficiency, and scalability to ensure quick predictions, often in real time. This process is critical for practical applications like mobile apps, web services, and embedded systems. Optimizing inference involves reducing latency, memory usage, and computational requirements.

What is the difference between Ml Inference vs Data Scientist?

AspectML InferenceData Scientist
Required CredentialsKnowledge of machine learning models, programming skillsDegree in data science, statistics, or related fields
Work EnvironmentDeploying models in production, real-time data processingData analysis, model development, research
Industry UsageAI product deployment, software companiesResearch institutions, tech firms, consulting

ML Inference focuses on deploying trained models to make predictions on new data, often in real-time. Data Scientists develop and analyze models, working primarily in research and development. While both roles require understanding of machine learning, ML Inference emphasizes deployment and operationalization, whereas Data Scientists focus on model creation and analysis.

Which 3 jobs will survive AI?

For ML Inference roles, jobs that require complex problem-solving, creativity, and emotional intelligence are more likely to persist, such as data scientists, AI ethics specialists, and machine learning engineers. These roles involve tasks that are difficult to automate and often require specialized skills, domain knowledge, and critical thinking. Continuous learning and expertise in AI tools and programming languages like Python or TensorFlow can also enhance job security in this field.

What engineers make $500,000?

Senior machine learning engineers with extensive experience, specialized skills in deep learning, and strong industry demand can earn $500,000 or more annually, especially in high-cost-of-living areas or within top tech companies. Achieving this level typically requires advanced degrees, certifications, and a proven track record of impactful projects.

What is a $900,000 AI job?

A $900,000 AI job typically refers to high-level roles in artificial intelligence, such as senior machine learning engineers or AI research directors, often requiring advanced skills in deep learning, data science, and experience with tools like TensorFlow or PyTorch. These positions usually involve leadership responsibilities, strategic planning, and may require multiple years of specialized experience or advanced degrees.

Is ML a high paying job?

Machine Learning (ML) inference roles are generally well-paid due to the specialized skills required, such as knowledge of algorithms, programming, and data analysis. Salaries vary based on experience, location, and industry, but they tend to be higher than average for tech positions. Advanced roles often require proficiency with tools like TensorFlow or PyTorch and may include certifications or advanced degrees.

What are some common challenges faced by ML Inference Engineers when deploying models to production?

ML Inference Engineers often encounter challenges such as optimizing model latency and throughput to meet production requirements, ensuring compatibility with diverse hardware environments, and managing model versioning and updates without disrupting service. Additionally, balancing resource utilization and inference accuracy while monitoring real-time performance metrics is crucial. Collaboration with data scientists, DevOps, and software engineers is typically essential to streamline deployment and maintain robust, scalable inference pipelines.

What are the key skills and qualifications needed to thrive in ML Inference, and why are they important?

To thrive in ML Inference, you need a solid background in machine learning principles, programming (Python or C++), and experience with deploying models at scale, often supported by a degree in computer science or a related field. Familiarity with frameworks and tools such as TensorFlow, PyTorch, ONNX, and cloud platforms like AWS SageMaker or Google AI Platform is typically required. Strong problem-solving skills, attention to detail, and effective communication are crucial soft skills for collaborating with multidisciplinary teams and optimizing model performance. These skills ensure efficient, scalable, and reliable deployment of machine learning solutions in real-world applications.
What job categories do people searching Ml Inference jobs in Florida look for? The top searched job categories for Ml Inference jobs in Florida are:
What cities in Florida are hiring for Ml Inference jobs? Cities in Florida with the most Ml Inference job openings:
Principal MLOps Engineer

Principal MLOps Engineer

Raft

Tampa, FL โ€ข On-site

Other

Medical, Dental, Vision, Retirement, PTO

This job post hasย expired today.ย Applications are no longer accepted.


Job description

This is a U.S. based position. All of the programs we support require U.S. citizenship to be eligible for employment. All work must be conducted within the continental U.S.

Who we are:

Raft (https://TeamRaft.com) is a customer-obsessed non-traditional defense tech company dedicated to empowering U.S. military and government agencies with cutting-edge AI/ML and data solutions. We are a leader in autonomous data fusion and Agentic AI, with a purposeful focus on Distributed Data Systems, Platforms at Scale, and Complex Application Development. With headquarters in McLean, VA, our range of clients includes innovative federal and public agencies leveraging design thinking, cutting-edge tech stack, and cloud-native ecosystem. We build digital solutions that impact the lives of millions of Americans.

We're looking for an experienced Principal ML OpsEngineer to support our customers and join our passionate team of high-impact problem solvers.

About the role:

Raft is building mission-critical AI and data platforms for the Department of Defense (DoD). Our systems ingest and process massive volumes of real-time data from hundreds of sensors and operational sources, transform that data into usable intelligence, and deliver it to operators through mission applications and common operational pictures that support time-sensitive decision-making.

Our platform operates at scale, processing billions of events per day with low-latency data pipelines and cloud-native infrastructure. As Raft expands its AI capabilities, we are investing in a more mature end-to-end machine learning platform to support model development, evaluation, deployment, monitoring, and lifecycle management across both cloud and constrained operational environments.

In this role, you will help design, deploy, and mature Raft's ML platform and MLOps infrastructure. You will work across Kubernetes-based deployment environments, GPU-enabled infrastructure, model serving systems, CI/CD pipelines, and secure production operations to enable rapid and reliable delivery of machine learning capabilities. This role is ideal for someone who understands both the infrastructure needed to run ML systems in production and the practical needs of ML engineers building and deploying models.

What you'll do:
  • Design, build, and maintain secure, scalable MLOps infrastructure and deployment pipelines for production ML systems
  • Help mature Raft's internal ML platform and model lifecycle capabilities, including model packaging, registry/catalog workflows, deployment, monitoring, and operational support
  • Deploy and manage machine learning workloads on Kubernetes, including GPU-enabled clusters
  • Support model serving and inference infrastructure for a range of ML use cases, including traditional ML, computer vision, speech/audio, and LLM-based systems
  • Build and maintain CI/CD workflows for ML services, model artifacts, and platform components
  • Partner closely with ML engineers, software engineers, and product teams to move models from experimentation to reliable operational deployment
  • Improve observability, reliability, security, and maintainability across ML infrastructure and services
  • Help evaluate and standardize runtime patterns, serving frameworks, and deployment architectures for production ML workloads
  • Contribute to infrastructure decisions across edge, on-prem, and cloud-hosted deployment environments
  • Support compliance-driven deployment practices and secure software supply chain requirements in defense environments
  • Get hands-on with customers at the most forward-leaning places in the Department of War


What we are looking for:

  • 7+ years of relevant hands-on experience in software engineering, platform engineering, DevOps, MLOps, or related technical roles
  • 5+ years of experience with Docker and Kubernetes in production environments
  • 5+ years of experience supporting enterprise cloud infrastructure or applications in AWS, Azure, or similar environments
  • Strong experience provisioning, operating, and troubleshooting Kubernetes clusters in production
  • Experience building and maintaining machine learning platforms, infrastructure, or pipelines used by engineering or data science teams
  • Practical experience deploying machine learning workloads on Kubernetes
  • Experience managing clusters or workloads that use GPUs
  • Strong understanding of Helm and Kubernetes deployment patterns
  • Strong scripting or programming skills, preferably in Python
  • Experience with modern software engineering practices including Git, CI/CD, DevOps, and Agile/Scrum workflows
  • Strong troubleshooting, systems thinking, and communication skills
  • Ability to work independently and collaboratively in a fast-moving environment
  • Ability to obtain and maintain a Top Secret clearance
  • Ability to obtain Security+ certification within the first 90 days of employment

Highly preferred:

  • Experience with ML model serving and inference platforms such as Triton Inference Server, KServe, Ray Serve, vLLM, or similar technologies
  • Experience with secure and compliant deployment practices in regulated or government environments
  • Experience with Kubernetes-based ML platforms such as Kubeflow
  • Familiarity with service mesh technologies such as Istio
  • Experience provisioning and debugging complex CI/CD systems
  • Experience with infrastructure as code tools such as Terraform
  • Familiarity with software supply chain security, container hardening, vulnerability management, and runtime scanning
  • Experience supporting ML systems across multiple deployment environments, including cloud, on-prem, and edge
  • Background working with machine learning engineers on model training, evaluation, packaging, and release workflows
  • Familiarity with storage and artifact systems used in ML platforms, such as S3-compatible object stores, registries, and metadata/catalog system
What success looks like:
  • You help Raft stand up a more mature and repeatable ML platform for deploying and managing models in production
  • ML engineers can move faster because deployment, serving, and platform workflows are clearer, more reliable, and easier to use
  • Model deployments become more secure, observable, and supportable across real-world mission environments
  • The organization gains stronger infrastructure for model lifecycle management, including deployment standards, runtime patterns, and platform ownership

Clearance Requirements:

  • Ability to obtain and maintain a Top Secret clearance

Work Type:

  • Remote in DMV; McLean, VA; Boston, MA; San Antonio, TX; Colorado Springs, CO; Tampa, FL; Honolulu, HI Locations ONLY
  • May require up to 40% travel

Salary Range: $150,000.00 - $200,000.00

What we will offer you:

  • Highly competitive salary
  • Fully covered healthcare, dental, and vision coverage
  • 401(k) and company match
  • Take as you need PTO + 11 paid holidays
  • Education & training benefits
  • Annual budget for your tech/gadgets needs
  • Monthly box of yummy snacks to eat while doing meaningful work
  • Remote, hybrid, and flexible work options
  • Team off-site in fun places!
  • Generous Referral Bonuses
  • And More!

Our Vision Statement:

We bridge the gap between humans and data through radical transparency and our obsession withthemission.

Our Customer Obsession:

We will approach every deliverable like it's a product. We will adopt a customer-obsessed mentality. As we grow, and our footprint becomes larger, teams and employees will treat each other not only as teammates but customers. We must live the customer-obsessed mindset, always. This will help us scale and it will translate to the interactions that our Rafters have with their clients and other product teams that they integrate with. Our culture will enable our success and set us apart from other companies.

How do we get there?

Public-sector modernization is critical for us to live in a better world. We, at Raft, want to innovate and solve complex problems. And, if we are successful, our generation and the ones that follow us will live in a delightful, efficient, and accessible world where out-of-box thinking,and collaboration is a norm.

Raft's core philosophy isUbuntu: IAm, BecauseWe are. We support our"nadi"by elevating the other Rafters. We work as a hyper collaborative team where each team member brings a unique perspective, adding value that did not exist before. People make Raft special. We celebrate each other and our cognitive and cultural diversity. We are devoted to our practice of innovation and collaboration.

We're an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.