AI Platform Engineer
$140K - $250K/yr
Develop infrastructure for real-time and batch ML inference at scale * Implement model monitoring, drift detection, and automated retraining systems * Design data pipelines and feature stores for ML ...
Track and manage tasks across concurrent projects using Kanban tools (ClickUp, Jira) REQUIRED SKILLS AI / ML & Inference * SGLang, vLLM, Ollama, OpenWebUI * NVIDIA Triton Inference Server, NVIDIA NIM ...
Tampa, FL · On-site
... AI / ML & Inference - SGLang, vLLM, Ollama, OpenWebUI - NVIDIA Triton Inference Server, NVIDIA NIM, NVIDIA NeMo, TensorRT - CUDA, cuBLAS, cuDNN, NCCL (multi-GPU) - Hugging Face Transformers ...
Tampa, FL · On-site
Architect and implement scalable ML training and inference pipelines using AWS SageMaker, managing model training, hyperparameter tuning, distributed training for large vision models, and real-time ...
ML Engineer
Miami, Florida, United States
... Strong understanding of data pipeline design, real-time inference, and model monitoring.
ML Engineer
Tampa, Florida, United States
... Strong understanding of data pipeline design, real-time inference, and model monitoring.
Orlando, FL · On-site
Job : Backend Python Developer - AI/ML Location : Orlando preferable, Las Vegas Skills : Python ... Experience integrating LLM APIs (OpenAI, HuggingFace Inference API, etc.) into real-world ...
Develop causal inference methodologies to understand true incrementality of product changes ... Proven track record building and deploying ML models in production, particularly in ...
$109K - $131K/yr
Design scalable ML inference systems that handle high-volume, low-latency predictions in production environments * Create comprehensive monitoring and alerting systems for model performance, data ...
Doral, FL · On-site
$105K - $127K/yr
Design scalable ML inference systems that handle high-volume, low-latency predictions in production environments * Create comprehensive monitoring and alerting systems for model performance, data ...
Tallahassee, FL · Remote
$90K - $123K/yr
Contribute to model deployment, inference services, and production monitoring workflows * Improve data quality, lineage, provenance, and operational transparency across ML pipelines * Contribute to ...
Aventura, FL · On-site
We are expanding our AI/ML capabilities to include generative AI-driven solutions, RAG applications ... Build and maintain scalable machine learning pipelines for data processing, training, and inference.
Satellite Beach, FL · Hybrid
$113K - $149K/yr
Integrate AI/ML capabilities into production systems (e.g., model inference APIs, decision-support features, anomaly detection workflows) * Design and optimize data models and persistence layers to ...
$105K - $139K/yr
Integrate AI/ML capabilities into production systems (e.g., model inference APIs, decision-support features, anomaly detection workflows) * Design and optimize data models and persistence layers to ...
Architect and implement GenAI observability pipelines that capture LLM inference telemetry ... ML monitoring platforms (e.g., MLflow, Weights & Biases, LangSmith, or internal tooling) to enable ...
... inference systems and low-latency model serving * Knowledge of adversarial ML and AI security/robustness techniques * Experience with graph neural networks for network analysis * Experience in design ...
| Aspect | ML Inference | Data Scientist |
|---|---|---|
| Required Credentials | Knowledge of machine learning models, programming skills | Degree in data science, statistics, or related fields |
| Work Environment | Deploying models in production, real-time data processing | Data analysis, model development, research |
| Industry Usage | AI product deployment, software companies | Research institutions, tech firms, consulting |
ML Inference focuses on deploying trained models to make predictions on new data, often in real-time. Data Scientists develop and analyze models, working primarily in research and development. While both roles require understanding of machine learning, ML Inference emphasizes deployment and operationalization, whereas Data Scientists focus on model creation and analysis.
$140K - $250K/yr
Other
Posted 25 days ago
Build and scale the infrastructure that powers AI at enterprise scale. Design robust, automated systems that enable data scientists and ML engineers to deploy, monitor, and maintain machine learning models in production environments.
Key Responsibilities:
Requirements:
Benefits:
Compensation Range: $140,000 - $250,000+ plus equity