Required : • 4+ years hands-on ML engineering building and deploying production models • Deep ... quantization) • Strong Python with experience in C++ for performance-critical components • ...
Develop deep learning pipelines for object detection, segmentation, and pose estimation * Build ... Experience optimizing ML models for edge deployment (TensorRT, ONNX, quantization) Software ...
Senior Machine Learning Engineer (LLMs)
Chicago, IL · On-site
$126.20K - $166.40K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
... product quantization is a plus. * Experience with embeddings, ANN/KNN, vector stores, database ... Foundational understanding of Natural Language Processing and Deep Learning. * Excellent problem ...
Senior ML Engineer
Chicago, IL · On-site +1
$107.60K - $147.80K/yr
Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy ... models via quantization, batching, and throughput tuning * Proficiency with inference ...
... Best Practices, Learning & Development, and key developer workflow evolutions, including AI ... quantization techniques, and more--and help shape our developer relations strategy accordingly.
A singular technology platform powered by data and machine learning provides secure, differentiated ... Deep Hands-On AI/ML Expertise : Proven experience building applications with Large Language Models ...
Deep Learning Quantization information
$22.5K is the 25th percentile. Wages below this are outliers.
The median wage is $82.8K / yr.
$104.5K is the 75th percentile. Wages above this are outliers.

| Salary range | Share of jobs |
|---|---|
| $11.3K - $23.4K | 27% |
| $23.4K - $35.5K | 0% |
| $35.5K - $47.6K | 0% |
| $47.6K - $59.7K | 0% |
| $59.7K - $71.7K | 0% |
| $71.7K - $83.8K | 25% |
| $83.8K - $95.9K | 18% |
| $95.9K - $108K | 7% |
| $108K - $120.1K | 2% |
| $120.1K - $132.1K | 0% |
| $132.1K - $144.2K | 21% |

The chart scale runs from $11.3K to $144.2K, with $86.4K marked between them.
What is the difference between Deep Learning Quantization vs Machine Learning Engineer?
| Aspect | Deep Learning Quantization | Machine Learning Engineer |
|---|---|---|
| Required Credentials | Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks | Bachelor's or Master's in CS, Data Science, or related fields; programming skills |
| Work Environment | Research labs, AI development teams, hardware optimization settings | Software development teams, data-driven projects, product-focused environments |
| Industry Usage | AI hardware optimization, model deployment, edge computing | Model development, data analysis, software solutions across industries |
Deep Learning Quantization roles focus on reducing model size and improving inference speed through techniques such as weight and activation quantization, often for hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for a wide range of applications. Both roles require knowledge of AI and programming, but Deep Learning Quantization is a narrower specialization in model optimization, whereas Machine Learning Engineers work more broadly on model development and deployment.
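To make the idea of weight quantization concrete, here is a minimal sketch in plain Python. It assumes symmetric, per-tensor int8 quantization, which is only one of several common schemes. The function names and sample weights are hypothetical and are not taken from any listing or library.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float]:
    """Symmetric per-tensor quantization of floats to int8 codes (illustrative only)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to the symmetric int8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes: List[int], scale: float) -> List[float]:
    """Recover approximate float values from the integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, chosen only for illustration.
weights = [0.82, -1.27, 0.05, 0.0, 0.64]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The worst-case rounding error is about half of one quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each weight is stored as one signed byte plus a shared scale factor, instead of a 32-bit float. That is where the reduction in model size comes from. Production toolchains typically add calibration data, per-channel scales, and activation quantization on top of this basic idea.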
Job description
LightSpeed Build Technologies is revolutionizing the construction industry through AI-powered robotics. As an AI & Machine Learning Engineer, you will design, build, and deploy intelligent systems for construction robots, focusing on machine learning models for computer vision, predictive analytics, and process optimization.
Responsibilities:
• Design, train, and deploy ML models for robotic control, quality prediction, and process optimization
• Develop reinforcement learning and imitation learning systems for robot task planning
• Build predictive maintenance models using sensor data to anticipate equipment failures
• Implement anomaly detection for real-time quality monitoring during automated assembly
• Optimize model inference for edge deployment on GPU-accelerated hardware in production
• Develop deep learning pipelines for object detection, segmentation, and pose estimation
• Build real-time vision systems for robotic guidance, workpiece tracking, and dimensional verification
• Implement 3D point cloud processing for construction material recognition
• Design and train models for visual quality inspection using depth cameras and industrial imaging
• Build ML data pipelines from sensor acquisition through model training and deployment
• Establish data labeling, versioning, and management workflows for training datasets
• Implement model monitoring, A/B testing, and continuous improvement in production
• Design experiment tracking and reproducibility infrastructure (MLflow, Weights & Biases)
• Integrate ML models with ROS2-based robot control for real-time inference
• Optimize models for NVIDIA Jetson, industrial PCs, and edge computing platforms
• Collaborate with robotics engineers on sensor selection, placement, and calibration
• Support scaling ML systems across multiple production cells and sites
Qualifications:
Required:
• 4+ years hands-on ML engineering building and deploying production models
• Deep proficiency with PyTorch or TensorFlow for model development and training
• Strong computer vision experience: object detection, segmentation, depth estimation, or 3D vision
• Understanding of reinforcement learning, imitation learning, or robot learning approaches
• Experience optimizing ML models for edge deployment (TensorRT, ONNX, quantization)
• Strong Python with experience in C++ for performance-critical components
• Experience with ML infrastructure: data pipelines, experiment tracking, model serving
• Proficiency with Linux, Docker, Git, and CI/CD workflows
• Understanding of real-time system constraints for ML inference in production
Preferred:
• MS or PhD in Machine Learning, Computer Science, Robotics, or related field
• Experience with robotics simulation: MuJoCo, IsaacSIM, or similar
• Background in manufacturing, industrial automation, or construction technology
• Experience with ROS/ROS2 integration for ML-powered robotics
• Published research or patents in computer vision, robot learning, or related ML
• Experience with NVIDIA ecosystem: CUDA, cuDNN, TensorRT, Jetson platforms
Company:
BUILDING TOMORROW'S HOMES, FASTER AND SMARTER. The LightSpeed Integrated Walls, Floors and Roof Systems are built with advanced software and AI-driven industrial robots, allowing us to seamlessly craft the walls, floors and roofs, integrating the framing, MEPs, insulation, and drywall in a single, efficient manufacturing line. The company has a team of 11-50 employees and is currently Early Stage.