... quantization, pruning, TensorRT, ONNX export) • Collaborating with systems engineers to integrate ... deep learning framework • Have strong intuition for data quality; you can look at annotated ...
... quantization, pruning, TensorRT, ONNX export) • Collaborating with systems engineers to integrate ... deep learning framework • Have strong intuition for data quality; you can look at annotated ...
Are proficient in Python and PyTorch or a comparable deep learning framework * Have strong ... quantization) * Understand multi-object tracking and have implemented or worked with tracking ...
Are proficient in Python and PyTorch or a comparable deep learning framework * Have strong ... quantization) * Understand multi-object tracking and have implemented or worked with tracking ...
Are proficient in Python and PyTorch or a comparable deep learning framework * Have strong ... quantization) * Understand multi-object tracking and have implemented or worked with tracking ...
Are proficient in Python and PyTorch or a comparable deep learning framework * Have strong ... quantization) * Understand multi-object tracking and have implemented or worked with tracking ...
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
Director of AI Engineering & Research, Frontier Systems
Washington, DC · On-site
$335K - $444K/yr
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
Director of AI Engineering & Research, Frontier Systems
Washington, DC · On-site
$335K - $444K/yr
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
Director of AI Engineering & Research, Frontier Systems
Washington, DC · On-site
$335K - $444K/yr
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
Director of AI Engineering & Research, Frontier Systems
Washington, DC · On-site
$335K - $444K/yr
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
Director of AI Engineering & Research, Frontier Systems with Security Clearance
Washington, DC · On-site
$335K - $444K/yr
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
Director of AI Engineering & Research, Frontier Systems with Security Clearance
Washington, DC · On-site
$335K - $444K/yr
... quantization, on-device inference, and runtime safety. * Set engineering standards and design ... Deep expertise in modern deep learning and generative AI: agents, VLA models, multimodal perception ...
AI Solutions Architect
Washington, DC · Remote
Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...
AI Solutions Architect
Washington, DC · Remote
Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...
Design and train deep learning models for insect classification and morphological recognition ... Investigate quantization, pruning, and other model optimization techniques to ensure efficient ...
Design and train deep learning models for insect classification and morphological recognition ... Investigate quantization, pruning, and other model optimization techniques to ensure efficient ...
Machine Learning Engineer
Washington, DC · On-site +1
You dive deep. It's important for you to really know how things work. You're always building ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...
Machine Learning Engineer
Washington, DC · On-site +1
You dive deep. It's important for you to really know how things work. You're always building ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...
URGENT NEED - AI/ML Subject Matter Expert (SME) ___________Baltimore, MD - ONSITE
Baltimore, MD · On-site
Cost Awareness: * Demonstrate cost control strategies such as model quantization, optimized ... Strong background in AI/ML and deep understanding of machine learning algorithms and techniques.
URGENT NEED - AI/ML Subject Matter Expert (SME) ___________Baltimore, MD - ONSITE
Baltimore, MD · On-site
Cost Awareness: * Demonstrate cost control strategies such as model quantization, optimized ... Strong background in AI/ML and deep understanding of machine learning algorithms and techniques.
Data Scientist Level 4 with Security Clearance
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4 with Security Clearance
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Data Scientist Level 4
Fort George G Meade, MD · On-site
$220K - $235K/yr
Deep understanding of machine learning architectures, model selection, training, and optimization ... Strong background in AI/ML performance optimization, including model compression, quantization, or ...
Principal Engineer - AI Platform & Operations
Washington, DC · On-site
$168K - $230K/yr
... quantization, batching, and latency) at scale. * Enhance Observability: Design sophisticated ... Expert-level knowledge of model serving (e.g., Triton, vLLM, Ray Serve) and deep experience with ...
Quick apply
Principal Engineer - AI Platform & Operations
Washington, DC · On-site
$168K - $230K/yr
... quantization, batching, and latency) at scale. * Enhance Observability: Design sophisticated ... Expert-level knowledge of model serving (e.g., Triton, vLLM, Ray Serve) and deep experience with ...
Principal Engineer - AI Platform & Operations
Washington, DC · On-site
$168K - $230K/yr
... quantization, batching, and latency) at scale. * Enhance Observability: Design sophisticated ... Expert-level knowledge of model serving (e.g., Triton, vLLM, Ray Serve) and deep experience with ...
Principal Engineer - AI Platform & Operations
Washington, DC · On-site
$168K - $230K/yr
... quantization, batching, and latency) at scale. * Enhance Observability: Design sophisticated ... Expert-level knowledge of model serving (e.g., Triton, vLLM, Ray Serve) and deep experience with ...
... learning --capabilities that are central to Shield AI's strategic direction. This is a high-impact ... Define requirements for distillation, quantization, and inference tooling as part of the "three ...
New
Quick apply
... learning --capabilities that are central to Shield AI's strategic direction. This is a high-impact ... Define requirements for distillation, quantization, and inference tooling as part of the "three ...
New
Deep Learning Quantization information
See Severn, MD salary details
$24.3K is the 25th percentile. Wages below this are outliers.
$12.2K - $25.3K
27% of jobs
$25.3K - $38.3K
0% of jobs
$38.3K - $51.3K
0% of jobs
$51.3K - $64.4K
0% of jobs
$64.4K - $77.4K
0% of jobs
The median wage is $89.4K / yr.
$77.4K - $90.5K
25% of jobs
$90.5K - $103.5K
18% of jobs
$112.8K is the 75th percentile. Wages above this are outliers.
$103.5K - $116.5K
7% of jobs
$116.5K - $129.6K
2% of jobs
$129.6K - $142.6K
0% of jobs
$142.6K - $155.6K
21% of jobs
$12.2K
$93.3K
$155.6K
How much do deep learning quantization jobs pay per year?
What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?
What is the difference between Deep Learning Quantization vs Machine Learning Engineer?
| Aspect | Deep Learning Quantization | Machine Learning Engineer |
|---|---|---|
| Required Credentials | Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks | Bachelor's or Master's in CS, Data Science, or related fields; programming skills |
| Work Environment | Research labs, AI development teams, hardware optimization settings | Software development teams, data-driven projects, product-focused environments |
| Industry Usage | AI hardware optimization, model deployment, edge computing | Model development, data analysis, software solutions across industries |
Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.
What is deep learning quantization?
What are some common challenges faced when implementing deep learning quantization in production environments?
Full-time
Posted 9 days ago
Job description
Helsing develops artificial intelligence-enabled capabilities to protect and defend democracies. The Machine Learning Engineer will own the detection and tracking models powering Helsing's products, managing the full model lifecycle from data assessment to deployment on edge platforms.
Responsibilities:
• Training and fine-tuning detection models (YOLO, DETR, Faster R-CNN, and similar architectures) on mission-specific datasets
• Implementing and improving multi-object tracking pipelines (SORT, DeepSORT, ByteTrack, or similar)
• Evaluating model performance: analyzing metrics, diagnosing failure modes, and iterating on data and model improvements
• Managing the data pipeline end-to-end: assessing raw data, coordinating annotation, curating datasets, and implementing augmentation strategies
• Optimizing models for deployment on SWaP-constrained and embedded platforms (quantization, pruning, TensorRT, ONNX export)
• Collaborating with systems engineers to integrate models into the broader Altra platform
• Working across sensor modalities as needed, including electro-optical, infrared, and other imaging sources
Qualifications:
Required:
• Have 5+ years of experience in applied machine learning or computer vision
• Have a Bachelor's degree in Computer Science, Electrical Engineering, or a related field; Master's or PhD strongly preferred
• Have production experience training and deploying object detection models — not just research or academic projects
• Are proficient in Python and PyTorch or a comparable deep learning framework
• Have strong intuition for data quality; you can look at annotated datasets, training curves, and evaluation metrics and know what's wrong
• Have experience with the full model training lifecycle: data curation, annotation management, training, evaluation, and deployment
• Have experience optimizing models for deployment on SWaP-constrained and edge platforms (TensorRT, ONNX, quantization)
• Understand multi-object tracking and have implemented or worked with tracking algorithms in practice
• Can read and contextualize scientific papers in computer vision and apply findings to production systems
• Are a U.S. citizen with an active security clearance or the ability to obtain one
Preferred:
• Strong proficiency in Rust or C++ for production model deployment and optimization
• Experience with multiple sensor modalities — particularly infrared or thermal imaging
• Familiarity with MLOps tooling: experiment tracking (MLflow, Weights & Biases), dataset versioning, model registries
• Experience with annotation tools and workflows (CVAT, Label Studio, or similar)
• Background in computer vision beyond detection — segmentation, pose estimation, activity recognition
• Experience with simulators, emulators, or synthetic data generation for training and evaluation
• Experience deploying models on GPU-accelerated embedded platforms (NVIDIA Jetson, similar)
• Background in defense, intelligence, or other mission-critical environments
Company:
Helsing develops AI-powered defense tech, focusing on drones and software, to enhance military capabilities for democratic nations. Founded in 2021, the company is headquartered in Munich, DEU, with a team of 501-1000 employees. The company is currently Late Stage.