Deep Learning Quantization Jobs in Reston, VA (NOW HIRING)

Senior Wireless Machine Learning Engineer, AI-RAN

$120.30K - $165.10K/yr

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

DeepSig, Inc

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site

$120.30K - $165.10K/yr

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

DeepSig Inc

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site +1

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

DeepSig Inc

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site +1

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

DeepSig Inc

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site +1

$120.40K - $165.40K/yr

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

DeepSig Inc

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site +1

$120.40K - $165.40K/yr

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

DeepSig Inc Careers Page

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

Quick apply

DeepSig Inc Careers Page

Senior Wireless Machine Learning Engineer, AI-RAN

Arlington, VA · On-site

Applied AI Research : Design and train modern deep learning models (Transformers, Vision ... Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference

Quartermaster AI Inc

Senior RF Machine Learning Engineer

Arlington, VA · On-site

Proficiency in Python and deep learning frameworks; familiarity with RF-native tooling such as ... Familiarity with edge inference constraints and optimization techniques (quantization, pruning ...

Quartermaster AI Inc

Senior RF Machine Learning Engineer

Arlington, VA · On-site

Quartermaster AI Inc

Senior RF Machine Learning Engineer

Arlington, VA · On-site

Quartermaster AI Inc

Senior RF Machine Learning Engineer

Arlington, VA · On-site

thechronicle

Intern, Information Tech

Washington, DC · On-site

$17 - $22.75/hr

Performance Optimization: Implement techniques such as quantization (INT8/FP8), KV cache ... Familiarity with deep learning libraries like PyTorch, TensorFlow, or JAX. Hands-on experience or ...

thechronicle

Intern, Information Tech

Washington, DC · On-site

$17 - $22.75/hr

The Chronicle of Higher Education, Inc.

Intern, Information Tech

Washington, DC · On-site

$21.34/hr

The Chronicle of Higher Education, Inc.

Intern, Information Tech

Washington, DC · On-site

$21.34/hr

Cloudera

AI Solutions Architect

Washington, DC · Remote

Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...

Cloudera

AI Solutions Architect

Washington, DC · Remote

Cloudera

AI Solutions Architect

Washington, DC · Remote

Cloudera

AI Solutions Architect

Washington, DC · Remote

Twenty Inc.

Applied AI Engineer

Washington, DC · On-site

You have strong Python skills and deep learning experience with PyTorch, TensorFlow, or JAX. * You ... You have implemented quantization or other optimization techniques to improve inference efficiency.

Twenty Inc.

Applied AI Engineer

Washington, DC · On-site

Twenty Technologies

Applied AI Engineer

Washington, DC · On-site

Twenty Technologies

Applied AI Engineer

Washington, DC · On-site

EnCharge AI

AI Compiler Engineer

Washington, DC · On-site

Implement parsing, semantic analysis, and IR generation for deep learning frameworks. * Research ... quantization, and tiling. * Familiarity with neural networks operators and code generation.

EnCharge AI

AI Compiler Engineer

Washington, DC · On-site

Spear AI

Machine Learning Engineer

Washington, DC · On-site +1

You dive deep. It's important for you to really know how things work. You're always building ... Experience with model compression techniques (quantization, pruning, distillation) * Contributions ...

Spear AI

Machine Learning Engineer

Washington, DC · On-site +1

Spear AI

Machine Learning Engineer

Washington, DC · On-site +1

Spear AI

Machine Learning Engineer

Washington, DC · On-site +1

Serko Ltd

Principal Engineer - AI Platform & Operations

Washington, DC · On-site

$168K - $230K/yr

... quantization, batching, and latency) at scale. * Enhance Observability: Design sophisticated ... Expert-level knowledge of model serving (e.g., Triton, vLLM, Ray Serve) and deep experience with ...

Serko Ltd

Principal Engineer - AI Platform & Operations

Washington, DC · On-site

$168K - $230K/yr

Serko Ltd

Principal Engineer - AI Platform & Operations

Washington, DC · On-site

$168K - $230K/yr

Quick apply

Serko Ltd

Principal Engineer - AI Platform & Operations

Washington, DC · On-site

$168K - $230K/yr

Navstar

Artificial Intelligence Integration Engineer

Arlington, VA · On-site

$119.80K - $161.30K/yr

Deploy and manage machine learning models in production using tools like MLflow, Kubeflow, or AWS ... Optimization: Optimize models for production (e.g., via quantization or pruning) and ensure ...

New

Navstar

Artificial Intelligence Integration Engineer

Arlington, VA · On-site

$119.80K - $161.30K/yr

New

Sev1Tech

MLOps Engineer with Security Clearance

Arlington, VA

Sev1Tech

MLOps Engineer with Security Clearance

Arlington, VA

Entarian

MLOps Engineer

Arlington, VA · On-site

Deploy and manage machine learning models in production using tools like MLflow, Kubeflow, or AWS ... Optimize models for production (e.g., via quantization or pruning) and ensure efficient resource ...

Entarian

MLOps Engineer

Arlington, VA · On-site

Showing results 1-20

People also search for

Ai Mod

Deep Learning Quantization Jobs in Reston, VA

Deep Learning Quantization information

See Reston, VA salary details

$11.4K

$87.3K

$145.6K

How much do deep learning quantization jobs pay per year?

As of May 28, 2026, the average yearly pay for deep learning quantization in Reston, VA is $87,271.00, according to ZipRecruiter salary data. Most workers in this role earn between $74,900.00 and $144,600.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

To excel as a Deep Learning Quantization Engineer, you need a strong background in machine learning, applied mathematics, and computer science, usually supported by an advanced degree in a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), quantization toolkits, and hardware acceleration platforms is crucial. Analytical thinking, problem-solving, and clear technical communication are standout soft skills in this role. These abilities are essential for efficiently optimizing models for deployment on resource-constrained hardware while maintaining accuracy and performance.

What are some common challenges faced when implementing deep learning quantization in production environments?

One of the main challenges in implementing deep learning quantization is balancing model accuracy with computational efficiency, as quantization can sometimes lead to a drop in model performance. Additionally, ensuring hardware compatibility and optimizing for different devices (such as CPUs, GPUs, or edge devices) can require extensive testing and tuning. Collaboration with data scientists, software engineers, and hardware specialists is often essential to successfully deploy quantized models at scale. Staying updated with the latest quantization techniques and frameworks is also important for overcoming these challenges.

What is deep learning quantization?

Deep learning quantization is the process of reducing the precision of the numbers used to represent a neural network's parameters, activations, or both. By converting the typically used 32-bit floating-point values to lower bit-width formats such as 16-bit or 8-bit integers, quantization significantly reduces the memory footprint and computational requirements of deep learning models. This technique helps deploy models efficiently on edge devices and mobile hardware while maintaining acceptable accuracy levels. Quantization is widely used in model optimization for faster inference and lower power consumption.

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

Aspect	Deep Learning Quantization	Machine Learning Engineer
Required Credentials	Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks	Bachelor's or Master's in CS, Data Science, or related fields; programming skills
Work Environment	Research labs, AI development teams, hardware optimization settings	Software development teams, data-driven projects, product-focused environments
Industry Usage	AI hardware optimization, model deployment, edge computing	Model development, data analysis, software solutions across industries

Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

What job categories do people searching Deep Learning Quantization jobs in Reston, VA look for? The top searched job categories for Deep Learning Quantization jobs in Reston, VA are:

What cities near Reston, VA are hiring for Deep Learning Quantization jobs? Cities near Reston, VA with the most Deep Learning Quantization job openings:

Deep Learning Quantization jobs near you

Senior Wireless Machine Learning Engineer, AI-RAN

DeepSig, Inc

Arlington, VA • On-site

Apply

$120.30K - $165.10K/yr

Full-time

Posted 11 days ago

Job description

Job Type
Full-time
Description
Type: Full-Time(W2) On-site/Hybrid, Arlington, VA (Remote option available for the right candidate)
DeepSig is defining the future of wireless communications by merging deep learning with the Radio Access Network (RAN). We are seeking an experienced Technical Lead to architect and drive the development of our next-generation AI-native RAN.
In this role, you will design, prototype, and validate novel AI/ML components-such as neural receivers, neural beamforming, neural scheduling, digital twin, and ISAC (Integrated Sensing and Communications)-that outperform traditional signal processing methods. You will work at the cutting edge of 6G innovation, taking concepts from mathematical intuition to simulation (e.g. NVIDIA Sionna) and real-time implementation.
What You'll be Doing

Applied AI Research: Design and train modern deep learning models (Transformers, Vision architectures, etc.) to solve complex physical layer problems, including channel estimation, MIMO detection, and beam management
Simulation & Validation: Build high-fidelity link-level simulations using NVIDIA Sionna and ray-tracing to train, test, and benchmark AI models against legacy 5G baselines
Prototyping & Deployment: Transition research models into deployable "dApps" for the Distributed Unit (DU), optimizing inference for latency and compute efficiency on NVIDIA GPUs
New Capabilities: Explore emerging AI-RAN frontiers such as Integrated Sensing and Communications (ISAC), neural scheduling, and channel digital twins
Innovation & IPR: Drive technical innovation by authoring invention disclosures, filing patents, and generating technical reports to support our standardization team in 3GPP and O-RAN Alliance contributions
Data Engineering: Architect data pipelines for generating synthetic training datasets and developing "Sim-to-Real" transfer techniques to ensure robust performance in real-world networks

Required Qualifications

Education: Ph.D. or Master's in Computer Science, Electrical Engineering, or Applied Mathematics with a focus on Deep Learning and/or Communications Systems
AI/ML Expertise: 3+ years of experience designing and training deep neural networks from scratch. Strong grasp of modern architectures and optimization techniques
Applied Signal Processing: Experience applying machine learning to real-time time-series data, signal processing, or physics-based problems (Audio, RF, or similar domains)
Research to Code: Proven ability to read academic papers and implement their methods in robust Python code
Simulation Skills: Experience with differentiable simulation or digital twins (e.g., Sionna, JAX-based physics sims)

Preferred Qualifications

Wireless Knowledge: Understanding of wireless fundamentals (OFDM, MIMO, IQ data) is highly helpful, though we prioritize strong ML intuition over pure communication theory
Performance Optimization: Experience with model quantization (FP16/INT8), pruning, or using TensorRT for real-time inference
Standardization Support: Experience writing technical whitepapers or supporting patent filings in a research environment
C++ Integration: Ability to write C++ bindings or integrate Python models into C++, SIMD, and Cuda production pipelines

Working at DeepSig
DeepSig is growing its technical team while cultivating a collaborative, agile, and fun small-team culture. We value creativity, knowledge sharing, and employee growth, and we encourage participation in scientific publications, conferences, and open-source software. We offer competitive salaries and benefits, an employee stock option grant program, an environment where we are excited to be transforming and disrupting how signal processing is done with AI/ML, a welcoming and inclusive environment, a flexible schedule, and a great work / life balance.
DeepSig is an equal-opportunity employer and does not discriminate based on race, ethnicity, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability. We are dedicated to cultivating an inclusive, diverse, and engaging workplace where individuals feel fulfilled, inspired, and motivated. We value the unique perspectives that our team brings.

About DeepSig

Sourced by ZipRecruiter

Company size

11 - 50 Employees

Headquarters location

Arlington, VA, US

Year founded

2016

Website

deepsig.ai

Social media

View All DeepSig Jobs

Apply

Deep Learning Quantization Jobs in Reston, VA (NOW HIRING)

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior Wireless Machine Learning Engineer, AI-RAN

Senior RF Machine Learning Engineer

Senior RF Machine Learning Engineer

Senior RF Machine Learning Engineer

Senior RF Machine Learning Engineer

Intern, Information Tech

Intern, Information Tech

Intern, Information Tech

Intern, Information Tech

AI Solutions Architect

AI Solutions Architect

AI Solutions Architect

AI Solutions Architect

Applied AI Engineer

Applied AI Engineer

Applied AI Engineer

Applied AI Engineer

AI Compiler Engineer

AI Compiler Engineer

Machine Learning Engineer

Machine Learning Engineer

Machine Learning Engineer

Machine Learning Engineer

Principal Engineer - AI Platform & Operations

Principal Engineer - AI Platform & Operations

Principal Engineer - AI Platform & Operations

Principal Engineer - AI Platform & Operations

Artificial Intelligence Integration Engineer

Artificial Intelligence Integration Engineer

MLOps Engineer with Security Clearance

MLOps Engineer with Security Clearance

MLOps Engineer

MLOps Engineer

People also search for

Deep Learning Quantization information

See Reston, VA salary details

How much do deep learning quantization jobs pay per year?

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

What are some common challenges faced when implementing deep learning quantization in production environments?

What is deep learning quantization?

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

Senior Wireless Machine Learning Engineer, AI-RAN

Share this job

Job description

About DeepSig

Company size

Headquarters location

Year founded

Website

Social media

Share this job