Deep Learning Quantization Jobs in Massachusetts

Staff Research Scientist, AI-Hardware Co-Design

We fuse state-of-the-art machine learning with deep domain expertise to convert raw physical data ... quantization (INT8, mixed-precision) to maximize arithmetic intensity within strict edge power ...

Analogdevices

Staff Research Scientist, AI-Hardware Co-Design

Boston, MA

Analog Devices

Staff Research Scientist, AI-Hardware Co-Design

Boston, MA · On-site

Analog Devices

Staff Research Scientist, AI-Hardware Co-Design

Boston, MA · On-site

SimpliSafe

Staff CV Applied Research Engineer, Edge AI

Boston, MA · On-site

Quantization (PTQ/QAT), pruning, mixed precision, and operator-aware optimization * Translate ... Strong computer vision background with deep learning expertise across detection/classification ...

SimpliSafe

Staff CV Applied Research Engineer, Edge AI

Boston, MA · On-site

SimpliSafe

Staff CV Applied Research Engineer, Edge AI

Boston, MA

Quick apply

SimpliSafe

Staff CV Applied Research Engineer, Edge AI

Boston, MA

SimpliSafe

Staff CV Applied Research Engineer, Edge AI

Boston, MA · On-site

SimpliSafe

Staff CV Applied Research Engineer, Edge AI

Boston, MA · On-site

Redhat

Senior Manager, AI Inference

Boston, MA · On-site +1

$210K - $358K/yr

... quantization and sparsification, our team provides a stable platform for enterprises to build ... deep learning in the open source way, this is the role for you. Join us in shaping the future of AI ...

Redhat

Senior Manager, AI Inference

Boston, MA · On-site +1

$210K - $358K/yr

Zoox

Senior Software Engineer - C++ GPU Performance

Boston, MA

$217K - $307K/yr

The scope for GPU usage ranges from traditional computer vision and deep learning architectures to ... Hands-on work with ML model optimization (post-training quantization, layer pruning, etc) or hand ...

Zoox

Senior Software Engineer - C++ GPU Performance

Boston, MA

$217K - $307K/yr

Zoox

Software Engineer - C++ GPU Performance

Boston, MA

$168K - $239K/yr

Zoox

Software Engineer - C++ GPU Performance

Boston, MA

$168K - $239K/yr

Zoox

Software Engineer - C++ GPU Performance

Boston, MA · On-site

$168K - $239K/yr

Zoox

Software Engineer - C++ GPU Performance

Boston, MA · On-site

$168K - $239K/yr

Zoox

Software Engineer - C++ GPU Performance

Boston, MA

$152K/yr

Quick apply

Zoox

Software Engineer - C++ GPU Performance

Boston, MA

$152K/yr

Zoox

Senior Software Engineer - C++ GPU Performance

Boston, MA

$152K/yr

Quick apply

Zoox

Senior Software Engineer - C++ GPU Performance

Boston, MA

$152K/yr

Analog Devices

Edge Compute Architect for Optical Systems

Boston, MA · On-site

$264K/yr

The Deep Tech Innovation group within the Analog Garage brings together systems architecture ... quantization strategies * Own real-time performance budgeting, including throughput and latency ...

Analog Devices

Edge Compute Architect for Optical Systems

Boston, MA · On-site

$264K/yr

Analogdevices

Edge Compute Architect for Optical Systems

Boston, MA

$264K/yr

Analogdevices

Edge Compute Architect for Optical Systems

Boston, MA

$264K/yr

Qualcomm

GPU HW Research Engineer (San Diego/Boxborough)

Boxborough, MA · On-site

$161K - $242K/yr

... AI) , Machine Learning (ML) , and General-Purpose GPU (GPGPU) computing. This is a unique ... Deep understanding of quantization techniques and data types for large language models (LLMs) and ...

Qualcomm

GPU HW Research Engineer (San Diego/Boxborough)

Boxborough, MA · On-site

$161K - $242K/yr

Showing results 1-20

Deep Learning Quantization Jobs in Massachusetts

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

To excel as a Deep Learning Quantization Engineer, you need a strong background in machine learning, applied mathematics, and computer science, usually supported by an advanced degree in a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), quantization toolkits, and hardware acceleration platforms is crucial. Analytical thinking, problem-solving, and clear technical communication are standout soft skills in this role. These abilities are essential for efficiently optimizing models for deployment on resource-constrained hardware while maintaining accuracy and performance.

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

Aspect	Deep Learning Quantization	Machine Learning Engineer
Required Credentials	Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks	Bachelor's or Master's in CS, Data Science, or related fields; programming skills
Work Environment	Research labs, AI development teams, hardware optimization settings	Software development teams, data-driven projects, product-focused environments
Industry Usage	AI hardware optimization, model deployment, edge computing	Model development, data analysis, software solutions across industries

Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

What is deep learning quantization?

Deep learning quantization is the process of reducing the precision of the numbers used to represent a neural network's parameters, activations, or both. By converting the typically used 32-bit floating-point values to lower bit-width formats such as 16-bit or 8-bit integers, quantization significantly reduces the memory footprint and computational requirements of deep learning models. This technique helps deploy models efficiently on edge devices and mobile hardware while maintaining acceptable accuracy levels. Quantization is widely used in model optimization for faster inference and lower power consumption.

What are some common challenges faced when implementing deep learning quantization in production environments?

One of the main challenges in implementing deep learning quantization is balancing model accuracy with computational efficiency, as quantization can sometimes lead to a drop in model performance. Additionally, ensuring hardware compatibility and optimizing for different devices (such as CPUs, GPUs, or edge devices) can require extensive testing and tuning. Collaboration with data scientists, software engineers, and hardware specialists is often essential to successfully deploy quantized models at scale. Staying updated with the latest quantization techniques and frameworks is also important for overcoming these challenges.

What are popular job titles related to Deep Learning Quantization jobs in Massachusetts? For Deep Learning Quantization jobs in Massachusetts, the most frequently searched job titles are:

What job categories do people searching Deep Learning Quantization jobs in Massachusetts look for? The top searched job categories for Deep Learning Quantization jobs in Massachusetts are:

What cities in Massachusetts are hiring for Deep Learning Quantization jobs? Cities in Massachusetts with the most Deep Learning Quantization job openings:

Boston

Deep Learning Quantization jobs near you

Staff Research Scientist, AI-Hardware Co-Design

Analogdevices

Boston, MA

Apply

Full-time

Medical, Dental, Vision, Retirement, PTO

Posted yesterday

Job description

About Analog Devices

Analog Devices, Inc. (NASDAQ: ADI) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, AI, and software technologies into solutions that combat climate change, reliably connect humans and the world, and help drive advancements in automation and robotics, mobility, healthcare, energy and data centers. With revenue of more than $11 billion in FY25, ADI ensures today's innovators stay Ahead of What's Possible. Learn more at www.analog.comand on LinkedIn and X.

The Analog Garage is ADI's Innovation Lab, located in the heart of downtown Boston. We pioneer breakthrough technologies to solve high-impact problems that drive tangible value. Bringing together engineers, research scientists, and business leaders, we develop new technologies and solutions in a fast-moving, experiment-focused startup atmosphere.

The Role

The Algorithmic Solutions Group develops cutting-edge, efficient algorithms to bring intelligence to the physical world. We fuse state-of-the-art machine learning with deep domain expertise to convert raw physical data into actionable insights, solving the hard problems where off-the-shelf solutions fall short.

We are seeking aStaff Research Scientist, AI-Hardware Co-Design to architect the solutions that power the next generation of intelligent systems. Through rigorous analysis and proof-of-concepts, you will bridge the gap between advanced AI algorithms and hardware implementation to define the optimal compute strategy. Operating at the boundary of software and silicon, you will drive the co-design of algorithms and architecture, defining the computational foundation to enable physical intelligence.

Key Responsibilities

Strategic Problem Definition:Collaborate with business leads and domain experts to identify opportunities whereintelligent systems-integrating sensing, actuation, and tightly coupled algorithms-can solve previously impossible problems. You will focus on challenges that requireholistic system innovationrather than just off-the-shelf components.
Research-to-Product:Lead technical execution fromarchitectural designto validated proof-of-concept. You will partner with researchers and hardware engineers to bridge the gap betweenabstract research ideasanddeployable solutions.
Feasibility analysis:Act as the "Physics of Compute" anchor for the research team. You will use high-fidelity simulation, modeling, and proof-of-concepts to quantify the impact of memory hierarchy, dataflow, and precision on system performance-distinguishing between viable product paths and impractical research concepts.
System-Level Architecture & Co-Design:Drive the simultaneous optimization of algorithms and hardware. You will treat the algorithm and the compute engine as a unified design space, adapting neural architectures to exploit specific hardware capabilities while selecting the optimal compute substrate-from ultra-low-power MCUs to custom accelerators-to meet strict power and area constraints.
Thought Leadership:Maintain a deep awareness of the evolving AI hardware and algorithm landscape. You will bring the best ideas from the academic and industrial communities into ADI, mentoring junior engineers and guiding the team toward state-of-the-art compute paradigms.

The Ideal Candidate

You are aComputer Architectwith a deep appreciation for AI, or anAI Researcherwith a deep understanding of silicon. You think in terms of data movement, memory bandwidth, and energy-per-operation. You understand that hardware constraints shape algorithmic innovation, just as algorithmic needs must dictate architectural choices.

Educational & Professional Background:You hold a PhD specialized in Computer Architecture or Integrated Circuit Design for AI workloads, and have 3+ years of industry experience applying architectural principles to real-world engineering constraints.

Deep System-Level Hardware Expertise:You possess a profound understanding of how to organize computation and data. You have demonstrated this by either leading the design of complex AI SoCs, or by validating novel architectures through rigorous,cycle-accurate simulation. You operate at thestructural levelof the machine, optimizing memory hierarchies, on-chip networks, and execution models to solve complex data movement and efficiency challenges.

Edge AI & Model Optimization:You possess deep expertise inhardware-aware deep learning. You are proficient inmodern frameworks (PyTorch, JAX)and capable oftraining or fine-tuning modelsto validate architectural hypotheses. You go beyond standard backbones to master theoptimal mapping of computational graphs to silicon, orchestrating dataflow, tiling, and quantization (INT8, mixed-precision) to maximize arithmetic intensity within strict edge power budgets.
Innovation Mindset:You navigate the ambiguity of early-stage innovation with creative persistence, translating open challenges into concrete technical roadmaps. You excel at decision-making under uncertainty, justifying how your architectural trade-offs directly address the problem and create value.

You distinguish yourself with:

Proven Silicon Execution:You have successfully taped out a complex SoC or a custom AI accelerator. You understand the harsh reality of physical design-from timing closure to power delivery-and how these downstream constraints influence early-stage architectural decisions.
DSP & Signal Fluency:You are comfortable discussingFourier transforms, noise floors, and sampling rates. You understand the intersection ofclassical Digital Signal Processing (DSP)and deep learning, capable of architecting systems where neural networks and traditional signal chains work in concert to extract information from noisy physical data.
Hands-on RTL Experience:Familiarity with Verilog/SystemVerilog or modern hardware construction languages (Chisel, PyMTL) is a strong plus, even if youwill not be writingproduction RTL daily.
Strong publication record in top conferences and/or journals

For positions requiring access to technical data, Analog Devices, Inc. may have to obtain export licensing approval from the U.S. Department of Commerce - Bureau of Industry and Security and/or the U.S. Department of State - Directorate of Defense Trade Controls. As such, applicants for this position - except US Citizens, US Permanent Residents, and protected individuals as defined by 8 U.S.C. 1324b(a)(3) - may have to go through an export licensing review process.

Analog Devices is an equal opportunity employer. We foster a culture where everyone has an opportunity to succeed regardless of their race, color, religion, age, ancestry, national origin, social or ethnic origin, sex, sexual orientation, gender, gender identity, gender expression, marital status, pregnancy, parental status, disability, medical condition, genetic information, military or veteran status, union membership, and political affiliation, or any other legally protected group.

EEO is the Law: Notice of Applicant Rights Under the Law.

Job Req Type: ExperiencedRequired Travel: Yes, 10% of the timeShift Type: 1st Shift/DaysThe expected wage range for a new hire into this position is $172,000 to $236,500.

Actual wage offered may vary depending on work location, experience, education, training, external market data, internal pay equity, or other bona fide factors.
This position qualifies for a discretionary performance-based bonus which is based on personal and company factors.
This position includes medical, vision and dental coverage, 401k, paid vacation, holidays, and sick time, and other benefits.

Apply

Deep Learning Quantization Jobs in Massachusetts

Staff Research Scientist, AI-Hardware Co-Design

Staff Research Scientist, AI-Hardware Co-Design

Staff Research Scientist, AI-Hardware Co-Design

Staff Research Scientist, AI-Hardware Co-Design

Staff CV Applied Research Engineer, Edge AI

Staff CV Applied Research Engineer, Edge AI

Staff CV Applied Research Engineer, Edge AI

Staff CV Applied Research Engineer, Edge AI

Staff CV Applied Research Engineer, Edge AI

Staff CV Applied Research Engineer, Edge AI

Senior Manager, AI Inference

Senior Manager, AI Inference

Senior Software Engineer - C++ GPU Performance

Senior Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Senior Software Engineer - C++ GPU Performance

Senior Software Engineer - C++ GPU Performance

Edge Compute Architect for Optical Systems

Edge Compute Architect for Optical Systems

Edge Compute Architect for Optical Systems

Edge Compute Architect for Optical Systems

GPU HW Research Engineer (San Diego/Boxborough)

GPU HW Research Engineer (San Diego/Boxborough)

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

What is the difference between Deep Learning Quantization vs Machine Learning Engineer?

What is deep learning quantization?

What are some common challenges faced when implementing deep learning quantization in production environments?

Staff Research Scientist, AI-Hardware Co-Design

Share this job

Job description

Share this job