... compress cycle time using modern tooling-including AI-without losing rigor. What You'll Be Doing ... Design and implement automation tools for system speed modeling; apply AI and LLM-assisted ...
... compress cycle time using modern tooling-including AI-without losing rigor. What You'll Be Doing ... Design and implement automation tools for system speed modeling; apply AI and LLM-assisted ...
Senior Machine Learning Engineer - Fine-Tuning and On-device AI
Palo Alto, CA ยท On-site
$120K - $215K/yr
About the Role We are seeking a Senior Machine Learning Engineer to lead the fine-tuning ... Prune, quantize and compress models (e.g., INT8, INT4, mixed-precision) for CPU, GPU, NPU and edge ...
Senior Machine Learning Engineer - Fine-Tuning and On-device AI
Palo Alto, CA ยท On-site
$120K - $215K/yr
About the Role We are seeking a Senior Machine Learning Engineer to lead the fine-tuning ... Prune, quantize and compress models (e.g., INT8, INT4, mixed-precision) for CPU, GPU, NPU and edge ...
... compress development timelines while de-risking production decisions. ABOUT THE JOB As an ... You will use simulation and modeling tools to visualize production systems, identify bottlenecks ...
... compress development timelines while de-risking production decisions. ABOUT THE JOB As an ... You will use simulation and modeling tools to visualize production systems, identify bottlenecks ...
Tool Designer / CAD Engineer
Boulder, CO ยท On-site
$70K - $105K/yr
... in CAD modeling for softgoods (2D patterns & 3D geometry) using tools such as Rhino, SolidWorks ... compress build cycles and reduce rework Cross Functional Collaboration โข Work closely with ...
Tool Designer / CAD Engineer
Boulder, CO ยท On-site
$70K - $105K/yr
... in CAD modeling for softgoods (2D patterns & 3D geometry) using tools such as Rhino, SolidWorks ... compress build cycles and reduce rework Cross Functional Collaboration โข Work closely with ...
GTM Engineer
Madison, WI ยท Remote
Use AI tools (Claude, Cursor, etc.) to compress reporting and analysis cycles--drafting queries ... Deep Salesforce fluency: data model, flows, custom fields/objects, reports, and the difference ...
Quick apply
GTM Engineer
Madison, WI ยท Remote
Use AI tools (Claude, Cursor, etc.) to compress reporting and analysis cycles--drafting queries ... Deep Salesforce fluency: data model, flows, custom fields/objects, reports, and the difference ...
Check drawing, model, simulation quality, and/or engineering calculations generated by others ... Codeware COMPRESS. Must have a working knowledge of ERP systems and proficiency using Microsoft ...
Check drawing, model, simulation quality, and/or engineering calculations generated by others ... Codeware COMPRESS. Must have a working knowledge of ERP systems and proficiency using Microsoft ...
Design and Support Engineer
$80K - $100K/yr
Check drawing, model, simulation quality, and/or engineering calculations generated by others ... Codeware COMPRESS. Must have a working knowledge of ERP systems and proficiency using Microsoft ...
Design and Support Engineer
$80K - $100K/yr
Check drawing, model, simulation quality, and/or engineering calculations generated by others ... Codeware COMPRESS. Must have a working knowledge of ERP systems and proficiency using Microsoft ...
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA ยท On-site +1
$118K - $156K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Quick apply
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA ยท On-site +1
$118K - $156K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Boston, MA ยท On-site +1
$133K - $175K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Boston, MA ยท On-site +1
$133K - $175K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA ยท On-site +1
$118K - $156K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA ยท On-site +1
$118K - $156K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Product Designer, SeekrFlow
Austin, TX ยท On-site
AI/ML engineers, data scientists, domain experts, and enterprise stakeholders. You'll work within a ... models, and validated product direction * AI-accelerated design execution: Use AI tools to compress ...
Senior Product Designer, SeekrFlow
Austin, TX ยท On-site
AI/ML engineers, data scientists, domain experts, and enterprise stakeholders. You'll work within a ... models, and validated product direction * AI-accelerated design execution: Use AI tools to compress ...
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV ยท On-site +1
$117K - $154K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV ยท On-site +1
$117K - $154K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
$125K - $165K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
$125K - $165K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV ยท On-site
$117K - $154K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV ยท On-site
$117K - $154K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Boston, MA ยท On-site +1
$133K - $175K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Quick apply
Senior Machine Learning Engineer, Data Mining
Boston, MA ยท On-site +1
$133K - $175K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV ยท On-site +1
$117K - $154K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Quick apply
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV ยท On-site +1
$117K - $154K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
San Francisco, CA ยท On-site +1
$144K - $190K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Quick apply
Senior Machine Learning Engineer, Data Mining
San Francisco, CA ยท On-site +1
$144K - $190K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA ยท On-site
$118K - $156K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA ยท On-site
$118K - $156K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Edge AI Perception Engineer
Arvada, CO ยท On-site
$150K - $250K/yr
Implement, compress, and optimize models (pruning, quantization, scheduling) to run on GPU, DLA ... Camera Systems Engineering * Implement real-time dewarped camera pipelines using GMSL drivers, VIC ...
Senior Edge AI Perception Engineer
Arvada, CO ยท On-site
$150K - $250K/yr
Implement, compress, and optimize models (pruning, quantization, scheduling) to run on GPU, DLA ... Camera Systems Engineering * Implement real-time dewarped camera pipelines using GMSL drivers, VIC ...
Senior Machine Learning Engineer, Data Mining
Boston, MA ยท On-site
$133K - $175K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Senior Machine Learning Engineer, Data Mining
Boston, MA ยท On-site
$133K - $175K/yr
Your work will directly influence how we compress knowledge into efficient encoders for fast search ... Ensure student models maintain high accuracy while drastically reducing inference latency and ...
Model Compress Engineer information
See salary details
$38K - $48.2K
17% of jobs
$58.1K is the 25th percentile. Wages below this are outliers.
$48.2K - $58.5K
8% of jobs
$58.5K - $68.7K
0% of jobs
$68.7K - $78.9K
2% of jobs
$78.9K - $89.1K
7% of jobs
The median wage is $94.4K / yr.
$89.1K - $99.4K
29% of jobs
$103.6K is the 75th percentile. Wages above this are outliers.
$99.4K - $109.6K
26% of jobs
$109.6K - $119.8K
5% of jobs
$119.8K - $130K
0% of jobs
$130K - $140.3K
2% of jobs
$140.3K - $150.5K
2% of jobs
$38K
$90.5K
$150.5K
How much do model compress engineer jobs pay per year?
What are some typical challenges faced by a Model Compress Engineer when optimizing machine learning models for deployment?
What are the key skills and qualifications needed to thrive as a Model Compression Engineer, and why are they important?
What is a Model Compress Engineer?

Full-time
Posted 9 days ago
Job description
What You'll Be Doing:
- Collaborate cross-functionally with system architects, hardware, firmware/software, process/reliability, and operations teams to co-design system-level speed features and deliver industry-defining products.
- Define System level specifications, margins, bounding box constraints that satisfy design expectations and product quality.
- Provide system requirements for hardware and features affecting speed and reliability, from pre-silicon through productization.
- Translate hardware features and architectural requirements into validation techniques that achieve full coverage across testing flows.
- Perform closed loop validation by correlating silicon behavior against timing simulation and design expectations; provide actionable feedback to improve future designs.
- Define, prototype, and refine pre- and post-silicon bring-up flows to ensure product quality, performance, and schedule efficiency.
- Design and implement automation tools for system speed modeling; apply AI and LLM-assisted workflows (e.g., automated log analysis, pattern detection, scripting acceleration) to compress characterization and debug cycles.
- Architect and influence testability features critical to performance, power, and reliability in partnership with design, DFx, and ATE teams.
- Lead debug of complex silicon and system-level issues, including show-stopper defects, to enable on-time product shipment.
What We Need to See:
- MS in EE, CE, Systems Engineering, or equivalent experience.
- 4+ years of experience in a related hardware engineering role.
- Hands-on experience with silicon bring-up, frequency and power characterization, PPA analysis in pre- and post-silicon phases, System/Platform level understanding, tester-to-system correlation, and lab instrumentation (oscilloscopes, multimeters, DAQs).
- Scripting proficiency in Python and/or Perl; comfortable in Windows, Linux, and Android environments.
- Familiarity with statistical methods and data analysis tools (JMP or equivalent).
- Demonstrated use of AI or LLM-based tools (e.g., Claude, Copilot, ChatGPT) in an engineering workflow-scripting acceleration, log triage, data analysis-with clear judgment about output validation and where automation introduces risk.
Ways to stand out from the crowd:
- Background in gaming, automotive, or datacenter segments.
- Experience building or deploying AI-assisted characterization, log analysis, or debug automation workflows in a production silicon environment.
- Familiarity with LLM evaluation, prompt engineering, or agentic scripting pipelines applied to silicon data analysis.
Our team is at the forefront of silicon innovation, advancing groundbreaking technologies. We offer a dynamic work environment where your contributions will directly impact the company's success. Join us to advance your career in a role where you can truly make a difference. With competitive salaries and a generous benefits package, we are widely considered one of the technology industry's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us, and due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 218,500 USD for Level 3, and 168,000 USD - 264,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until June 1, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
About Nvidia
Sourced by ZipRecruiter
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology--and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Santa Clara, CA, US
Year founded
1993