1

Model Compress Engineer Jobs (NOW HIRING)

Generating 3D models and 2D drawings for fabrication projects Qualifications * Bachelor's degree in ... Knowledge of ASME design software like Codeware Compress, DesignCalc, PV Elite, etc. * Strong ...

Generating 3D models and 2D drawings for fabrication projects Qualifications * Bachelor's degree in ... Knowledge of ASME design software like Codeware Compress, DesignCalc, PV Elite, etc. * Strong ...

API Engineer

Geismar, LA

$100K - $120K/yr

Centennial is seeking an API Design Engineer to join a specialized engineering firm in Geismar ... Compress (required). * Experience with modeling tools such as Autodesk Inventor, AutoCAD ...

API Design Engineer

Geismar, LA · On-site

$80K - $109K/yr

Centennial is seeking an API Design Engineer to join a specialized engineering firm in Geismar ... Compress (required). * Experience with modeling tools such as Autodesk Inventor, AutoCAD ...

API Design Engineer

Geismar, LA

$80K - $109K/yr

Centennial is seeking an API Design Engineer to join a specialized engineering firm in Geismar ... Compress (required). * Experience with modeling tools such as Autodesk Inventor, AutoCAD ...

Centennial is seeking an API Design Engineer to join a specialized engineering firm in Geismar ... Compress (required). * Experience with modeling tools such as Autodesk Inventor, AutoCAD ...

Collaborate closely with Designers and Modelers. Validate dimensional change request for nozzles ... Be able to complete ASME Section VIII, Div 1 calculations using the COMPRESS software and through ...

... model and drawings for assigned projects in accordance with Westerman's standard operating ... Codeware COMPRESS • Other duties as required and assigned QUALIFICATIONS To perform this job ...

... model and drawings for assigned projects in accordance with Westerman's standard operating ... Codeware COMPRESS • Other duties as required and assigned QUALIFICATIONS To perform this job ...

next page

Showing results 1-20

Model Compress Engineer information

See salary details

$38K

$90.5K

$150.5K

How much do model compress engineer jobs pay per year?

As of Jun 5, 2026, the average yearly pay for model compress engineer in the United States is $90,538.00, according to ZipRecruiter salary data. Most workers in this role earn between $71,500.00 and $100,000.00 per year, depending on experience, location, and employer.

What are some typical challenges faced by a Model Compress Engineer when optimizing machine learning models for deployment?

Model Compress Engineers often encounter challenges such as maintaining model accuracy while significantly reducing size and computational requirements. Balancing the trade-offs between compression rate, latency, and performance can be complex, especially when deploying models to resource-constrained environments like mobile devices or embedded systems. Additionally, integrating compressed models into existing production pipelines and ensuring compatibility across diverse hardware platforms can require close collaboration with data scientists, ML engineers, and software developers.

What are the key skills and qualifications needed to thrive as a Model Compression Engineer, and why are they important?

To thrive as a Model Compression Engineer, you need a strong background in machine learning, deep learning frameworks (such as TensorFlow or PyTorch), and a solid understanding of neural network architectures, usually supported by a degree in computer science or a related field. Familiarity with model compression techniques like pruning, quantization, knowledge distillation, and experience with relevant tools and libraries (e.g., ONNX, TensorRT) are essential. Strong problem-solving abilities, collaboration, and effective communication skills help in translating research into practical, efficient solutions. These skills are crucial for optimizing AI models to run efficiently on resource-constrained devices, improving deployment speed, and reducing computational costs.

What is a Model Compress Engineer?

A Model Compress Engineer is a professional who specializes in reducing the size and computational requirements of machine learning models without significantly impacting their performance. This role involves applying advanced techniques such as model pruning, quantization, knowledge distillation, and other optimization methods to make models more efficient. Model compress engineers are crucial for deploying AI models on resource-constrained devices like smartphones, IoT devices, and edge computing platforms. Their work helps improve inference speed, reduce memory usage, and lower energy consumption, making AI solutions more accessible and scalable.
Infographic showing various Model Compress Engineer job openings in the United States as of May 2026, with employment types broken down into 93% Full Time, and 7% Contract. Highlights an 93% In-person, and 7% Remote job distribution, with an average salary of $90,538 per year, or $43.5 per hour.

AI Specialist (AI Engineering)

Hyphen Connect Limited

San Francisco, CA • On-site

Full-time

Posted 13 days ago


Job description

We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
  • Compress and optimize large language and vision models for on-device inference.
  • Develop pipelines for model distillation and hardware-specific compilation.
  • Benchmark performance across various NPU/GPU architectures.

Qualifications:
  • Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
  • Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
  • Strong C++ and Python skills.