1

Ml Inference Jobs in Michigan (NOW HIRING)

Hands-on experience with major ML frameworks and inference runtimes. * Practical experience with model compression techniques and the associated accuracy/performance trade-offs. * Working knowledge ...

... model inference services. You will learn and apply new techniques from open source packages and ... Work spans classical ML through LLM systems. You improve search and retrieval quality using real ...

Machine Learning Engineer, App SW

Detroit, MI ยท Hybrid

$283K - $381K/yr

Familiarity with personalization, human behavior modeling, or driver intent inference. * Experience integrating ML systems into production hardware or multi-agent simulation. This role is a full-time ...

Demonstrated experience with MLOps principles and tools (e.g., Azure ML, AWS SageMaker, GCP AI ... Execute model inference against graph data to provide prescriptions for N-tier supplier risk and ...

Lead Architect- Automotive AI Cockpit

Plymouth, MI ยท On-site

$52.50 - $72/hr

Lead design and implementation of AI/ML pipelines, model optimization, deployment, and lifecycle ... Experience with on-device voice pipelines (ASR/TTS) and inference-serving stacks on various SoC ...

... ML systems across the full lifecycle, from data ingestion and training to model development and inference. This role closely partners with software engineers, IT operations and market segment ...

Lead Architect- Automotive AI Cockpit

Plymouth, MI ยท On-site

$52.50 - $72/hr

Lead design and implementation of AI/ML pipelines, model optimization, deployment, and lifecycle ... Experience with on-device voice pipelines (ASR/TTS) and inference-serving stacks on various SoC ...

Lead Architect- Automotive AI Cockpit

Plymouth, MI ยท On-site

$52.50 - $72/hr

Lead design and implementation of AI/ML pipelines, model optimization, deployment, and lifecycle ... Experience with on-device voice pipelines (ASR/TTS) and inference-serving stacks on various SoC ...

Execute model inference against graph data to provide prescriptions for N-tier supplier risk and ... Demonstrated experience with MLOps principles and tools (e.g., Azure ML, AWS SageMaker, Google ...

next page

Showing results 1-20

Ml Inference information

What is ML inference?

ML inference refers to the process of using a trained machine learning model to make predictions or decisions based on new data. After a model has been trained on historical data, inference is the phase where that model is deployed and used in real-world applications, such as recognizing speech, detecting objects in images, or recommending products. The focus in ML inference is on speed, efficiency, and scalability to ensure quick predictions, often in real time. This process is critical for practical applications like mobile apps, web services, and embedded systems. Optimizing inference involves reducing latency, memory usage, and computational requirements.

What is the difference between Ml Inference vs Data Scientist?

AspectML InferenceData Scientist
Required CredentialsKnowledge of machine learning models, programming skillsDegree in data science, statistics, or related fields
Work EnvironmentDeploying models in production, real-time data processingData analysis, model development, research
Industry UsageAI product deployment, software companiesResearch institutions, tech firms, consulting

ML Inference focuses on deploying trained models to make predictions on new data, often in real-time. Data Scientists develop and analyze models, working primarily in research and development. While both roles require understanding of machine learning, ML Inference emphasizes deployment and operationalization, whereas Data Scientists focus on model creation and analysis.

What are some common challenges faced by ML Inference Engineers when deploying models to production?

ML Inference Engineers often encounter challenges such as optimizing model latency and throughput to meet production requirements, ensuring compatibility with diverse hardware environments, and managing model versioning and updates without disrupting service. Additionally, balancing resource utilization and inference accuracy while monitoring real-time performance metrics is crucial. Collaboration with data scientists, DevOps, and software engineers is typically essential to streamline deployment and maintain robust, scalable inference pipelines.

What are the key skills and qualifications needed to thrive in ML Inference, and why are they important?

To thrive in ML Inference, you need a solid background in machine learning principles, programming (Python or C++), and experience with deploying models at scale, often supported by a degree in computer science or a related field. Familiarity with frameworks and tools such as TensorFlow, PyTorch, ONNX, and cloud platforms like AWS SageMaker or Google AI Platform is typically required. Strong problem-solving skills, attention to detail, and effective communication are crucial soft skills for collaborating with multidisciplinary teams and optimizing model performance. These skills ensure efficient, scalable, and reliable deployment of machine learning solutions in real-world applications.
What are popular job titles related to Ml Inference jobs in Michigan? For Ml Inference jobs in Michigan, the most frequently searched job titles are:
What job categories do people searching Ml Inference jobs in Michigan look for? The top searched job categories for Ml Inference jobs in Michigan are:
What cities in Michigan are hiring for Ml Inference jobs? Cities in Michigan with the most Ml Inference job openings:

ADAS Engineer

Astemo Ltd

Farmington Hills, MI โ€ข On-site

Full-time

Posted 12 days ago


Job description

Company Name:
ASTEMO AMERICAS, INC.
Job Family:
Engineering
Job Description:
Position Overview and Objective
Astemo's Advanced Development Division is hiring a Senior Engineer to own the deployment and optimization of AI/ML workloads, including ADAS perception models, LLMs, and Vision-Language-Action (VLA) models, on embedded automotive SoCs. The engineer will work across multiple areas as priorities evolve and is expected to contribute to both current development needs and emerging software initiatives.
Job Responsibilities:
  • Deploy and optimize AI/ML models on embedded automotive SoCs to meet performance, memory, and efficiency targets.
  • Apply advanced model optimization techniques while preserving accuracy and intended behavior.
  • Profile inference pipelines and tune workloads (kernels and model graphs) across heterogeneous compute resources for real-time use.
  • Develop orchestration and scheduling approaches for AI workloads across heterogeneous compute resources under real-time and power/thermal constraints.
  • Diagnose deployment-related issues and define validation approaches to evaluate new techniques.
  • Manage and evaluate trade-offs across accuracy, latency, throughput, memory footprint, and energy consumption; produce data-driven recommendations.
  • Collaborate with cross-functional teams to transition advanced work into the production stack.

Qualifications:
  • Knowledge of AD/ADAS systems and automotive hardware platforms.
  • Solid understanding of deep learning fundamentals and the numerical behavior of neural networks.
  • Hands-on experience with major ML frameworks and inference runtimes.
  • Practical experience with model compression techniques and the associated accuracy/performance trade-offs.
  • Working knowledge of embedded SoC architectures and their implications for ML workload performance.
  • Strong programming proficiency in C/C++ (modern C++ preferred) and Python.
  • Familiarity with profiling tools and a structured approach to performance analysis.
  • Exposure to low-level optimization techniques for AI workloads on embedded accelerators.
  • Flexibility and willingness to work across multiple software layers as project needs evolve.
  • Excellent communication and presentation skills, with the ability to influence and persuade stakeholders at all levels of the organization.

Additionally, the ability to work independently with minimal direction is required as are strong verbal and written communication skills. Experience with PCs and application software, such as MS Office tools, is also required.
Education: Master's or Ph.D. degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
Experience: Minimum of 5+ years with Masters and 1+ years with Ph.D. or relevant industrial experience is required.
Job level is determined by various factors such as organization size, responsibility, career stage, and capabilities.
Supervisory Responsibilities: n/a
Working conditions:
Physical Demands: Required to sit or stand for long periods of time. The employee may occasionally lift and/or move up to 25 pounds.
Travel: Domestic and international may be required as needed. The candidate will occasionally need to travel to multiple global locations to support project development
Equal Opportunity Employer (EOE) - Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender, identity, disability, protected veteran status and national origin.
At Astemo, we're challenging the status quo with the power of diversity, inclusion, and collaboration. Our goal is to build an inclusive work environment that celebrates the differences of our employees. We want to ensure that every employee feels valued, respected and empowered. We don't just accept difference-we celebrate it, we support it, and we thrive on it for the benefit of our employees, our products, and our community. Astemo is proud to be an equal opportunity employer.
If you need a reasonable accommodation to apply for a job at Astemo, please send the nature of the request and contact information to am-jobs@hitachiastemo.com when applying for the position.