Research Engineer
Mundelein, IL ยท On-site
Deep understanding of modern machine learning and deep learning techniques * Experience training ... Embeddings, Quantization, Model Compression, Infrastructure Engineering, Cloud Computing ...
Mundelein, IL ยท On-site
Deep understanding of modern machine learning and deep learning techniques * Experience training ... Embeddings, Quantization, Model Compression, Infrastructure Engineering, Cloud Computing ...
Mundelein, IL ยท On-site
Deep understanding of modern machine learning and deep learning techniques * Experience training ... Embeddings, Quantization, Model Compression, Infrastructure Engineering, Cloud Computing ...
Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...
Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...
Virginia, IL ยท Remote
Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...
Virginia, IL ยท Remote
Optimize model inference for production environments using quantization, pruning, and hardware ... Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face). * Hands-on ...
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Chicago, IL ยท On-site
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Quick apply
Chicago, IL ยท On-site
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Chicago, IL ยท On-site
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Chicago, IL ยท On-site
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Chicago, IL ยท On-site
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Chicago, IL ยท On-site
$126K - $166K/yr
Deep understanding of transformers, attention, and training dynamics * Strong Python plus PyTorch ... Inference optimization (quantization, speculative decoding, vLLM, Triton) * Experience shipping LLM ...
Cary, IL ยท On-site
Strong fundamentals in deep learning, transformers, and modern LLM mechanics (attention ... GPU performance tuning, quantization. * Coursework or projects in compilers, formal methods ...
Cary, IL ยท On-site
Strong fundamentals in deep learning, transformers, and modern LLM mechanics (attention ... GPU performance tuning, quantization. * Coursework or projects in compilers, formal methods ...
... product quantization is a plus. * Experience with embeddings, ANN/KNN, vector stores, database ... Foundational understanding of Natural Language Processing and Deep Learning. * Excellent problem ...
... product quantization is a plus. * Experience with embeddings, ANN/KNN, vector stores, database ... Foundational understanding of Natural Language Processing and Deep Learning. * Excellent problem ...
Chicago, IL ยท On-site +1
... product quantization is a plus. * Experience with embeddings, ANN/KNN, vector stores, database ... Foundational understanding of Natural Language Processing and Deep Learning. * Excellent problem ...
Chicago, IL ยท On-site +1
... product quantization is a plus. * Experience with embeddings, ANN/KNN, vector stores, database ... Foundational understanding of Natural Language Processing and Deep Learning. * Excellent problem ...
Chicago, IL ยท On-site +1
$107K - $147K/yr
Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy ... models via quantization, batching, and throughput tuning * Proficiency with inference ...
Quick apply
Chicago, IL ยท On-site +1
$107K - $147K/yr
Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy ... models via quantization, batching, and throughput tuning * Proficiency with inference ...
A singular technology platform powered by data and machine learning provides secure, differentiated ... Deep Hands-On AI/ML Expertise : Proven experience building applications with Large Language Models ...
A singular technology platform powered by data and machine learning provides secure, differentiated ... Deep Hands-On AI/ML Expertise : Proven experience building applications with Large Language Models ...
A singular technology platform powered by data and machine learning provides secure, differentiated ... Deep Hands-On AI/ML Expertise : Proven experience building applications with Large Language Models ...
A singular technology platform powered by data and machine learning provides secure, differentiated ... Deep Hands-On AI/ML Expertise : Proven experience building applications with Large Language Models ...
| Aspect | Deep Learning Quantization | Machine Learning Engineer |
|---|---|---|
| Required Credentials | Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks | Bachelor's or Master's in CS, Data Science, or related fields; programming skills |
| Work Environment | Research labs, AI development teams, hardware optimization settings | Software development teams, data-driven projects, product-focused environments |
| Industry Usage | AI hardware optimization, model deployment, edge computing | Model development, data analysis, software solutions across industries |
Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

Other
Medical, Dental, Vision, Retirement, PTO
Posted 14 days ago
Research Engineer, Foundation Models
About the Opportunity
We are seeking a Research Engineer to help advance the next generation of large-scale AI systems. This role sits at the intersection of research and engineering, focusing on the development, training, evaluation, and deployment of state-of-the-art machine learning models.
You will work across the full model lifecycle, from building large-scale datasets and training infrastructure to experimenting with new model architectures and inference techniques. This is an opportunity to contribute directly to cutting-edge work in large language models, reinforcement learning, long-context systems, and scalable AI infrastructure.
Responsibilities
Qualifications
Required
Preferred
What We Value
Compensation & Benefits
Keywords:
Machine Learning, Artificial Intelligence, Deep Learning, Large Language Models, LLMs, Foundation Models, Generative AI, Applied AI, AI Research, Research Engineering, Model Training, Distributed Training, Pretraining, Fine-Tuning, Post-Training, Reinforcement Learning, RLHF, Reinforcement Learning from Human Feedback, Inference Optimization, Model Serving, Model Evaluation, Long Context Models, Reasoning Models, AI Infrastructure, GPU Clusters, High Performance Computing, HPC, Distributed Systems, CUDA, PyTorch, JAX, TensorFlow, Neural Networks, Transformer Models, Retrieval Augmented Generation, RAG, Synthetic Data, Data Engineering, Data Pipelines, ETL, Data Processing, Web Crawling, Data Collection, Feature Engineering, MLOps, ML Systems, Scalable Systems, Parallel Computing, Model Architecture Design, Experimentation, Research Scientists, Research Engineers, Software Engineering, Backend Engineering, Performance Optimization, Production ML, AI Agents, Agentic AI, Autonomous Systems, Prompt Engineering, Multi-Agent Systems, Vector Databases, Embeddings, Quantization, Model Compression, Infrastructure Engineering, Cloud Computing, Kubernetes, Python, C++, Open Source AI, Frontier Models, Applied Research, Statistical Learning, Computer Science, Algorithms, Large Scale Computing, Model Alignment, AI Safety, Training Infrastructure, Compute Optimization, Inference Systems, Foundation Model Research, Machine Learning Infrastructure, AI Platform Engineering, Systems Engineering, Data Infrastructure, Production Systems, Scalable AI Systems, Research & Development, Advanced AI Systems, Emerging Technologies, Distributed Computing, GPU Optimization, AI Product Development,