Deep Learning Research Intern
$18 - $59/hr
Deep Learning Research Intern (Embodied AI, Multimodal Foundation Models & Efficient Systems) About ... Model compression, pruning, quantization, and distillation * Efficient inference and deployment ...
$18 - $59/hr
Deep Learning Research Intern (Embodied AI, Multimodal Foundation Models & Efficient Systems) About ... Model compression, pruning, quantization, and distillation * Efficient inference and deployment ...
$18 - $59/hr
Deep Learning Research Intern (Embodied AI, Multimodal Foundation Models & Efficient Systems) About ... Model compression, pruning, quantization, and distillation * Efficient inference and deployment ...
San Francisco, CA · On-site
$161K - $175K/yr
Deep Learning Engineer II POSITION DUTIES: Lead the research, development, and deployment of ... Drive innovation in model compression, quantization, and efficient inference techniques to optimize ...
San Francisco, CA · On-site
$161K - $175K/yr
Deep Learning Engineer II POSITION DUTIES: Lead the research, development, and deployment of ... Drive innovation in model compression, quantization, and efficient inference techniques to optimize ...
San Jose, CA · On-site
$18 - $59/hr
Deep Learning Research Intern (Embodied AI, Multimodal Foundation Models & Efficient Systems) About ... Model compression, pruning, quantization, and distillation * Efficient inference and deployment ...
San Jose, CA · On-site
$18 - $59/hr
Deep Learning Research Intern (Embodied AI, Multimodal Foundation Models & Efficient Systems) About ... Model compression, pruning, quantization, and distillation * Efficient inference and deployment ...
NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join ... A background in pruning, quantization, NAS, efficient backbones is required. * Experience with ...
NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join ... A background in pruning, quantization, NAS, efficient backbones is required. * Experience with ...
Santa Clara, CA · On-site
$115K - $147K/yr
NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join ... A background in pruning, quantization, NAS, efficient backbones is required. * Experience with ...
Santa Clara, CA · On-site
$115K - $147K/yr
NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join ... A background in pruning, quantization, NAS, efficient backbones is required. * Experience with ...
Degree or equivalent experience in Computer Science, Machine Learning, Robotics, Computer Vision ... Deep expertise in the theory and low-level implementation of modern quantization algorithms (e.g ...
Degree or equivalent experience in Computer Science, Machine Learning, Robotics, Computer Vision ... Deep expertise in the theory and low-level implementation of modern quantization algorithms (e.g ...
We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR ... Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and ...
We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR ... Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and ...
Santa Clara, CA · On-site
$115K - $147K/yr
We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR ... Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and ...
Santa Clara, CA · On-site
$115K - $147K/yr
We are now looking for an Applied Deep Learning Research Scientist, Efficiency! Join our ADLR ... Topics include quantization/sparsity/optimizers/reinforcement learning, efficient architectures and ...
$123K - $169K/yr
We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...
$123K - $169K/yr
We are seeking a highly skilled Senior Deep Learning Engineer to drive the development and ... Proficiency in model optimization techniques such as quantization, pruning, and knowledge ...
We are looking for outstanding Senior Deep Learning Software Engineers to develop and productize ... Working across a wide range of abstractions from model fine-tuning and quantization to low-level ...
We are looking for outstanding Senior Deep Learning Software Engineers to develop and productize ... Working across a wide range of abstractions from model fine-tuning and quantization to low-level ...
Preferred : • 3+ years of industry experience in machine learning, deep learning, or AI ... quantization, mixed-precision, sub-4-bit methods • Hands-on experience quantizing LLMs (GPT ...
Preferred : • 3+ years of industry experience in machine learning, deep learning, or AI ... quantization, mixed-precision, sub-4-bit methods • Hands-on experience quantizing LLMs (GPT ...
Preferred : • 3+ years of industry experience in machine learning, deep learning, or AI ... quantization, mixed-precision, sub-4-bit methods • Hands-on experience quantizing LLMs (GPT ...
Preferred : • 3+ years of industry experience in machine learning, deep learning, or AI ... quantization, mixed-precision, sub-4-bit methods • Hands-on experience quantizing LLMs (GPT ...
Santa Clara, CA · On-site
$143K - $189K/yr
We are looking for outstanding Senior Deep Learning Software Engineers to develop and productize ... Working across a wide range of abstractions from model fine-tuning and quantization to low-level ...
Santa Clara, CA · On-site
$143K - $189K/yr
We are looking for outstanding Senior Deep Learning Software Engineers to develop and productize ... Working across a wide range of abstractions from model fine-tuning and quantization to low-level ...
Implement advanced quantization techniques including weight-only quantization, activation ... Preferred Qualifications: * 3+ years of industry experience in machine learning, deep learning, or ...
Implement advanced quantization techniques including weight-only quantization, activation ... Preferred Qualifications: * 3+ years of industry experience in machine learning, deep learning, or ...
Mountain View, CA · On-site
$124K - $170K/yr
Responsibilities : • Design and implement advanced deep learning architectures to enhance ... quantization, pruning, and knowledge distillation. • Doctorate (Ph.D.) in Computer Science ...
Mountain View, CA · On-site
$124K - $170K/yr
Responsibilities : • Design and implement advanced deep learning architectures to enhance ... quantization, pruning, and knowledge distillation. • Doctorate (Ph.D.) in Computer Science ...
Implement advanced quantization techniques including weight-only quantization, activation ... Preferred Qualifications: * 3+ years of industry experience in machine learning, deep learning, or ...
Implement advanced quantization techniques including weight-only quantization, activation ... Preferred Qualifications: * 3+ years of industry experience in machine learning, deep learning, or ...
Santa Clara, CA · On-site
$143K - $189K/yr
... like quantization, scheduling, memory management, and distributed inference to set the gold ... Scale performance of deep learning models across different architectures and types of NVIDIA ...
Santa Clara, CA · On-site
$143K - $189K/yr
... like quantization, scheduling, memory management, and distributed inference to set the gold ... Scale performance of deep learning models across different architectures and types of NVIDIA ...
Work with deep learning compiler and architecture teams to analyze and validate sophisticated ... DL model internals depth: experience with quantization, operator fusion, mixed-precision, or graph ...
Work with deep learning compiler and architecture teams to analyze and validate sophisticated ... DL model internals depth: experience with quantization, operator fusion, mixed-precision, or graph ...
Santa Clara, CA · On-site
Deep Learning Expertise: Strong familiarity with PyTorch and deep knowledge of inference engines like TensorRT , ONNX Runtime, or TVM. * Quantization Depth: Hands-on experience with INT8/FP8/INT4 ...
Santa Clara, CA · On-site
Deep Learning Expertise: Strong familiarity with PyTorch and deep knowledge of inference engines like TensorRT , ONNX Runtime, or TVM. * Quantization Depth: Hands-on experience with INT8/FP8/INT4 ...
Santa Clara, CA · Hybrid
$143K - $189K/yr
... like quantization, scheduling, memory management, and distributed inference to set the gold ... Scale performance of deep learning models across different architectures and types of NVIDIA ...
Santa Clara, CA · Hybrid
$143K - $189K/yr
... like quantization, scheduling, memory management, and distributed inference to set the gold ... Scale performance of deep learning models across different architectures and types of NVIDIA ...
| Aspect | Deep Learning Quantization | Machine Learning Engineer |
|---|---|---|
| Required Credentials | Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks | Bachelor's or Master's in CS, Data Science, or related fields; programming skills |
| Work Environment | Research labs, AI development teams, hardware optimization settings | Software development teams, data-driven projects, product-focused environments |
| Industry Usage | AI hardware optimization, model deployment, edge computing | Model development, data analysis, software solutions across industries |
Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

$18 - $59/hr
Other
Posted 11 days ago
Deep Learning Research Intern
(Embodied AI, Multimodal Foundation Models & Efficient Systems)
About Us
Futurewei is a well-funded independent research organization with a long history of R&D innovation in Silicon Valley. We are committed to open-source development, fundamental research, and advancing next-generation intelligent systems through collaboration and standards development.
About the Role
We are seeking a strong deep learning research intern to join our ASID team in San Jose, CA. This role focuses on building learning systems for embodied intelligence, emphasizing how multimodal foundation models can be trained, compressed, and deployed efficiently in embodied and interactive environments.
Our work goes beyond static perception. We study intelligence grounded in embodied experience-the interaction of perception, action, and environment over time-while ensuring models remain efficient, scalable, and deployable in real-world systems.
Core Research Focus Areas
The intern will contribute to one or more of the following interconnected research directions:
1. Multimodal Foundation Models
Fine-tuning and adaptation of large language models (LLMs), vision-language models (VLMs), and vision-language-action (VLA) models
Multimodal representation learning across vision, language, and action
Grounding foundation models in embodied experience and temporal interaction
2. Neural (Generative) Image and Video Compression
Learning-based image and video compression models
Efficient visual representations for perception and downstream embodied tasks
Joint optimization of compression efficiency, reconstruction quality, and task relevance
3. Embodied AI
Learning frameworks that couple perception, action, and environment dynamics
World models, predictive learning, and agent-centric representations
Embodied learning in simulation or real-world-inspired environments
4. Model Compression & Inference Acceleration for Embodied Systems
Model compression, pruning, quantization, and distillation
Efficient inference and deployment strategies for embodied and real-time applications
Hardware- and system-aware optimization for edge or robotic platforms
Responsibilities
Conduct research in one or more of the focus areas above
Design and implement learning algorithms and experimental pipelines
Develop prototype systems or demos for embodied and multimodal AI applications
Collaborate closely with researchers in a fast-paced, research-driven environment
Qualifications
MS or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, Robotics, Mathematics, or a related field
Strong foundation in machine learning and deep learning
Experience or strong interest in multimodal models, embodied AI, compression, or efficient inference
Proficiency with PyTorch; experience with HuggingFace or similar frameworks is a plus
Solid Python programming skills
Research experience with publications in top conferences or journals preferred
Strong communication skills and ability to work effectively in a global research team
Location: San Jose, CA
Hourly interns pay range: $18 to $59, depending on degree-seeking academic program (PhD, Master's, Bachelor's, etc.), years of relevant experience, year in school, geographic location, credentials, qualifications, and other job-related factors.
Housing allowance and relocation benefit might be provided to intern candidates who meet the qualifications. Additional details on the compensation package will be provided to candidates during the interview process.
Employment Type: InternSourced by ZipRecruiter
Telecommunications
201 - 500 Employees
Santa Clara, CA, US
2001