AI/ ML Engineer
$120K - $125K/yr
Strong grasp of deep learning architectures including CNNs, RNNs, Transformers, and multimodal ... post training quantization, ONNX, CoreML, and edge deployment. * Understanding of software ...
$120K - $125K/yr
Strong grasp of deep learning architectures including CNNs, RNNs, Transformers, and multimodal ... post training quantization, ONNX, CoreML, and edge deployment. * Understanding of software ...
$120K - $125K/yr
Strong grasp of deep learning architectures including CNNs, RNNs, Transformers, and multimodal ... post training quantization, ONNX, CoreML, and edge deployment. * Understanding of software ...
$43.27 - $61/hr
Proficiency in Python and deep learning frameworks (PyTorch or TensorFlow) * Experience designing ... Familiarity with inference optimization (TensorRT, ONNX, quantization techniques) * Ability to work ...
Quick apply
$43.27 - $61/hr
Proficiency in Python and deep learning frameworks (PyTorch or TensorFlow) * Experience designing ... Familiarity with inference optimization (TensorRT, ONNX, quantization techniques) * Ability to work ...
$43.27 - $61/hr
Proficiency in Python and deep learning frameworks (PyTorch or TensorFlow) * Experience designing ... Familiarity with inference optimization (TensorRT, ONNX, quantization techniques) * Ability to work ...
Quick apply
$43.27 - $61/hr
Proficiency in Python and deep learning frameworks (PyTorch or TensorFlow) * Experience designing ... Familiarity with inference optimization (TensorRT, ONNX, quantization techniques) * Ability to work ...
Dallas, TX · On-site
... quantization, and deployment efficiency - for production, including frameworks such as vLLM ... deep-learning tooling; experience with distributed training frameworks such as DeepSpeed, FSDP ...
Dallas, TX · On-site
... quantization, and deployment efficiency - for production, including frameworks such as vLLM ... deep-learning tooling; experience with distributed training frameworks such as DeepSpeed, FSDP ...
Dallas, TX · On-site
... quantization, and deployment efficiency - for production, including frameworks such as vLLM ... deep-learning tooling; experience with distributed training frameworks such as DeepSpeed, FSDP ...
Dallas, TX · On-site
... quantization, and deployment efficiency - for production, including frameworks such as vLLM ... deep-learning tooling; experience with distributed training frameworks such as DeepSpeed, FSDP ...
Dallas, TX · On-site +1
$89K - $123K/yr
Apply advanced inference optimization techniques (quantization, pruning, ONNX Runtime) and memory ... Deep, hands-on experience with the AWS ecosystem, specifically AWS ECS and Lambda . Solid ...
Dallas, TX · On-site +1
$89K - $123K/yr
Apply advanced inference optimization techniques (quantization, pruning, ONNX Runtime) and memory ... Deep, hands-on experience with the AWS ecosystem, specifically AWS ECS and Lambda . Solid ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Quick apply
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
Dallas, TX · On-site +1
$141K - $249K/yr
Examples include designing new CUDA kernels, quantization-aware training and inference, and ... deep learning frameworks such as PyTorch or Jax. - Skilled in profiling CPU and GPU code using ...
$85 - $110/hr
Deep understanding of LLMs, embeddings, vector databases (e.g., FAISS, Pinecone, Weaviate ... Use techniques like quantization, distillation, and caching to improve efficiency.
Quick apply
$85 - $110/hr
Deep understanding of LLMs, embeddings, vector databases (e.g., FAISS, Pinecone, Weaviate ... Use techniques like quantization, distillation, and caching to improve efficiency.
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
$21.6K is the 25th percentile. Wages below this are outliers.
$10.9K - $22.5K
27% of jobs
$22.5K - $34.1K
0% of jobs
$34.1K - $45.7K
0% of jobs
$45.7K - $57.3K
0% of jobs
$57.3K - $68.9K
0% of jobs
The median wage is $79.5K / yr.
$68.9K - $80.5K
25% of jobs
$80.5K - $92.1K
18% of jobs
$100.4K is the 75th percentile. Wages above this are outliers.
$92.1K - $103.7K
7% of jobs
$103.7K - $115.3K
2% of jobs
$115.3K - $126.9K
0% of jobs
$126.9K - $138.5K
21% of jobs
$10.9K
$83K
$138.5K
| Aspect | Deep Learning Quantization | Machine Learning Engineer |
|---|---|---|
| Required Credentials | Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks | Bachelor's or Master's in CS, Data Science, or related fields; programming skills |
| Work Environment | Research labs, AI development teams, hardware optimization settings | Software development teams, data-driven projects, product-focused environments |
| Industry Usage | AI hardware optimization, model deployment, edge computing | Model development, data analysis, software solutions across industries |
Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.
Location: Irving, TX
Salary Range: $120,000 - $125,000 a year (plus full time benefits)
Job DescriptionSourced by ZipRecruiter
Diverse Lynx, based in Princeton, NJ, US, is a reputable company in the Information Technology sector. The firm, as reflected through its website diverselynx.com, specializes in delivering comprehensive IT solutions. These solutions range from IT consulting to robust digital transformation strategies, IT staffing, and full-time placements services. The company was established in 2008, and it prides itself on providing simplified, efficient technology solutions designed to meet the unique needs of each client.
It services
51 - 200 Employees
Princeton, NJ, US
2002