The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...
We're looking for someone who combines deep technical expertise in generative AI with a proven ... learning). * Model Optimization: Expertise in model compression and quantization methods (AWQ, GPTQ ...
We're looking for someone who combines deep technical expertise in generative AI with a proven ... learning). * Model Optimization: Expertise in model compression and quantization methods (AWQ, GPTQ ...
Senior Gen AI Developer - Vice President
Irving, TX · On-site
$125.76K - $188.64K/yr
You will collaborate with cross-functional teams, contribute deep technical expertise, and play a ... Proficient with machine learning frameworks (PyTorch, TensorFlow, Keras) and distributed training.
Senior Gen AI Developer - Vice President
Irving, TX · On-site
$125.76K - $188.64K/yr
You will collaborate with cross-functional teams, contribute deep technical expertise, and play a ... Proficient with machine learning frameworks (PyTorch, TensorFlow, Keras) and distributed training.
Engineering Lead Analyst - Vice President
Irving, TX · On-site
$188/hr
As a Senior Software Engineer, you will need to demonstrate a deep understanding of user needs and ... Utilize Python for scripting, automation, data processing, machine learning integration, and API ...
Engineering Lead Analyst - Vice President
Irving, TX · On-site
$188/hr
As a Senior Software Engineer, you will need to demonstrate a deep understanding of user needs and ... Utilize Python for scripting, automation, data processing, machine learning integration, and API ...
Deep Learning Quantization information
See Dallas, TX salary details
$21.6K is the 25th percentile. Wages below this are outliers.
$10.9K - $22.5K
27% of jobs
$22.5K - $34.1K
0% of jobs
$34.1K - $45.7K
0% of jobs
$45.7K - $57.3K
0% of jobs
$57.3K - $68.9K
0% of jobs
The median wage is $79.5K / yr.
$68.9K - $80.5K
25% of jobs
$80.5K - $92.1K
18% of jobs
$100.4K is the 75th percentile. Wages above this are outliers.
$92.1K - $103.7K
7% of jobs
$103.7K - $115.3K
2% of jobs
$115.3K - $126.9K
0% of jobs
$126.9K - $138.5K
21% of jobs
$10.9K
$83K
$138.5K
How much do deep learning quantization jobs pay per year?
What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?
What are some common challenges faced when implementing deep learning quantization in production environments?
What is deep learning quantization?
What is the difference between Deep Learning Quantization vs Machine Learning Engineer?
| Aspect | Deep Learning Quantization | Machine Learning Engineer |
|---|---|---|
| Required Credentials | Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks | Bachelor's or Master's in CS, Data Science, or related fields; programming skills |
| Work Environment | Research labs, AI development teams, hardware optimization settings | Software development teams, data-driven projects, product-focused environments |
| Industry Usage | AI hardware optimization, model deployment, edge computing | Model development, data analysis, software solutions across industries |
Deep Learning Quantization focuses on reducing model size and improving inference speed through techniques like weight and activation quantization, often in hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, Deep Learning Quantization is more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.
- Contract Software Engineer Remote
- Remote Rf Optimization Engineer
- Contract Compiler Engineer
- Senior Machine Learning Software Engineer
- Mlops Engineer Salary
- Full Time Machine Learning Engineer New Grad
- Gpu Compiler Engineer
- Weekend Remote Ai Engineer
- Assistant Compiler Software Engineer
- Temporary Ai Programmer
Other
Medical, Dental, Vision, Retirement, PTO
Posted 3 days ago
Job description
Role Summary:
We are seeking a Generative AI Engineer to build, optimize, and scale production-ready AI applications. You will design complex multi-agent systems, implement advanced RAG pipelines, and manage the deployment of both frontier and local LLMs. The ideal candidate blends deep machine learning expertise with modern software engineering practices.
Technical Stack:
LLMs: Gemini, OpenAI, Claude, Llama, and Local Model deployment.
Frameworks: LangChain, LlamaIndex, and Hugging Face.
Orchestration: LangGraph and Multi-Agent Systems (MAS).
Development: Python, FastAPI, and Asynchronous Programming.
RAG & Data: PostgreSQL, Vector Databases, and Advanced Retrieval strategies.
ML/DL: PyTorch, TensorFlow, and Model Fine-tuning.
Deployment: Docker, Production API management, and LLM monitoring.
Tools: Prompt Engineering, Workflow Design, and GenAI Optimization.
Key Responsibilities:
Develop and orchestrate sophisticated AI workflows using LangGraph and multi-agent architectures.
Build and maintain Advanced RAG systems utilizing LlamaIndex and vector databases for high-accuracy retrieval.
Integrate and swap diverse LLMs (commercial and open-source) based on performance and cost requirements.
Design and deploy high-performance, scalable backend services using FastAPI and Async Python.
Fine-tune large language models (LLMs) using PyTorch/TensorFlow to improve domain-specific performance.
Optimize GenAI workflows for latency, cost, and reliability using advanced prompt engineering and monitoring tools.
Containerize and deploy AI services via Docker to production environments.
Required Qualifications:
Hands-on experience building and deploying GenAI applications in a production setting.
Strong proficiency in Python and the modern AI library ecosystem (LangChain, LlamaIndex, etc.).
Experience with vector search, embedding models, and advanced data retrieval patterns.
Knowledge of model fine-tuning techniques and local LLM quantization/hosting.
Familiarity with production-grade monitoring, API security, and CI/CD for ML.
Compensation, Benefits and Duration
Minimum Compensation: USDÂ 79,000
Maximum Compensation: USD 276,000
Compensation is based on actual experience and qualifications of the candidate. The above is a reasonable and a good faith estimate for the role.
Medical, vision, and dental benefits, 401k retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full time employees.
This position is not available for independent contractors
No applications will be considered if received more than 120 days after the date of this post
About Photon
Sourced by ZipRecruiter
Company size
1 - 10 Employees
Headquarters location
Cambridge, MA, US
Year founded
1984