Technical Architect - AI/ML, LLMLocation: Santa Clara, CA 95054 (Onsite)
Full-TimeOverview:We are hiring a
Technical Architect - AI/ML with strong hands-on experience in
Large Language Models (LLMs) to design, build, and deploy
production-grade AI solutions. This role focuses on advanced model development, cloud deployment, and responsible AI practices, working closely with engineering, data, and LLM Ops teams.
Key Responsibilities: - Design, develop, and deploy advanced AI/ML and LLM-based models
- Collaborate with data engineering, LLM Ops, and software teams on end-to-end AI solutions
- Evaluate, fine-tune, and optimize models for performance, scalability, and reliability
- Research emerging AI/ML technologies and recommend innovative approaches
- Integrate AI models into applications using APIs and pipelines
- Troubleshoot deployment and integration issues proactively
- Mentor junior engineers and review models/code for best practices
- Communicate complex AI concepts clearly to technical and non-technical stakeholders
- Work in Agile/Scrum environments using tools such as Jira or Azure DevOps
Required Skills & Experience: - 6-10 years of experience in AI/ML or Deep Learning
- 2-3 years hands-on experience with LLMs, NLP, or speech/voice AI
- 3-5 years deploying AI models in production environments
- Strong experience with Python, PyTorch, TensorFlow, or similar frameworks
- Experience designing, training, and fine-tuning AI or language models
- Hands-on experience with cloud AI platforms (AWS SageMaker, Azure ML, or GCP AI)
- Experience integrating AI models via APIs or pipelines
- Knowledge of model evaluation, bias detection, optimization, and Responsible AI
- Strong mathematical and statistical foundation
- Bachelor's or Master's degree in Computer Science, AI/ML, Data Science, or related field