Job Title:ย Gen AI Architect
Location:ย Pleasanton , California (Remote)
ย
JD:ย We are seeking an experiencedย
Generative AI Architectย to lead the design, development, and deployment of cutting-edge generative AI systems. The ideal candidate will combine deep technical knowledge of AI/ML (particularly large language models and diffusion models) with strong architecture and leadership skills. You will play a critical role in shaping our AI strategy and enabling innovative products powered by generative technologies.
ย
Key Responsibilities:- Architect and designย end-to-end generative AI solutions (text, image, audio, or multimodal) that align with business objectives.
- Evaluate and select appropriateย foundation models (e.g., GPT, LLaMA, Stable Diffusion)ย and fine-tuning strategies.
- Lead the development ofย custom LLM applications, including prompt engineering, fine-tuning, RLHF, and model compression.
- Collaborate with cross-functional teams (engineering, product, design, data science) to integrate AI into products and platforms.
- Ensure responsible and ethical AI practices are embedded in system design (e.g., fairness, privacy, explainability).
- Guide the implementation of AIย infrastructureย (data pipelines, vector databases, model serving, APIs).
- Stay up-to-date on the latest AI research and tools, and make recommendations for adoption.
- Conductย proofs-of-concept, prototypes, and performance benchmarking.
- Mentor junior engineers and contribute to best practices and internal knowledge sharing.
ย
ย
Required Qualifications:- Bachelorโs or Masterโs degree in Computer Science, Artificial Intelligence, Machine Learning
- 7+ years of experience in AI/ML, with 3+ years in generative AI (LLMs, diffusion models, etc.).
- Proven experience designing and deploying large-scale AI systems.
- Deep understanding ofย transformer architectures,ย tokenization, andย pretraining/fine-tuning paradigms.
- Hands-on experience with AI/ML frameworks such asย PyTorch, TensorFlow, Hugging Face Transformers, LangChain, etc.
- Strong knowledge ofย MLOps, cloud platformsย (AWS, GCP, Azure), and scalable architectures (e.g., microservices, serverless).
- Experience withย vector databasesย (e.g., Pinecone, Weaviate, FAISS) andย retrieval-augmented generation (RAG)ย systems.
- Familiarity with responsible AI frameworks and privacy-preserving techniques.
ย
ย
Preferred Qualifications:- Experience withย open-source LLMsย and model distillation/quantization techniques.
- Exposure toย multimodal AI modelsย (e.g., CLIP, DALLยทE, Imagen).
- Contributions to AI/ML research (e.g., published papers, open-source projects).
- Experience buildingย GenAI copilots, chatbots, or productivity tools.
ย
ย
Soft Skills:- Strong problem-solving and analytical skills.
- Excellent communication and stakeholder management abilities.
- Ability to translate complex AI concepts into business value.
- Entrepreneurial mindset and passion for innovation