1

Summer Retrieval Augmented Generation Jobs (NOW HIRING)

Gen AI Lead - RAG (Retrieval-Augmented Generation) Specialist We are looking for a highly skilled Gen AI Lead specializing in Retrieval-Augmented Generation (RAG) to join our AI team in Pleasanton ...

Apply Early

Responsibilities : • Building production-ready AI services and APIs using Python • LLM integration • Prompt engineering frameworks • RAG (Retrieval-Augmented Generation) pipelines • Vector ...

Additionally, experience in building Retrieval-Augmented Generation (RAG) pipelines for search and chat applications is highly desired. Key Responsibilities: * Develop and optimize NLP models for ...

AI/ML architecture, ML and Generative AI (GenAI) use cases, Large Language Models (LLMs), RAG (Retrieval-Augmented Generation), LangChain, LangGraph,CI/CD for AI pipelines Min experience: 10+ Years

Apply Early

Closure Technologies is seeking a AI/ML Engineer who will Implement and maintain Retrieval-Augmented Generation (RAG) pipelines and integrate Large Language Models (LLMs) into applications, supported ...

Build Retrieval-Augmented Generation (RAG) pipelines using embeddings, semantic search, vector databases, chunking strategies, reranking, response grounding, and citation mechanisms. * Fine-tune and ...

The ideal candidate will have a strong background in investment banking, hands-on experience with Microsoft Azure OpenAI, and expertise in Retrieval-Augmented Generation (RAG). Key Responsibilities:

Apply Early

GPT, Claude • Prompt Engineering • RAG (Retrieval Augmented Generation) • AWS Cloud • Strong architectural and hands on GenAI expertise • Experience with enterprise automation and testing ...

Working under the guidance of senior engineers, the role assists with developing and maintaining enterprise AI models, retrieval-augmented generation (RAG) systems, and agentic workflows. The AI ...

Additionally, experience in building Retrieval-Augmented Generation (RAG) pipelines for search and chat applications is highly desired. Key Responsibilities: * Develop and optimize NLP models for ...

next page

Showing results 1-20

Summer Retrieval Augmented Generation information

What are the key skills and qualifications needed to thrive as a Retrieval Augmented Generation (RAG) Engineer, and why are they important?

To thrive as a Retrieval Augmented Generation (RAG) Engineer, you need a strong background in machine learning, natural language processing, and information retrieval, typically supported by a degree in computer science or a related field. Proficiency with frameworks like PyTorch or TensorFlow, experience with vector databases (e.g., FAISS, Pinecone), and familiarity with LLM APIs are commonly required. Creative problem-solving, strong communication, and the ability to collaborate across multidisciplinary teams are essential soft skills. These competencies ensure effective development, deployment, and optimization of advanced AI systems that integrate retrieval and generative capabilities.

What is a Summer Retrieval Augmented Generation role?

A Summer Retrieval Augmented Generation (RAG) role typically refers to a summer position focused on developing or improving retrieval-augmented generation systems, which are AI models that combine information retrieval with generative capabilities. In this role, you might work on integrating search algorithms with large language models, enabling systems to fetch relevant information from external sources and generate accurate, context-aware responses. These positions are often found in research labs, tech companies, or startups working on advanced AI applications, and are ideal for students or early-career professionals interested in machine learning, natural language processing, and AI research.

What are some common challenges faced when working on Retrieval-Augmented Generation (RAG) projects during a summer internship?

During a summer internship focused on Retrieval-Augmented Generation (RAG), interns often encounter challenges such as integrating retrieval systems with generative models, managing large-scale datasets, and optimizing latency for real-time responses. Collaboration with cross-functional teams—including data engineers, research scientists, and product managers—is essential for aligning project goals and troubleshooting implementation issues. Additionally, interns may need to balance exploratory research with delivering usable prototypes within tight timeframes, which helps develop both technical and project management skills.
What cities are hiring for Summer Retrieval Augmented Generation jobs? Cities with the most Summer Retrieval Augmented Generation job openings:
What are the most commonly searched types of Retrieval Augmented Generation jobs? The most popular types of Retrieval Augmented Generation jobs are:
What states have the most Summer Retrieval Augmented Generation jobs? States with the most job openings for Summer Retrieval Augmented Generation jobs include:

Gen AI Lead - RAG Specialist

Symhas

Pleasanton, CA • On-site

Other

Posted 9 days ago

Be an early applicant


Job description

Gen AI Lead – RAG (Retrieval-Augmented Generation) Specialist

We are looking for a highly skilled Gen AI Lead specializing in Retrieval-Augmented Generation (RAG) to join our AI team in Pleasanton, CA. You will own the architecture and delivery of enterprise-scale RAG systems that power intelligent search, Q&A, and knowledge management products.

Key Responsibilities
  • Design, build, and optimize end-to-end RAG pipelines for enterprise use cases
  • Lead vector store strategy, embedding model selection, and retrieval optimization
  • Implement advanced RAG patterns: hybrid search, re-ranking, contextual compression, and agentic RAG
  • Evaluate and improve answer quality, hallucination reduction, and retrieval precision
  • Guide a team of engineers in building scalable, production-grade knowledge systems
  • Partner with data engineering to ensure high-quality document ingestion pipelines
Requirements
  • 10+ years of overall AI/ML or software engineering experience
  • 3+ years building and deploying RAG systems in production
  • Expertise with vector databases (Pinecone, Weaviate, pgvector, Qdrant, etc.)
  • Strong experience with embedding models, chunking strategies, and retrieval optimization
  • Proficiency in Python, LLM APIs, and document processing pipelines
  • US Green Card or Citizenship required
  • Must be willing to work onsite in Pleasanton, CA