1

Summer Retrieval Augmented Generation Jobs (NOW HIRING)

AI/ML Engineer

Burbank, CA · On-site

$111K - $153K/yr

Build and deploy RAG (Retrieval-Augmented Generation) pipelines * Integrate LLMs via APIs (Azure OpenAI preferred) into enterprise applications * Develop and orchestrate agentic AI workflows with ...

Design and implement enterprise Retrieval Augmented Generation (RAG) architectures for GenAI platforms and applications. . Build and optimize semantic retrieval pipelines, vector search ...

This role focuses on building scalable systems leveraging Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and agentic AI workflows . The ideal candidate will bring deep expertise ...

Retrieval-Augmented Generation (RAG) * Ensure performance, scalability, and reliability of AI systems Agent Development & Orchestration * Program and orchestrate multi-agent systems * Build custom ...

AI/ML Engineer

Minneapolis, MN · Remote

$106K - $131K/yr

Familiarity with transformers, LLMs, and Retrieval-Augmented Generation (RAG) pipelines using vector databases. 6. Automation Development: Creating AI-powered automation solutions, including Einstein ...

Senior Cloud Data & AI Architect

Dallas, TX · On-site

$66.75 - $89.50/hr

Implement retrieval-augmented generation (RAG) pipelines with memory, context management, and tool usage. * Define Model Context Protocol (MCPs) to chain reasoning, retrieval, and action models.

AI Lead

Chicago, IL · On-site

$144K - $177K/yr

The ideal candidate will bring deep expertise in Python, FastAPI, and Retrieval-Augmented Generation (RAG) solutions, with hands-on experience deploying scalable AI applications on Azure. This role ...

Software Engineer (Java + GenAI)

San Jose, CA · On-site

$60.75 - $83.25/hr

... Retrieval-Augmented Generation (RAG) - Vector databases - Prompt engineering - Large Language Models (LLMs) - Application: Send suitable profiles and contact details to rams@vensoft.com

Senior Machine Learning Engineer

OR · On-site +1

$205K - $270K/yr

Architect and scale LLM and retrieval-augmented generation pipelines that ground models in enterprise data. This track focuses on building high-performance ML systems that process complex data ...

next page

Showing results 1-20

Summer Retrieval Augmented Generation information

What are the key skills and qualifications needed to thrive as a Retrieval Augmented Generation (RAG) Engineer, and why are they important?

To thrive as a Retrieval Augmented Generation (RAG) Engineer, you need a strong background in machine learning, natural language processing, and information retrieval, typically supported by a degree in computer science or a related field. Proficiency with frameworks like PyTorch or TensorFlow, experience with vector databases (e.g., FAISS, Pinecone), and familiarity with LLM APIs are commonly required. Creative problem-solving, strong communication, and the ability to collaborate across multidisciplinary teams are essential soft skills. These competencies ensure effective development, deployment, and optimization of advanced AI systems that integrate retrieval and generative capabilities.

What is a Summer Retrieval Augmented Generation role?

A Summer Retrieval Augmented Generation (RAG) role typically refers to a summer position focused on developing or improving retrieval-augmented generation systems, which are AI models that combine information retrieval with generative capabilities. In this role, you might work on integrating search algorithms with large language models, enabling systems to fetch relevant information from external sources and generate accurate, context-aware responses. These positions are often found in research labs, tech companies, or startups working on advanced AI applications, and are ideal for students or early-career professionals interested in machine learning, natural language processing, and AI research.

What are some common challenges faced when working on Retrieval-Augmented Generation (RAG) projects during a summer internship?

During a summer internship focused on Retrieval-Augmented Generation (RAG), interns often encounter challenges such as integrating retrieval systems with generative models, managing large-scale datasets, and optimizing latency for real-time responses. Collaboration with cross-functional teams—including data engineers, research scientists, and product managers—is essential for aligning project goals and troubleshooting implementation issues. Additionally, interns may need to balance exploratory research with delivering usable prototypes within tight timeframes, which helps develop both technical and project management skills.
What cities are hiring for Summer Retrieval Augmented Generation jobs? Cities with the most Summer Retrieval Augmented Generation job openings:
What are the most commonly searched types of Retrieval Augmented Generation jobs? The most popular types of Retrieval Augmented Generation jobs are:
What states have the most Summer Retrieval Augmented Generation jobs? States with the most job openings for Summer Retrieval Augmented Generation jobs include:

AI/ML Engineer

Carter Support Services

Burbank, CA • On-site

$111K - $153K/yr

Full-time

Posted 13 days ago


Job description

Location: Burbank, CA (100% Onsite)
Job Type: Full-Time
Experience Level: Mid-Senior (10+ Years)
Industry: Information Technology & Services


Position Overview

Carter Support Services is seeking a highly experienced Senior AI/ML Engineer to design, develop, and deploy advanced AI solutions. This role focuses on building scalable systems leveraging Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and agentic AI workflows.

The ideal candidate will bring deep expertise in Python, cloud-based AI deployment (Azure), and modern NLP techniques, along with experience delivering production-grade AI systems.


Key Responsibilities
  • Design and implement AI/ML solutions using Python and modern ML frameworks
  • Develop and optimize prompt engineering strategies for LLM-based systems
  • Build and deploy RAG (Retrieval-Augmented Generation) pipelines
  • Integrate LLMs via APIs (Azure OpenAI preferred) into enterprise applications
  • Develop and orchestrate agentic AI workflows with tool/function calling
  • Implement vector search solutions using vector databases or MongoDB
  • Ensure CI/CD integration and cloud deployment (Azure preferred)
  • Establish observability, monitoring, and evaluation frameworks for AI systems
  • Collaborate cross-functionally to deliver production-ready AI features

Required Qualifications
  • Bachelor’s degree in Computer Science, Engineering, or related field
  • 10+ years of experience in software engineering or AI/ML roles
  • 7+ years of Python experience (expert-level proficiency required)
  • 7+ years of Microsoft Azure experience, including Azure Machine Learning
  • 7+ years of DevOps experience, including CI/CD pipelines
  • 7+ years of MongoDB or similar database experience
  • Strong experience with LLM integration and RAG architectures
  • Experience with prompt engineering and context optimization
  • Solid understanding of NLP and transformer-based models
  • Experience with vector databases and search systems
  • Familiarity with agentic AI workflows and tool/function calling

Preferred Qualifications
  • Experience with Azure OpenAI API
  • Experience building scalable, enterprise-grade AI applications
  • Background in AI system monitoring, evaluation, and optimization

Work Environment
  • 100% onsite role in Burbank, CA
  • Collaborative, fast-paced technical environment

Powered by JazzHR

JruqlLdXda