1

Retrieval Augmented Generation Jobs (NOW HIRING)

Additionally, experience in building Retrieval-Augmented Generation (RAG) pipelines for search and chat applications is highly desired. Key Responsibilities: * Develop and optimize NLP models for ...

Responsibilities : โ€ข Lead onboarding of business applications onto the enterprise AI platform. โ€ข Design and implement Retrieval-Augmented Generation (RAG) architectures. โ€ข Act as the primary ...

Python AI Dev

Jersey City, NJ ยท On-site

$55 - $75.75/hr

... Retrieval-Augmented Generation) implementations โ€ข LangChain, LlamaIndex, OpenAI, Azure OpenAI, Anthropic, or similar frameworks โ€ข API development using FastAPI, Flask, or Django โ€ข Strong SQL ...

New

Senior AI Engineer - Privacy

Bellevue, WA ยท On-site

$70 - $75/hr

This role involves applying large language models, retrieval-augmented generation, multi-agent orchestration, and foundation model capabilities to automate and enhance privacy operations. Requirement ...

The ideal candidate will have a strong background in investment banking, hands-on experience with Microsoft Azure OpenAI, and expertise in Retrieval-Augmented Generation (RAG). Key Responsibilities:

Integrate with large language models (LLMs) and generative AI (GenAI) using prompt engineering, fine-tuning, and retrieval-augmented generation (RAG) techniques. * Implement MCP client and server ...

AI Engineer

Dallas, TX ยท On-site

Develop and maintain Retrieval-Augmented Generation (RAG) architectures using vector databases and semantic search technologies * Create, test, and refine prompts, structured outputs, and evaluation ...

AI Engineer

Dallas, TX ยท On-site

Develop and maintain Retrieval-Augmented Generation (RAG) architectures using vector databases and semantic search technologies * Create, test, and refine prompts, structured outputs, and evaluation ...

Java Fullstack Developer

Pittsburgh, PA ยท On-site

$45 - $50/hr

Knowledge of LLMs, AI agents, and Retrieval-Augmented Generation (RAG) frameworks. Responsibilities: * Design, develop, and maintain scalable microservices using Core Java and Spring Boot. * Build ...

next page

Showing results 1-20

Retrieval Augmented Generation information

What are the typical daily responsibilities of a Retrieval Augmented Generation engineer?

A Retrieval Augmented Generation engineer typically spends their day designing and implementing systems that combine information retrieval with advanced generative models, such as large language models. This includes fine-tuning models, integrating external data sources, developing vector search pipelines, and evaluating output quality. Collaboration with data scientists, machine learning engineers, and product teams is common to ensure the solutions meet user requirements and scale effectively. Additionally, RAG engineers often troubleshoot issues, monitor model performance in production, and stay informed about the latest advancements in AI and information retrieval.

What is a Retrieval Augmented Generation job?

A Retrieval Augmented Generation (RAG) job typically involves developing and optimizing AI systems that enhance text generation by incorporating external knowledge retrieved from relevant sources. Professionals in this field work on integrating retrieval mechanisms with large language models to improve the relevance, accuracy, and factual grounding of generated content. Common responsibilities include designing retrieval systems, fine-tuning language models, optimizing performance, and ensuring the seamless integration of factual data into AI-generated text. This role is highly interdisciplinary, involving expertise in natural language processing (NLP), machine learning, and information retrieval.

What are the key skills and qualifications needed to thrive in the Retrieval Augmented Generation position, and why are they important?

To thrive in a Retrieval Augmented Generation (RAG) engineering role, you need a solid background in machine learning, natural language processing (NLP), and experience with scalable information retrieval systems, typically supported by a relevant degree in computer science or a related field. Familiarity with tools such as Python, PyTorch or TensorFlow, vector databases, and search platforms like Elasticsearch is essential, along with practical experience deploying and tuning RAG pipelines. Strong problem-solving skills, a collaborative mindset, and effective communication abilities set outstanding professionals apart in this field. These competencies are crucial for designing, implementing, and optimizing hybrid retrieval-generation AI systems that address complex, real-world information needs.

More about Retrieval Augmented Generation jobs
What cities are hiring for Retrieval Augmented Generation jobs? Cities with the most Retrieval Augmented Generation job openings:
What are the most commonly searched types of Retrieval Augmented Generation jobs? The most popular types of Retrieval Augmented Generation jobs are:
What states have the most Retrieval Augmented Generation jobs? States with the most job openings for Retrieval Augmented Generation jobs include:
What job categories do people searching Retrieval Augmented Generation jobs look for? The top searched job categories for Retrieval Augmented Generation jobs are:
Infographic showing various Retrieval Augmented Generation job openings in the United States as of June 2026, with employment types broken down into 75% Full Time, and 25% Contract. Highlights an 75% In-person, and 25% Remote job distribution.
Python Software Developer (Python AWS, RAG) / on W2

Python Software Developer (Python AWS, RAG) / on W2

Noblesoft Solutions Inc.

Saint Petersburg, FL โ€ข Hybrid

$47.50 - $65.50/hr

Other

Posted 22 days ago


Job description

Position:ย Python Software Developer (Python AWS, RAG)
Contract Duration: Long Term
Location:ย St. Petersburg, FL (Hybrid, 3 days onsite)
Only Locals candidates

Job Description
We are seeking a talented and passionate Python Developer to join our team and contribute to the development and implementation of innovative Retrieval-Augmented Generation (RAG) systems. You will play a crucial role in designing, building, and maintaining applications that leverage the power of large language models (LLMs) and advanced retrieval techniques to create cutting-edge AI solutions.

Skills:

Strong Python programming skills:
Demonstrated proficiency in Python programming, including experience with relevant libraries and frameworks (e.g., FastAPI, Flask, Pandas, NumPy).
Experience with LLMs and RAG systems:
Familiarity with large language models and retrieval-augmented generation techniques, including experience with LLM APIs and retrieval systems.

Experience with data retrieval and indexing:
Experience with data retrieval from various sources (e.g., databases, APIs, file systems) and building and managing retrieval indices.

Knowledge of data structures and algorithms:
Understanding of fundamental data structures and algorithms relevant to building efficient and scalable RAG applications.

Experience with cloud computing platforms (e.g., AWS, Google Cloud Platform, Azure):
Familiarity with cloud computing platforms and their services for deploying and scaling RAG applications.

Strong problem-solving and analytical skills:
Ability to identify, analyze, and solve complex problems related to data retrieval, LLM integration, and RAG system optimization.

Bonus Points:
Experience with specific LLM frameworks (e.g., LangChain, Hugging Face Transformers).
Familiarity with search engines and information retrieval techniques.
Experience with machine learning and deep learning concepts.
Experience with building and deploying production-ready applications.
Contribution to open-source projects related to RAG or LLMs.

Education:
Bachelor''''''''s degree in Computer Science, Software Engineering, or related technical field required
Master''''''''s degree in Computer Science, AI/ML, or related field preferred
Relevant professional certifications in Python, cloud platforms, or AI/ML technologies are a plus

ย