Sr AWS Gen AI Engineer
$95K - $131K/yr
Building retrieval-augmented generation systems with embedding pipelines and semantic search * System design: Architecting scalable, secure, cost-efficient cloud-native data and AI systems
$95K - $131K/yr
Building retrieval-augmented generation systems with embedding pipelines and semantic search * System design: Architecting scalable, secure, cost-efficient cloud-native data and AI systems
$95K - $131K/yr
Building retrieval-augmented generation systems with embedding pipelines and semantic search * System design: Architecting scalable, secure, cost-efficient cloud-native data and AI systems
Design end-to-end retrieval-augmented generation (RAG) systems leveraging enterprise knowledge bases, policy documents, SOPs, and historical claims data. * Build autonomous and semi-autonomous agents ...
Design end-to-end retrieval-augmented generation (RAG) systems leveraging enterprise knowledge bases, policy documents, SOPs, and historical claims data. * Build autonomous and semi-autonomous agents ...
Design end-to-end retrieval-augmented generation (RAG) systems leveraging enterprise knowledge bases, policy documents, SOPs, and historical claims data. * Build autonomous and semi-autonomous agents ...
Design end-to-end retrieval-augmented generation (RAG) systems leveraging enterprise knowledge bases, policy documents, SOPs, and historical claims data. * Build autonomous and semi-autonomous agents ...
Dearborn, MI · On-site
$112K - $148K/yr
Evaluate trade-offs across architectural choices (e.g., single- versus multi-agent designs, retrieval-augmented generation versus fine-tuning, deterministic versus probabilistic pipelines)
Dearborn, MI · On-site
$112K - $148K/yr
Evaluate trade-offs across architectural choices (e.g., single- versus multi-agent designs, retrieval-augmented generation versus fine-tuning, deterministic versus probabilistic pipelines)
... and retrieval-augmented generation. * Engineering integrations between data platforms, governance, risk, and compliance workflows, and enterprise systems using application programming interfaces ...
... and retrieval-augmented generation. * Engineering integrations between data platforms, governance, risk, and compliance workflows, and enterprise systems using application programming interfaces ...
Own end-to-end retrieval-augmented generation (RAG) implementations (ingestion, chunking, embedding, indexing, retrieval, orchestration); define prompt engineering standards and evaluation harnesses ...
Own end-to-end retrieval-augmented generation (RAG) implementations (ingestion, chunking, embedding, indexing, retrieval, orchestration); define prompt engineering standards and evaluation harnesses ...
Dewitt, MI · Hybrid
Creates reference architectures, defines security requirements and patterns for model training, inference, retrieval-augmented generation (RAG), agent orchestration, tool calling, and multi-model ...
Dewitt, MI · Hybrid
Creates reference architectures, defines security requirements and patterns for model training, inference, retrieval-augmented generation (RAG), agent orchestration, tool calling, and multi-model ...
Dearborn, MI · On-site
$112K - $148K/yr
Google ADK, LangChain/LangGraph, OpenAI and Gemini APIs, prompt engineering, retrieval augmented generation (RAG) pipelines Data and Cloud Infrastructure: Google Cloud Platform (BigQuery, Vertex AI ...
Dearborn, MI · On-site
$112K - $148K/yr
Google ADK, LangChain/LangGraph, OpenAI and Gemini APIs, prompt engineering, retrieval augmented generation (RAG) pipelines Data and Cloud Infrastructure: Google Cloud Platform (BigQuery, Vertex AI ...
Dewitt, MI · Hybrid
Creates reference architectures, defines security requirements and patterns for model training, inference, retrieval-augmented generation (RAG), agent orchestration, tool calling, and multi-model ...
Dewitt, MI · Hybrid
Creates reference architectures, defines security requirements and patterns for model training, inference, retrieval-augmented generation (RAG), agent orchestration, tool calling, and multi-model ...
Own end-to-end retrieval-augmented generation (RAG) implementations (ingestion, chunking, embedding, indexing, retrieval, orchestration); define prompt engineering standards and evaluation harnesses ...
Own end-to-end retrieval-augmented generation (RAG) implementations (ingestion, chunking, embedding, indexing, retrieval, orchestration); define prompt engineering standards and evaluation harnesses ...
Dewitt, MI · Hybrid
Creates reference architectures, defines security requirements and patterns for model training, inference, retrieval-augmented generation (RAG), agent orchestration, tool calling, and multi-model ...
Dewitt, MI · Hybrid
Creates reference architectures, defines security requirements and patterns for model training, inference, retrieval-augmented generation (RAG), agent orchestration, tool calling, and multi-model ...
$105K - $126K/yr
Strong understanding of Generative AI principles and architectures, including Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems. * Proven experience in building and ...
Quick apply
$105K - $126K/yr
Strong understanding of Generative AI principles and architectures, including Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems. * Proven experience in building and ...
Dearborn, MI · On-site
$96K - $131K/yr
Strong understanding of Generative AI principles and architectures, including Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems. * Proven experience in building and ...
Quick apply
Dearborn, MI · On-site
$96K - $131K/yr
Strong understanding of Generative AI principles and architectures, including Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems. * Proven experience in building and ...
... retrieval-augmented generation (RAG), and parameter-efficient fine-tuning (PEFT/LoRA) to develop and evaluate algorithms that improve product/system performance, quality, data management, and ...
... retrieval-augmented generation (RAG), and parameter-efficient fine-tuning (PEFT/LoRA) to develop and evaluate algorithms that improve product/system performance, quality, data management, and ...
Detroit, MI · On-site +1
$113K - $136K/yr
Implement retrieval-augmented generation patterns, including document ingestion, chunking, embeddings, vector or hybrid search, and retrieval and evaluation telemetry * Deliver governed datasets and ...
Detroit, MI · On-site +1
$113K - $136K/yr
Implement retrieval-augmented generation patterns, including document ingestion, chunking, embeddings, vector or hybrid search, and retrieval and evaluation telemetry * Deliver governed datasets and ...
Auburn Hills, MI · On-site
$60.25 - $77.50/hr
We are seeking an experienced AI/ML Architect to design and develop conversational AI solutions using Retrieval-Augmented Generation (RAG) techniques. The ideal candidate will have a strong ...
Quick apply
Auburn Hills, MI · On-site
$60.25 - $77.50/hr
We are seeking an experienced AI/ML Architect to design and develop conversational AI solutions using Retrieval-Augmented Generation (RAG) techniques. The ideal candidate will have a strong ...
Warren, MI · On-site
Leverage LLMs and related AI services (e.g., retrieval-augmented generation, embeddings, vector search) to power agent capabilities. * Integrate agents with enterprise systems, APIs, and data sources ...
Warren, MI · On-site
Leverage LLMs and related AI services (e.g., retrieval-augmented generation, embeddings, vector search) to power agent capabilities. * Integrate agents with enterprise systems, APIs, and data sources ...
Warren, MI · On-site +1
Leverage LLMs and related AI services (e.g., retrieval-augmented generation, embeddings, vector search) to power agent capabilities. * Integrate agents with enterprise systems, APIs, and data sources ...
Warren, MI · On-site +1
Leverage LLMs and related AI services (e.g., retrieval-augmented generation, embeddings, vector search) to power agent capabilities. * Integrate agents with enterprise systems, APIs, and data sources ...
Detroit, MI · On-site
Architect and deliver integrated AI solutions, including agentic workflows, retrieval-augmented generation pipelines, and enterprise platform integrations * Define and enforce governance, security ...
Detroit, MI · On-site
Architect and deliver integrated AI solutions, including agentic workflows, retrieval-augmented generation pipelines, and enterprise platform integrations * Define and enforce governance, security ...
Warren, MI · On-site +1
Leverage LLMs and related AI services (e.g., retrieval-augmented generation, embeddings, vector search) to power agent capabilities. * Integrate agents with enterprise systems, APIs, and data sources ...
Warren, MI · On-site +1
Leverage LLMs and related AI services (e.g., retrieval-augmented generation, embeddings, vector search) to power agent capabilities. * Integrate agents with enterprise systems, APIs, and data sources ...
A Retrieval Augmented Generation engineer typically spends their day designing and implementing systems that combine information retrieval with advanced generative models, such as large language models. This includes fine-tuning models, integrating external data sources, developing vector search pipelines, and evaluating output quality. Collaboration with data scientists, machine learning engineers, and product teams is common to ensure the solutions meet user requirements and scale effectively. Additionally, RAG engineers often troubleshoot issues, monitor model performance in production, and stay informed about the latest advancements in AI and information retrieval.
A Retrieval Augmented Generation (RAG) job typically involves developing and optimizing AI systems that enhance text generation by incorporating external knowledge retrieved from relevant sources. Professionals in this field work on integrating retrieval mechanisms with large language models to improve the relevance, accuracy, and factual grounding of generated content. Common responsibilities include designing retrieval systems, fine-tuning language models, optimizing performance, and ensuring the seamless integration of factual data into AI-generated text. This role is highly interdisciplinary, involving expertise in natural language processing (NLP), machine learning, and information retrieval.
To thrive in a Retrieval Augmented Generation (RAG) engineering role, you need a solid background in machine learning, natural language processing (NLP), and experience with scalable information retrieval systems, typically supported by a relevant degree in computer science or a related field. Familiarity with tools such as Python, PyTorch or TensorFlow, vector databases, and search platforms like Elasticsearch is essential, along with practical experience deploying and tuning RAG pipelines. Strong problem-solving skills, a collaborative mindset, and effective communication abilities set outstanding professionals apart in this field. These competencies are crucial for designing, implementing, and optimizing hybrid retrieval-generation AI systems that address complex, real-world information needs.
$95K - $131K/yr
Other
Posted 16 days ago
Job Title: Sr AWS Gen AI Engineer
Longterm Contract
Location: Detroit, MI
Skills:
Sourced by ZipRecruiter
1,001 - 5,000 Employees
Northville, MI, US
2004