RAG (Retrieval-Augmented Generation) * Prompt engineering * Vector databases (design/usage/integration) * Model build + deployment * GenAI model build: training, fine-tuning, validation * Model ...
RAG (Retrieval-Augmented Generation) * Prompt engineering * Vector databases (design/usage/integration) * Model build + deployment * GenAI model build: training, fine-tuning, validation * Model ...
Senior Data Architect
Salt Lake City, UT · On-site
$65 - $87/hr
... as Retrieval-Augmented Generation (RAG), machine learning workflows, vector data, and modern AI architectures • Understanding of how data architecture enables Large Language Models (LLMs), AI ...
Senior Data Architect
Salt Lake City, UT · On-site
$65 - $87/hr
... as Retrieval-Augmented Generation (RAG), machine learning workflows, vector data, and modern AI architectures • Understanding of how data architecture enables Large Language Models (LLMs), AI ...
You'll transform legacy documentation into an AI-optimized, Retrieval-Augmented Generation (RAG) environment that delivers fast, accurate, and trustworthy answers for customers and service ...
You'll transform legacy documentation into an AI-optimized, Retrieval-Augmented Generation (RAG) environment that delivers fast, accurate, and trustworthy answers for customers and service ...
Familiarity with retrieval augmented generation (RAG) pipelines and how search quality feeds downstream LLM applications. * Background in information retrieval, natural language processing
Familiarity with retrieval augmented generation (RAG) pipelines and how search quality feeds downstream LLM applications. * Background in information retrieval, natural language processing
Implement retrieval-augmented generation pipelines using enterprise data sources * Build and orchestrate agent-based workflows to automate targeted tasks Model Integration and System Behavior
Implement retrieval-augmented generation pipelines using enterprise data sources * Build and orchestrate agent-based workflows to automate targeted tasks Model Integration and System Behavior
Sr. AI Engineer
Salt Lake City, UT · On-site
$101K - $138K/yr
Retrieval-Augmented Generation (RAG) * Multi-agent orchestration * Enterprise integrations (SAP, Salesforce, Databricks, SharePoint, Azure Ecosystem) Guide use-case prioritization and ...
Sr. AI Engineer
Salt Lake City, UT · On-site
$101K - $138K/yr
Retrieval-Augmented Generation (RAG) * Multi-agent orchestration * Enterprise integrations (SAP, Salesforce, Databricks, SharePoint, Azure Ecosystem) Guide use-case prioritization and ...
Sr. AI Engineer
Salt Lake City, UT · On-site
$53.50 - $69/hr
Retrieval-Augmented Generation (RAG) * Multi-agent orchestration * Enterprise integrations (SAP, Salesforce, Databricks, SharePoint, Azure Ecosystem) Guide use-case prioritization and platform ...
Sr. AI Engineer
Salt Lake City, UT · On-site
$53.50 - $69/hr
Retrieval-Augmented Generation (RAG) * Multi-agent orchestration * Enterprise integrations (SAP, Salesforce, Databricks, SharePoint, Azure Ecosystem) Guide use-case prioritization and platform ...
Implement retrieval-augmented generation pipelines using enterprise data sources * Build and orchestrate agent-based workflows to automate targeted tasks Model Integration and System Behavior
Quick apply
Implement retrieval-augmented generation pipelines using enterprise data sources * Build and orchestrate agent-based workflows to automate targeted tasks Model Integration and System Behavior
Sr. AI Engineer
Salt Lake City, UT · On-site
$101K - $138K/yr
Retrieval-Augmented Generation (RAG) * Multi-agent orchestration * Enterprise integrations (SAP, Salesforce, Databricks, SharePoint, Azure Ecosystem) Guide use-case prioritization and ...
Sr. AI Engineer
Salt Lake City, UT · On-site
$101K - $138K/yr
Retrieval-Augmented Generation (RAG) * Multi-agent orchestration * Enterprise integrations (SAP, Salesforce, Databricks, SharePoint, Azure Ecosystem) Guide use-case prioritization and ...
You'll transform legacy documentation into an AI-optimized, Retrieval-Augmented Generation (RAG) environment that delivers fast, accurate, and trustworthy answers for customers and service ...
You'll transform legacy documentation into an AI-optimized, Retrieval-Augmented Generation (RAG) environment that delivers fast, accurate, and trustworthy answers for customers and service ...
Senior Data Architect
Salt Lake City, UT · On-site
$90 - $100/hr
... as Retrieval-Augmented Generation (RAG), machine learning workflows, vector data, and modern AI architectures • Understanding of how data architecture enables Large Language Models (LLMs), AI ...
Senior Data Architect
Salt Lake City, UT · On-site
$90 - $100/hr
... as Retrieval-Augmented Generation (RAG), machine learning workflows, vector data, and modern AI architectures • Understanding of how data architecture enables Large Language Models (LLMs), AI ...
Senior ML Engineer
$98K - $134K/yr
Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database. * Language Model (LM) Development & Fine-tuning: * Research, select ...
Senior ML Engineer
$98K - $134K/yr
Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database. * Language Model (LM) Development & Fine-tuning: * Research, select ...
Senior ML Engineer
Lehi, UT · On-site
$98K - $134K/yr
Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database. * Language Model (LM) Development & Fine-tuning: * Research, select ...
Senior ML Engineer
Lehi, UT · On-site
$98K - $134K/yr
Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database. * Language Model (LM) Development & Fine-tuning: * Research, select ...
Senior ML Engineer
$98K - $134K/yr
Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database. * Language Model (LM) Development & Fine-tuning: * Research, select ...
Senior ML Engineer
$98K - $134K/yr
Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database. * Language Model (LM) Development & Fine-tuning: * Research, select ...
AI Engineer
Draper, UT · On-site
Develop and productionize LLM-based solutions, including prompt engineering, retrieval-augmented generation (RAG) pipelines, fine-tuning, and multimodal models * Build and orchestrate agentic AI ...
AI Engineer
Draper, UT · On-site
Develop and productionize LLM-based solutions, including prompt engineering, retrieval-augmented generation (RAG) pipelines, fine-tuning, and multimodal models * Build and orchestrate agentic AI ...
LLM-based application development, retrieval-augmented generation (RAG) architectures, agentic design patterns, prompt engineering, or vector databases • Demonstrated ability to learn new technical ...
LLM-based application development, retrieval-augmented generation (RAG) architectures, agentic design patterns, prompt engineering, or vector databases • Demonstrated ability to learn new technical ...
... retrieval-augmented generation pipelines using enterprise data sources • Build and orchestrate agent-based workflows to automate targeted tasks • Integrate LLM APIs such as Anthropic Claude and ...
... retrieval-augmented generation pipelines using enterprise data sources • Build and orchestrate agent-based workflows to automate targeted tasks • Integrate LLM APIs such as Anthropic Claude and ...
Software Engineer 2 - AI Platform
Salt Lake City, UT · On-site +1
Design solutions for context management, memory, and retrieval-augmented generation (RAG) to enhance agent effectiveness. Experience you'll bring: * Bachelor's degree in Computer Science or Software ...
Software Engineer 2 - AI Platform
Salt Lake City, UT · On-site +1
Design solutions for context management, memory, and retrieval-augmented generation (RAG) to enhance agent effectiveness. Experience you'll bring: * Bachelor's degree in Computer Science or Software ...
Familiarity with large language models beyond basic API usage - e.g., experience with prompt tuning, deploying LLMs, or implementing retrieval-augmented generation (RAG) pipelines with vector ...
Familiarity with large language models beyond basic API usage - e.g., experience with prompt tuning, deploying LLMs, or implementing retrieval-augmented generation (RAG) pipelines with vector ...
Senior Backend Engineer - AI Platform
Salt Lake City, UT · On-site +1
$118K - $156K/yr
Design solutions for context management, memory, and retrieval-augmented generation (RAG) to enhance agent effectiveness. Experience you'll bring: * Bachelor's degree in Computer Science or Software ...
Senior Backend Engineer - AI Platform
Salt Lake City, UT · On-site +1
$118K - $156K/yr
Design solutions for context management, memory, and retrieval-augmented generation (RAG) to enhance agent effectiveness. Experience you'll bring: * Bachelor's degree in Computer Science or Software ...
Retrieval Augmented Generation Rag information

Deloitte rating
8.0
Based on 89 frontline employees who took The Breakroom Quiz
71st of 146 rated financial services
Job description
Deloitte professionals help organizations navigate business risks and opportunities across financial, operational, information technology (IT), and regulatory areas. In this Manager role, you will lead teams delivering end-to-end (full stack) Generative AI (GenAI) solutions-including Retrieval-Augmented Generation (RAG) and agentic AI-from strategy and architecture through build, deployment, and adoption.
Recruiting for this role ends on May 31st, 2026
Work you'll do
- Lead client discovery, requirements, and solution shaping; translate needs into architecture, technical specifications, delivery plans, and acceptance criteria.
- Design, build, and implement custom AI/GenAI solutions tailored to business workflows and risk considerations.
- Architect and optimize agentic AI systems (e.g., tool-using agents, multi-step orchestration, multi-agent patterns) and integrate with enterprise platforms.
- Lead end-to-end RAG implementations including ingestion, preprocessing, chunking, embeddings, indexing, retrieval, orchestration, and evaluation.
- Drive GenAI model build activities (training, fine-tuning, validation), benchmarking, and continuous improvement of quality, safety, latency, and cost.
- Oversee model deployment and production operations (monitoring, observability, incident response, iteration).
- Lead development pods (planning, quality, delivery), including code/design reviews, mentoring, and engineering best practices.
- Collaborate with cross-functional stakeholders (product, data, security, risk/compliance) to deliver scalable, maintainable solutions.
- Evaluate emerging GenAI/agent frameworks and cloud services; prototype and recommend fit-for-purpose approaches.
The team
Our team culture is collaborative and encourages team members to take initiative and seek on-the-job learning opportunities. Audit & Assurance services are focused on engagements related to independent External Audit services, Accounting, Controls & Reporting Advisory, and Specialized Assurance & Sustainability. We bring together the diverse skills and industry experience of our people, leading-edge technology, and a global network to deliver high-quality audits of financial statements and internal controls over financial reporting, along with assurance reports and valuable advice and insights across the corporate reporting landscape. Learn more about Deloitte Audit & Assurance.
Qualifications
Required:
- Bachelor's degree (or equivalent) in Computer Science, Engineering, Data Science, or a related field.
- 6+ years of relevant experience in software engineering/full stack development and delivering AI/ML or GenAI-enabled solutions.
- Experience leading teams and delivering client-facing solutions with clear ownership for quality and timelines.
- Required technical skills (must have):
- GenAI / NLP / Agentic AI
- Python programming
- Natural Language Processing (NLP)
- Agentic AI, including LangChain, LangGraph, and LlamaIndex
- RAG (Retrieval-Augmented Generation)
- Prompt engineering
- Vector databases (design/usage/integration)
- Model build + deployment
- GenAI model build: training, fine-tuning, validation
- Model deployment (serving patterns, monitoring, iteration)
- Containers (e.g., Docker)
- Data engineering + APIs
- ETL (extract, transform, load) and data engineering (pipelines, quality, preprocessing)
- FastAPI (or equivalent) to build backend services
- API development and integration (RESTful services)
- Full stack engineering
- JavaScript/TypeScript
- HTML/CSS plus SASS/LESS
- UI/UX design principles
- Front-end frameworks: React, Angular, or Vue
- Cloud AI/ML services across Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP)
- Vertex AI experience
- You should reside within a commutable distance of your assigned office with the ability to commute daily, if required
- You can expect to co-locate on average 3 times a week with variations based on types of work/projects and client locations
- Ability to travel up to 50%, on average, based on the work you do and the clients/sectors you serve
- Limited immigration sponsorship may be available.
Preferred:
- Cloud certification (AWS, Azure, or GCP) and/or AI/ML certification.
- Experience with deep learning frameworks (e.g., PyTorch, TensorFlow, Keras).
- Familiarity with AI/GenAI ethics and governance frameworks and implementing controls in production.
The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $151,470 to $218,025.
You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.
Qualifications:Deloitte professionals help organizations navigate business risks and opportunities across financial, operational, information technology (IT), and regulatory areas. In this Manager role, you will lead teams delivering end-to-end (full stack) Generative AI (GenAI) solutions-including Retrieval-Augmented Generation (RAG) and agentic AI-from strategy and architecture through build, deployment, and adoption.
Recruiting for this role ends on May 31st, 2026
Work you'll do
- Lead client discovery, requirements, and solution shaping; translate needs into architecture, technical specifications, delivery plans, and acceptance criteria.
- Design, build, and implement custom AI/GenAI solutions tailored to business workflows and risk considerations.
- Architect and optimize agentic AI systems (e.g., tool-using agents, multi-step orchestration, multi-agent patterns) and integrate with enterprise platforms.
- Lead end-to-end RAG implementations including ingestion, preprocessing, chunking, embeddings, indexing, retrieval, orchestration, and evaluation.
- Drive GenAI model build activities (training, fine-tuning, validation), benchmarking, and continuous improvement of quality, safety, latency, and cost.
- Oversee model deployment and production operations (monitoring, observability, incident response, iteration).
- Lead development pods (planning, quality, delivery), including code/design reviews, mentoring, and engineering best practices.
- Collaborate with cross-functional stakeholders (product, data, security, risk/compliance) to deliver scalable, maintainable solutions.
- Evaluate emerging GenAI/agent frameworks and cloud services; prototype and recommend fit-for-purpose approaches.
The team
Our team culture is collaborative and encourages team members to take initiative and seek on-the-job learning opportunities. Audit & Assurance services are focused on engagements related to independent External Audit services, Accounting, Controls & Reporting Advisory, and Specialized Assurance & Sustainability. We bring together the diverse skills and industry experience of our people, leading-edge technology, and a global network to deliver high-quality audits of financial statements and internal controls over financial reporting, along with assurance reports and valuable advice and insights across the corporate reporting landscape. Learn more about Deloitte Audit & Assurance.
Qualifications
Required:
- Bachelor's degree (or equivalent) in Computer Science, Engineering, Data Science, or a related field.
- 6+ years of relevant experience in software engineering/full stack development and delivering AI/ML or GenAI-enabled solutions.
- Experience leading teams and delivering client-facing solutions with clear ownership for quality and timelines.
- Required technical skills (must have):
- GenAI / NLP / Agentic AI
- Python programming
- Natural Language Processing (NLP)
- Agentic AI, including LangChain, LangGraph, and LlamaIndex
- RAG (Retrieval-Augmented Generation)
- Prompt engineering
- Vector databases (design/usage/integration)
- Model build + deployment
- GenAI model build: training, fine-tuning, validation
- Model deployment (serving patterns, monitoring, iteration)
- Containers (e.g., Docker)
- Data engineering + APIs
- ETL (extract, transform, load) and data engineering (pipelines, quality, preprocessing)
- FastAPI (or equivalent) to build backend services
- API development and integration (RESTful services)
- Full stack engineering
- JavaScript/TypeScript
- HTML/CSS plus SASS/LESS
- UI/UX design principles
- Front-end frameworks: React, Angular, or Vue
- Cloud AI/ML services across Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP)
- Vertex AI experience
- You should reside within a commutable distance of your assigned office with the ability to commute daily, if required
- You can expect to co-locate on average 3 times a week with variations based on types of work/projects and client locations
- Ability to travel up to 50%, on average, based on the work you do and the clients/sectors you serve
- Limited immigration sponsorship may be available.
Preferred:
- Cloud certification (AWS, Azure, or GCP) and/or AI/ML certification.
- Experience with deep learning frameworks (e.g., PyTorch, TensorFlow, Keras).
- Familiarity with AI/GenAI ethics and governance frameworks and implementing controls in production.
The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $151,470 to $218,025.
You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.
Education:Bachelor's DegreeEmployment Type: