1

Retrieval Augmented Generation Rag Jobs in Berkeley, CA

Own production retrieval-augmented generation (RAG) pipelines and retrieval infrastructure including vector databases, embeddings, and indexing for domain-specific search at scale. * Implement multi ...

Own production retrieval-augmented generation (RAG) pipelines and retrieval infrastructure including vector databases, embeddings, and indexing for domain-specific search at scale. * Implement multi ...

Senior Data Architect

San Francisco, CA · On-site +1

$79.25 - $106/hr

Design vector-based data architectures and Retrieval Augmented Generation (RAG) patterns to enable LLM reporting and Agentic AI. * Standardization: Establish scalable data management frameworks and a ...

AI Engineer (Mid-Level)

San Francisco, CA · On-site

$180K - $400K/yr

Own production retrieval-augmented generation (RAG) pipelines and retrieval infrastructure, including vector databases, embeddings, and indexing for domain-specific search at scale. * Implement multi ...

Retrieval-augmented generation (RAG) pipelines and vector-based semantic search systems Representation learning and semantic embeddings for clustering, categorization, and content understanding Model ...

Experience with Reinforcement Learning (RLHF/RLCEF), Retrieval-Augmented Generation (RAG), or multi-agent AI systems Why Join Resolve AI? * Make a Real Impact : Join a mission-driven team tackling ...

Collaborate with other engineers on prompt engineering, model evaluation, and retrieval-augmented generation (RAG) pipelines that ground the agent in accurate, up-to-date knowledge. About you: * 7+ ...

Senior Manager, Data Science

San Bruno, CA

$80.25 - $107.50/hr

Lead design of GenAI platform components, including: o Orchestration and lifecycle management for LLMs and multimodal models o Retrieval-augmented generation (RAG) pipelines o Vector search and ...

Senior Machine Learning Engineer

San Francisco, CA · On-site

$123K - $169K/yr

Design and deploy advanced Large Language Model (LLM)-powered applications using modern Retrieval-Augmented Generation (RAG) and external AI APIs. * Develop robust content understanding and ...

Data Engineer

San Francisco, CA · On-site

$134K - $162K/yr

... Retrieval-Augmented Generation (RAG) systems integrating structured and unstructured enterprise data • Design and implement agentic workflows using frameworks such as LangChain, LangGraph ...

AI Engineer

San Mateo, CA · On-site

$160K - $180K/yr

Hands-on experience with agent frameworks (e.g., LangChain) and retrieval-augmented generation (RAG), including real-world implementation and operational use. * Experience building APIs and data ...

... models o Retrieval-augmented generation (RAG) pipelines o Vector search and embedding pipelines o Model serving, autoscaling, and high-availability inference systems • Architect end-to-end ...

next page

Showing results 1-20

Retrieval Augmented Generation Rag information

See Berkeley, CA salary details

$18

$24

$32

How much do retrieval augmented generation rag jobs pay per hour?

As of Jul 2, 2026, the average hourly pay for retrieval augmented generation rag in Berkeley, CA is $24.79, according to ZipRecruiter salary data. Most workers in this role earn between $21.20 and $25.91 per hour, depending on experience, location, and employer.
What are popular job titles related to Retrieval Augmented Generation Rag jobs in Berkeley, CA? For Retrieval Augmented Generation Rag jobs in Berkeley, CA, the most frequently searched job titles are:
What job categories do people searching Retrieval Augmented Generation Rag jobs in Berkeley, CA look for? The top searched job categories for Retrieval Augmented Generation Rag jobs in Berkeley, CA are:
What cities near Berkeley, CA are hiring for Retrieval Augmented Generation Rag jobs? Cities near Berkeley, CA with the most Retrieval Augmented Generation Rag job openings:

AI Engineer (Mid-Level)

Clera

San Francisco, CA • On-site

Full-time

Posted 28 days ago


Job description

About the Role
This is a mid-level AI Engineer role on the core product team, focused on building agentic systems that automate complex, multi-step workflows across regulated and enterprise domains. You'll work across the full stack to ship production LLM-based services, ensure reliability and safety, and collaborate with leadership, product, and design to deliver measurable user impact.
What You'll Do
  • Design, build, and maintain agentic systems that automate complex, multi-step workflows across healthcare, legal, fintech, logistics, and compliance domains.
  • Own production retrieval-augmented generation (RAG) pipelines and retrieval infrastructure including vector databases, embeddings, and indexing for domain-specific search at scale.
  • Implement multi-agent orchestration, tool-calling, memory, and reasoning components to deliver robust AI-driven user experiences.
  • Develop evaluation and safety infrastructure to measure model performance, surface regressions, and enforce enterprise-level trust and reliability.
  • Ship full-stack AI products from MVP to enterprise-grade by designing APIs and data models, implementing frontend and backend code, and operating production systems with CI/CD, monitoring, and testing.
  • Collaborate with leadership, product, and design to prioritize work, define success metrics, and iterate based on user feedback and telemetry.
What We're Looking For
  • 2-8 years of software engineering experience with demonstrated delivery of shipped user-facing or backend products.
  • Practical experience deploying LLMs or LLM-based services in production, including prompt design, orchestration, and tool integration.
  • Proficiency across the stack: Python plus TypeScript/React (or equivalent), and experience with cloud platforms (AWS or GCP) and relational or NoSQL databases.
  • Working knowledge of RAG patterns, vector databases, embeddings, and retrieval pipelines, with sound judgment to choose appropriate approaches.
  • Experience building automated tests, evaluations, and monitoring for AI systems to ensure reliability beyond demos.
  • Experience with agent or workflow frameworks and orchestration tools.
  • Familiarity with fine-tuning, parameter-efficient tuning, or multi-modal model integration.
  • Background building multi-tenant or enterprise-ready systems, or experience in regulated industries such as healthcare, fintech, or legal.
  • Experience designing API-driven, high-throughput systems and real-time product features.
  • Proven ownership delivering end-to-end features from data model to deploy and monitoring, with a user-centric and pragmatic engineering mindset.
Location
Remote or San Francisco, CA, United States.