1

Salaried Rag Jobs in Santa Rosa, CA (NOW HIRING)

Data Engineer (Founding Team)

Bodega Bay, CA · On-site

$135K - $163K/yr

Competitive salary + early-stage equity Backed by 8VC, we're building a world-class team to tackle ... Familiar with fine-tuning LLMs or enabling RAG pipelines using enterprise knowledge * Experience ...

Competitive base salary and equity package * Comprehensive medical, dental, and vision coverage ... RAG, Synthetic Data, Data Engineering, Data Pipelines, ETL, Data Processing, Web Crawling, Data ...

Competitive base salary and equity package * Comprehensive medical, dental, and vision coverage ... RAG, Synthetic Data, Data Engineering, Data Pipelines, ETL, Data Processing, Web Crawling, Data ...

DevOps Engineer (Founding Team)

Bodega Bay, CA · On-site

$62.50 - $85.75/hr

Competitive salary + meaningful equity (founding tier) Backed by 8VC, we're building a world-class ... Building retrieval-augmented generation (RAG) pipelines -- and deploying them safely and repeatably

Salaried Rag information

See Santa Rosa, CA salary details

$35K

$63.7K

$91.3K

How much do salaried rag jobs pay per year?

As of Jun 21, 2026, the average yearly pay for salaried rag in Santa Rosa, CA is $63,681.00, according to ZipRecruiter salary data. Most workers in this role earn between $53,600.00 and $71,100.00 per year, depending on experience, location, and employer.

What is the difference between Salaried Rag vs Salaried Technician?

AspectSalaried RagSalaried Technician
Required CredentialsHigh school diploma or equivalent, specialized trainingHigh school diploma, technical certification or associate degree
Work EnvironmentOffice or field-based, depending on industryIndustrial, manufacturing, or technical settings
Employer & Industry UsageMedia, printing, or creative industriesManufacturing, maintenance, or technical services
Common Search & ComparisonYesYes

The comparison shows that Salaried Rag and Salaried Technician share similar credential requirements and are used in related industries. Salaried Rag typically refers to roles in media or creative fields, while Salaried Technician is common in technical and industrial sectors. Both roles involve specialized skills and are salaried positions, but their work environments and industry applications differ.

What are the most commonly searched types of Rag jobs in Santa Rosa, CA? The most popular types of Rag jobs in Santa Rosa, CA are:
What job categories do people searching Salaried Rag jobs in Santa Rosa, CA look for? The top searched job categories for Salaried Rag jobs in Santa Rosa, CA are:
What cities near Santa Rosa, CA are hiring for Salaried Rag jobs? Cities near Santa Rosa, CA with the most Salaried Rag job openings:

ML/AI Research Engineer -- Agentic AI Lab (Founding Team)

Fabrion

Bodega Bay, CA • On-site

Full-time

Posted 14 days ago


Job description

ML/AI Research Engineer — Agentic AI Lab (Founding Team)

Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + meaningful equity (founding tier)

Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.

About the Role

We’re designing the future of enterprise AI infrastructure — grounded in agents, retrieval-augmented generation (RAG), knowledge graphs, and multi-tenant governance.

We’re looking for an ML/AI Research Engineer to join our AI Lab and lead the design, training, evaluation, and optimization of agent-native AI models. You'll work at the intersection of LLMs, vector search, graph reasoning, and reinforcement learning — building the intelligence layer that sits on top of our enterprise data fabric.

This isn’t a prompt engineer role. It’s full-cycle ML: from data curation and fine-tuning to evaluation, interpretability, and deployment — with cost-awareness, alignment, and agent coordination all in scope.

Core Responsibilities

  • Fine-tune and evaluate open-source LLMs (e.g. LLaMA 3, Mistral, Falcon, Mixtral) for enterprise use cases with both structured and unstructured data

  • Build and optimize RAG pipelines using LangChain, LangGraph, LlamaIndex, or Dust — integrated with our vector DBs and internal knowledge graph

  • Train agent architectures (ReAct, AutoGPT, BabyAGI, OpenAgents) using enterprise task data

  • Develop embedding-based memory and retrieval chains with token-efficient chunking strategies

  • Create reinforcement learning pipelines to optimize agent behaviors (e.g. RLHF, DPO, PPO)

  • Establish scalable evaluation harnesses for LLM and agent performance, including synthetic evals, trace capture, and explainability tools

  • Contribute to model observability, drift detection, error classification, and alignment

  • Optimize inference latency and GPU resource utilization across cloud and on-prem environments

Desired Experience

Model Training:

  • Deep experience fine-tuning open-source LLMs using HuggingFace Transformers, DeepSpeed, vLLM, FSDP, LoRA/QLoRA

  • Worked with both base and instruction-tuned models; familiar with SFT, RLHF, DPO pipelines

  • Comfortable building and maintaining custom training datasets, filters, and eval splits

  • Understand tradeoffs in batch size, token window, optimizer, precision (FP16, bfloat16), and quantization

RAG + Knowledge Graphs:

  • Experience building enterprise-grade RAG pipelines integrated with real-time or contextual data

  • Familiar with LangChain, LangGraph, LlamaIndex, and open-source vector DBs (Weaviate, Qdrant, FAISS)

  • Experience grounding models with structured data (SQL, graph, metadata) + unstructured sources

  • Bonus: Worked with Neo4j, Puppygraph, RDF, OWL, or other semantic modeling systems

Agent Intelligence:

  • Experience training or customizing agent frameworks with multi-step reasoning and memory

  • Understand common agent loop patterns (e.g. Plan→Act→Reflect), memory recall, and tools

  • Familiar with self-correction, multi-agent communication, and agent ops logging

Optimization:

  • Strong background in token cost optimization, chunking strategies, reranking (e.g. Cohere, Jina), compression, and retrieval latency tuning

  • Experience running models under quantized (int4/int8) or multi-GPU settings with inference tuning (vLLM, TGI)

Preferred Tech Stack

  • LLM Training & Inference: HuggingFace Transformers, DeepSpeed, vLLM, FlashAttention, FSDP, LoRA

  • Agent Orchestration: LangChain, LangGraph, ReAct, OpenAgents, LlamaIndex

  • Vector DBs: Weaviate, Qdrant, FAISS, Pinecone, Chroma

  • Graph Knowledge Systems: Neo4j, Puppygraph, RDF, Gremlin, JSON-LD

  • Storage & Access: Iceberg, DuckDB, Postgres, Parquet, Delta Lake

  • Evaluation: OpenLLM Evals, Trulens, Ragas, LangSmith, Weight & Biases

  • Compute: Ray, Kubernetes, TGI, Sagemaker, LambdaLabs, Modal

  • Languages: Python (core), optionally Rust (for inference layers) or JS (for UX experimentation)

Soft Skills & Mindset

  • Startup DNA: resourceful, fast-moving, and capable of working in ambiguity

  • Deep curiosity about agent-based architectures and real-world enterprise complexity

  • Comfortable owning model performance end-to-end: from dataset to deployment

  • Strong instincts around explainability, safety, and continuous improvement

  • Enjoy pair-designing with product and UX to shape capabilities, not just APIs

Why This Role Matters

This role is foundational to our thesis: that agents + enterprise data + knowledge modeling can create intelligent infrastructure for real-world, multi-billion-dollar workflows. Your work won’t be buried in research reports — it will be productionized and activated by hundreds of users and hundreds of thousands of decisions. If this is your dream role - we would love to hear from you.