Design end-to-end AI solutions spanning Generative AI (RAG, CAG, GraphRAG, fine-tuning, model distillation) and agentic AI (tool-using agents, multi-agent orchestration, MCP-based integrations)
Design end-to-end AI solutions spanning Generative AI (RAG, CAG, GraphRAG, fine-tuning, model distillation) and agentic AI (tool-using agents, multi-agent orchestration, MCP-based integrations)
Artificial Intelligence Specialist
Colorado Springs, CO · On-site +1
$109K - $203K/yr
RAG patterns and agentic frameworks (LangGraph); Python web/API development (FastAPI, Flask, Django) Local AI model stacks (vLLM, LiteLLM, Ollama); reverse proxies (Caddy, Nginx, Traefik); vector ...
Artificial Intelligence Specialist
Colorado Springs, CO · On-site +1
$109K - $203K/yr
RAG patterns and agentic frameworks (LangGraph); Python web/API development (FastAPI, Flask, Django) Local AI model stacks (vLLM, LiteLLM, Ollama); reverse proxies (Caddy, Nginx, Traefik); vector ...
Principal AI Engineer (Software & AI Labs)
$133K - $178K/yr
Developing Agentic/RAG solutions for developer workflows and DevSecOps Processes with an objective ... AI-led Productivity & SDLC Acceleration * Lead technical evaluations and rollouts of AI tools for ...
Principal AI Engineer (Software & AI Labs)
$133K - $178K/yr
Developing Agentic/RAG solutions for developer workflows and DevSecOps Processes with an objective ... AI-led Productivity & SDLC Acceleration * Lead technical evaluations and rollouts of AI tools for ...
Artificial Intelligence Specialist
$109K - $203K/yr
RAG patterns and agentic frameworks (LangGraph); Python web/API development (FastAPI, Flask, Django) Local AI model stacks (vLLM, LiteLLM, Ollama); reverse proxies (Caddy, Nginx, Traefik); vector ...
Artificial Intelligence Specialist
$109K - $203K/yr
RAG patterns and agentic frameworks (LangGraph); Python web/API development (FastAPI, Flask, Django) Local AI model stacks (vLLM, LiteLLM, Ollama); reverse proxies (Caddy, Nginx, Traefik); vector ...
AI Developer/Engineer
Denver, CO · On-site
$111K - $145K/yr
Implement and maintain retrieval-augmented generation (RAG) solutions, including document ingestion, chunking, embeddings, vector search, and prompt orchestration. * Integrate AI applications with ...
AI Developer/Engineer
Denver, CO · On-site
$111K - $145K/yr
Implement and maintain retrieval-augmented generation (RAG) solutions, including document ingestion, chunking, embeddings, vector search, and prompt orchestration. * Integrate AI applications with ...
AI Developer/Engineer
Denver, CO · On-site +1
$111K - $145K/yr
Implement and maintain retrieval-augmented generation (RAG) solutions, including document ingestion, chunking, embeddings, vector search, and prompt orchestration. * Integrate AI applications with ...
AI Developer/Engineer
Denver, CO · On-site +1
$111K - $145K/yr
Implement and maintain retrieval-augmented generation (RAG) solutions, including document ingestion, chunking, embeddings, vector search, and prompt orchestration. * Integrate AI applications with ...
AI Engineering
Denver, CO · On-site
8+ years of management experience 4+ years of LLM experience (fine-tuning, RAG, prompt engineering, agentic) 8+ years of ML/Data Science Experience Someone who has been delivering AI/ML models into ...
Quick apply
AI Engineering
Denver, CO · On-site
8+ years of management experience 4+ years of LLM experience (fine-tuning, RAG, prompt engineering, agentic) 8+ years of ML/Data Science Experience Someone who has been delivering AI/ML models into ...
AI Data Engineer
Denver, CO · On-site
$117K - $141K/yr
AI Data Engineer: Location: Denver, CO Experience: 2-5 Years Job Summary We are seeking an AI Data ... Experience with OpenAI, LangChain, RAG architectures. * Knowledge of MLOps.
AI Data Engineer
Denver, CO · On-site
$117K - $141K/yr
AI Data Engineer: Location: Denver, CO Experience: 2-5 Years Job Summary We are seeking an AI Data ... Experience with OpenAI, LangChain, RAG architectures. * Knowledge of MLOps.
This role focuses on helping every engineer at Strive design, build, and ship AI-enabled software safely and effectively - from AI-assisted development workflows to agentic and RAG-based applications ...
This role focuses on helping every engineer at Strive design, build, and ship AI-enabled software safely and effectively - from AI-assisted development workflows to agentic and RAG-based applications ...
AI Engineer, Generative AI Agents
Denver, CO · On-site
$135K - $220K/yr
LLM serving and inference, RAG pipelines, evaluation harnesses and the APIs and infrastructure that ... Design and develop intelligent AI agents capable of intent recognition and decomposing complex ...
AI Engineer, Generative AI Agents
Denver, CO · On-site
$135K - $220K/yr
LLM serving and inference, RAG pipelines, evaluation harnesses and the APIs and infrastructure that ... Design and develop intelligent AI agents capable of intent recognition and decomposing complex ...
Senior AI/ML Engineer
Almont, CO · On-site
$90 - $100/hr
The role includes LLM orchestration, RAG pipelines, vector database integration, and multi-agent systems. The engineer will build predictive models, apply document AI with transformers, and integrate ...
Senior AI/ML Engineer
Almont, CO · On-site
$90 - $100/hr
The role includes LLM orchestration, RAG pipelines, vector database integration, and multi-agent systems. The engineer will build predictive models, apply document AI with transformers, and integrate ...
Your work will span classical machine learning, agentic AI systems, chatbots and custom copilots, RAG-based applications, and enterprise adoption strategies for our clients. You bring a strong ...
Your work will span classical machine learning, agentic AI systems, chatbots and custom copilots, RAG-based applications, and enterprise adoption strategies for our clients. You bring a strong ...
AI/ML Engineer
Colorado Springs, CO · Remote
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AI/ML Engineer
Colorado Springs, CO · Remote
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
Senior AI DevOps Engineer
$96K - $137K/yr
Implement complex AI workflows using frameworks like LangChain and LlamaIndex to drive agentic ... Deliver high-quality RAG pipelines utilizing Milvus vector databases to improve data retrieval ...
Senior AI DevOps Engineer
$96K - $137K/yr
Implement complex AI workflows using frameworks like LangChain and LlamaIndex to drive agentic ... Deliver high-quality RAG pipelines utilizing Milvus vector databases to improve data retrieval ...
AI/ML Engineer
Colorado Springs, CO · On-site +1
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AI/ML Engineer
Colorado Springs, CO · On-site +1
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AI/ML Engineer
Colorado Springs, CO · Remote
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AI/ML Engineer
Colorado Springs, CO · Remote
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AI/ML Engineer
Colorado Springs, CO · Remote
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AI/ML Engineer
Colorado Springs, CO · Remote
$144K - $246K/yr
... architectures (RAG, graph, hybrid). * Write clean, efficient Python code for data ingestion ... Integrate AI services into real-world systems via APIs, event-driven workflows, or UI copilots.
AWS Senior Architect with AI Ops Architecture (15 years exp)
Denver, CO · On-site
$66.75 - $87.50/hr
Hands-on experience with LangChain, LangFuse, Llama 3.2 LLM, and RAG-based architectures. * Agentic AI & MCP : Drive implementation of Agentic AI systems and Model Context Protocol (MCP), A2A for ...
AWS Senior Architect with AI Ops Architecture (15 years exp)
Denver, CO · On-site
$66.75 - $87.50/hr
Hands-on experience with LangChain, LangFuse, Llama 3.2 LLM, and RAG-based architectures. * Agentic AI & MCP : Drive implementation of Agentic AI systems and Model Context Protocol (MCP), A2A for ...
The AI Solution Architect is the builderandowns thedataarchitecture, the Python code, the ... Build andoptimizeRetrieval-Augmented Generation (RAG) pipelines end to end - including document ...
The AI Solution Architect is the builderandowns thedataarchitecture, the Python code, the ... Build andoptimizeRetrieval-Augmented Generation (RAG) pipelines end to end - including document ...
Sr. Staff AI/ML Engineer
Denver, CO · On-site
$245K - $272K/yr
AI is a fundamental part of how work gets done at Gusto. We expect all team members to actively ... This includes developing core platform capabilities - agent orchestration, RAG infrastructure, eval ...
Sr. Staff AI/ML Engineer
Denver, CO · On-site
$245K - $272K/yr
AI is a fundamental part of how work gets done at Gusto. We expect all team members to actively ... This includes developing core platform capabilities - agent orchestration, RAG infrastructure, eval ...
Ai Rag information
What are the key skills and qualifications needed to thrive as an AI Researcher, and why are they important?
What is the difference between Ai Rag vs Data Analyst?
| Aspect | Ai Rag | Data Analyst |
|---|---|---|
| Required Credentials | Typically a diploma or certification in AI, machine learning, or related fields | Bachelor's degree in statistics, mathematics, or related fields |
| Work Environment | Tech companies, AI startups, research labs | Business, finance, healthcare, and various industries |
| Employer & Industry Usage | Primarily in AI development and research | Across industries for data interpretation and decision-making |
| Common Search & Comparison | Yes | Yes |
Ai Rag and Data Analyst roles share overlapping skills in data handling and analysis, but Ai Rag focuses more on AI-specific applications and machine learning, while Data Analysts concentrate on interpreting data to inform business decisions. Both roles are vital in data-driven industries, with Ai Rag often working in AI development environments and Data Analysts supporting strategic insights across sectors.
What are AI RAGs?
What are some common challenges faced by AI RAG (Retrieval-Augmented Generation) engineers when integrating retrieval systems with large language models?

Other
Medical, Dental, Vision, Life, Retirement, PTO
Posted 23 days ago
Job description
Why you'll love Softchoice:
We are a software-focused IT solutions and services provider that equips organizations to be agile and innovative, and for their people to be engaged, connected, and creative at work. That means moving them to the cloud, helping them build the workplace of tomorrow, and enabling them to make smarter decisions about their technology. By doing these things we help them create success for their customers and their people.
We stand proudly for our people and support their success through career development and advancement. We are recognized and respected for our culture of inclusion and belonging, continuously striving to do what's good for our people and communities.Â
Â
The impact you'll have:
We are seeking a Senior Technical Solutions Architect - AI to serve as a hands-on, platform-agnostic technical architect for our strategic AI engagements. This person sits at the intersection of customer strategy, applied AI engineering, and modern software delivery. They translate ambiguous business problems into working prototypes, scalable reference architectures, and production-grade solutions across public-cloud hyperscaler AI platforms and sovereign (on-premise / private) AI environments.
The ideal candidate is equally comfortable whiteboarding an agentic architecture with a CIO, writing the proof-of-concept code that proves it works, and guiding a client engineering team through the secure path to production. They are vendor-fluent but vendor-neutral - recommending the right tool for the workload, the data, the risk profile, and the budget.
What you'll do
Solutioning & Architecture
- Design end-to-end AI solutions spanning Generative AI (RAG, CAG, GraphRAG, fine-tuning, model distillation) and agentic AI (tool-using agents, multi-agent orchestration, MCP-based integrations).
- Architect across all major hyperscaler AI stacks - AWS (Bedrock, SageMaker, Q), Microsoft Azure (Azure AI Foundry, Azure OpenAI), and Google Cloud (Vertex AI, Gemini) - and recommend the right platform per workload rather than defaulting to a single provider.
- Architect sovereign / on-premise AI solutions using stacks such as NVIDIA AI Enterprise (NIM, NeMo, Blueprints), Dell AI Factory, HPE Private Cloud AI, Red Hat OpenShift AI, Run:ai, and open-source model serving (vLLM, TGI, Ollama) - for clients with data residency, regulatory, IP, or air-gapped requirements.
- Develop reusable reference architectures, decision frameworks, and trade-off analyses (cost, latency, accuracy, governance, sovereignty) that scale across the practice.
Rapid Prototyping
- Build working prototypes - not just slides. Translate client problem statements into functional demos and pilots in days, not months.
- Stand up RAG, CAG, and agentic workflows quickly using frameworks such as LangChain / LangGraph, LlamaIndex, CrewAI, AutoGen, Semantic Kernel, and MCP-compliant agent toolchains.
- Integrate vector stores (Pinecone, Weaviate, Milvus, Chroma, pgvector, OpenSearch), graph stores (Neo4j, Neptune), and hybrid retrieval pipelines as the use case demands.
- Run rigorous, repeatable evals on prototypes (groundedness, faithfulness, latency, cost-per-task, tool-use accuracy) so recommendations are evidence-based.
AI-Native Engineering & Modernization
- Lead solutioning for AI-native software engineering engagements: AI-assisted development, code refactoring at scale, tech debt burndown, legacy modernization, test generation, and documentation regeneration.
- Architect Secure SDLC (SSDLC) practices into every AI-built or AI-assisted codebase - threat modeling, SAST/DAST integration, SBOM generation, dependency hygiene, secrets management, and supply-chain security.
- Advise clients on integrating AI coding agents (Claude Code, Cursor, GitHub Copilot Workspace, Devin, and others) into their existing SDLC and DevSecOps toolchains without compromising guardrails.
- Define MLOps / LLMOps / AgentOps patterns: model and prompt versioning, evaluation pipelines, observability (traces, token usage, drift), guardrails, and human-in-the-loop review.
AI Security
- Conduct AI-specific threat modeling for every solution - covering adversarial inputs, prompt injection, jailbreaking, model inversion, training data extraction, and indirect injection via tool outputs or retrieved documents - and translate findings into concrete mitigations in the architecture.
- Design multi-layer guardrail architectures: input sanitization and intent classification, output filtering (PII redaction, toxicity screening, factual grounding checks), content safety policies, and fallback / refusal handling - covering both hosted API models and self-hosted open-weight deployments.
- Enforce least-privilege access control for agentic systems: scope tool permissions, define agent authorization boundaries, audit and log all tool invocations, and ensure agents cannot escalate privileges or exfiltrate data outside approved boundaries.
- Maintain end-to-end AI supply chain security: vet third-party model weights and datasets for backdoors or poisoning, validate fine-tuned model integrity, enforce cryptographic signing of model artifacts, and apply model cards and datasheets as governance artifacts.
- Align AI solutions to applicable compliance frameworks - NIST AI RMF, OWASP LLM Top 10, ISO/IEC 42001, EU AI Act, and relevant sector-specific regulations - and produce the risk documentation, impact assessments, and audit trails clients need to satisfy internal governance and external regulators.
Client Engagement & Enablement
- Serve as the senior technical voice in client conversations - from executive briefings through deep technical design sessions.
- Partner with sales, delivery, and practice leadership to scope statements of work, estimate effort, and de-risk delivery.
- Mentor architects, engineers, and consultants across the broader AI practice; raise the technical bar through code reviews, internal enablement, and reusable assets.
- Stay ahead of the field - evaluate emerging models, frameworks, and protocols (e.g., MCP, A2A, ACP, new agent frameworks, new sovereign AI stacks) and bring well-reasoned points of view back to the practice.
What you'll bring to the table:
- 8+ years of progressive experience in software engineering, solutions / Enterprise architecture, or applied AI/ML, with at least 2+ years in a hands-on Generative AI or agentic AI role.
- Demonstrated ability to rapidly prototype AI solutions and ship working code - not just designs or documents.
- Deep, hands-on experience with at least one of the three major hyperscaler AI platforms (AWS, Azure, GCP) and a working understanding of the second and third.
- Production experience designing and shipping RAG and/or agentic systems, including practical familiarity with chunking strategies, embedding model selection, retrieval evaluation, and orchestration patterns.
- Working knowledge of MCP (Model Context Protocol) and modern agent-tool integration patterns; ability to design MCP servers and clients, and to reason about when MCP is the right abstraction versus alternatives.
- Strong understanding of CAG (Cache-Augmented Generation), RAG variants (naive, hybrid, GraphRAG, agentic RAG), and the trade-offs between each.
- Proficiency in Python; comfort in at least one additional language (TypeScript/JavaScript, Go, Java, or C#).
- Experience integrating with enterprise systems: REST/GraphQL APIs, event streams (Kafka, EventBridge), identity (OIDC, SAML, OAuth2), and enterprise data platforms (Snowflake, Databricks, Fabric, BigQuery).
- Excellent written and verbal communication; able to move fluidly between executive narrative and engineering whiteboard.
- Foundational fluency in AI security concepts: able to identify and articulate risks such as prompt injection, data poisoning, model extraction, and inference-time attacks, and to reason about appropriate mitigations for each in the context of a given architecture and risk tolerance.
Strongly Preferred
- Software development background with real production experience across the SDLC and Secure SDLC (SSDLC) - including CI/CD, infrastructure as code (Terraform, Pulumi, Bicep), containers and Kubernetes, and DevSecOps tooling.
- Experience leading code refactoring, technical debt remediation, and legacy modernization programs - ideally with AI-assisted approaches.
- Experience designing sovereign / on-premise AI deployments: NVIDIA NIM / NeMo, OpenShift AI, Run:ai, vLLM at scale, GPU capacity planning, and on-prem vector / graph stores.
- Background in security and governance: prompt injection defense, output filtering, data loss prevention, model risk management, NIST AI RMF, ISO/IEC 42001, and EU AI Act readiness; familiarity with the OWASP LLM Top 10, adversarial ML attack taxonomies (MITRE ATLAS), and red-teaming / evaluation techniques for LLMs; experience translating these frameworks into practical control designs rather than checkbox compliance.
- Experience fine-tuning, distilling, or post-training open-weight models (Llama, Mistral, Qwen, Gemma) for enterprise use cases.
- Industry experience in regulated verticals (financial services, healthcare, public sector, defense) where sovereignty and compliance are non-negotiable.
- Relevant certifications (AWS / Azure / GCP AI specialty, CKA/CKAD, CISSP, NVIDIA-certified) - useful, but capability is weighted more heavily than credentials.
Education
Bachelor's degree in Computer Science, Engineering, Mathematics, or a related technical field, or equivalent demonstrable experience. Advanced degree is welcomed but not required.
What Sets a Great Candidate Apart
- A pragmatic, opinionated point of view on when not to use GenAI or agents - and the judgment to steer clients toward the right answer even when it isn't the flashy one.
- Curiosity that runs ahead of the market: already experimenting with the next protocol, the next model, the next orchestration pattern before clients ask.
- Comfort with ambiguity - the ability to walk into a half-formed problem, frame it, prototype against it, and leave the client with a clearer path forward than they had that morning.
Location & Travel
Remote-friendly with periodic travel to client sites and internal events (estimated 0-15%).
Compensation:
Corporate/Pay Mix: A reasonable estimate of the current base pay range for this position is $124,320 to $155,400 annually + 30% Target Incentives
Actual salary will be based on a variety of factors, including location, experience, skill set, education, and related certification. The range for this position in other geographic locations may differ.
Softchoice offers a comprehensive and competitive benefit plan to all full-time employees, which includes:
- Health and Wellbeing: Medical, Dental, and Vision Care, Flexible Spending Account, Employee Assistance Program
- Financial Benefits: 401k Plan with Company Matching, Life and Disability Insurance
- Paid Time Off: PTO and Sick Leave (starting at 20 days per year), Holidays, Parental Leave, Volunteer Days, Bereavement Leave
- Additional Perks: Employee Discount Program
Not sure if you qualify? Think about applying anyway:
We understand that not everyone brings 100% of the skills and experience for the role.
At Softchoice, we offer opportunities to a diverse group including those with a variety of workplace experiences and backgrounds. Whether you are new to corporate tech, returning to work after a gap in employment, or looking to transition and take the next step in your career, we are excited to learn more about you and encourage you to apply.
Why You'll Love Working Here:
- The People: You'll thrive in our collaborative environment, surrounded by incredible colleagues who foster support and innovation, driving our collective success
- High-Performing Culture: At Softchoice, we are dedicated to achieving our goals and committed to success for our customers and each other
- Flexibility: Plan your workdays in a way that suits you best
- Award-Winning Workplace: Proudly recognized as a Great Place to Work for 20Â consecutive years
- Inclusive Culture: We are committed to an inclusive culture where every team member can be their authentic self
- Competitive Benefits: Benefit from competitive perks that start on day one
Inclusion & Equal opportunity employment:
We are an equal opportunity employer committed to diversity, inclusion & belonging. People seeking employment at Softchoice are considered without regard to any protected category including but not limited to, race, color, religion, national origin, age, sex, marital status, ancestry, disability, veteran status, gender identity, or sexual orientation.
Require accommodation? We are ready to help:
We are proud to provide interview & employment accommodation during the recruitment and hiring process. If you require any accommodation to apply or interview for a position, please reach out directly to asktalentacquisition@softchoice.com. We are committed to working with you to best meet your needs.
Our commitment to your experience:
We are committed to the safety of all applicants and team members. With that in mind, we ...
About Softchoice
Sourced by ZipRecruiter
Industry
It services
Company size
1,001 - 5,000 Employees
Headquarters location
Toronto, OH, US