AI Developer
Memphis, TN · On-site +1
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...
Memphis, TN · On-site +1
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...
Memphis, TN · On-site +1
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. * Design and implement scaffolding and orchestration around LLMs ...
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. * Design and implement scaffolding and orchestration around LLMs ...
Memphis, TN · On-site +1
$60K - $78K/yr
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...
Memphis, TN · On-site +1
$60K - $78K/yr
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...
$59K - $77K/yr
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. * Design and implement scaffolding and orchestration around LLMs ...
$59K - $77K/yr
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. * Design and implement scaffolding and orchestration around LLMs ...
Familiar with RAG (Retrieval-Augmented Generation) pipelines, vector databases, or orchestration frameworks. * Capable of working with GraphQL, microservices, or event-driven systems in fast-paced ...
Familiar with RAG (Retrieval-Augmented Generation) pipelines, vector databases, or orchestration frameworks. * Capable of working with GraphQL, microservices, or event-driven systems in fast-paced ...
Memphis, TN · On-site
$125K - $162K/yr
... vector databases, and LLM-based retrieval systems is highly desirable About Us Since opening our first store in 1979, AutoZone has grown into a leading retailer and distributor of automotive parts ...
Memphis, TN · On-site
$125K - $162K/yr
... vector databases, and LLM-based retrieval systems is highly desirable About Us Since opening our first store in 1979, AutoZone has grown into a leading retailer and distributor of automotive parts ...
Memphis, TN · On-site
$54.25 - $72.50/hr
Build and maintain infrastructure for Retrieval-Augmented Generation (RAG) pipelines and vector databases * Configure GPU-enabled node pools and optimize resource allocation for AI/ML workloads
Memphis, TN · On-site
$54.25 - $72.50/hr
Build and maintain infrastructure for Retrieval-Augmented Generation (RAG) pipelines and vector databases * Configure GPU-enabled node pools and optimize resource allocation for AI/ML workloads
$125K - $162K/yr
... vector databases, and LLM-based retrieval systems is highly desirable Program Leadership Lead end-to-end execution of search platform initiatives from concept through production Drive alignment ...
$125K - $162K/yr
... vector databases, and LLM-based retrieval systems is highly desirable Program Leadership Lead end-to-end execution of search platform initiatives from concept through production Drive alignment ...
Memphis, TN · On-site
$101K - $139K/yr
Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization. Legacy ...
Quick apply
Memphis, TN · On-site
$101K - $139K/yr
Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization. Legacy ...
$93K - $128K/yr
Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization. • ...
$93K - $128K/yr
Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization. • ...
$93K - $128K/yr
Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization.
$93K - $128K/yr
Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization.
| Aspect | Vector Databases | Data Engineers |
|---|---|---|
| Required Skills | Database management, data modeling, query optimization | Data pipeline development, ETL processes, programming |
| Work Environment | Data storage systems, AI/ML projects, cloud platforms | Data infrastructure, cloud environments, big data tools |
| Industry Usage | AI, machine learning, recommendation systems | Data integration, analytics, data architecture |
While Vector Databases focus on storing and querying high-dimensional vector data for AI applications, Data Engineers build and maintain data pipelines and infrastructure to support data analysis and machine learning workflows. Both roles are essential in data-driven industries but serve different functions within the data ecosystem.
Full-time
Posted 4 days ago
PURPOSE OF POSITION Responsible for model integration, data pipelines, retrieval infrastructure, and the engineering scaffolding required to ship reliable, secure, and cost-effective Artificial Intelligence (AI) features. This role ensures the delivery of production-grade Large Language Model (LLM) systems that meet real-world demands for performance, cost-efficiency, and governance. MINIMUM QUALIFICATIONS Education: Master's degree preferred.
Bachelor's in Computer Science, Data Science, AI, or related field with equivalent experience considered, or related field or equivalent practical experience. Training and Experience: 3-7 years in backend development, AI systems, or related roles, with a focus on LLMs integration or retrieval systems. General Skills: Must have strong software engineering fundamentals and a deep understanding of working with LLMs in production environments.
The ideal candidate brings hands-on experience with Python and modern data tooling and is comfortable building robust pipelines that connect unstructured content, structured data, and retrieval systems to power context-aware LLM workflows. You should demonstrate fluency in the design and reasoning of data movement processes, including ingestion, preprocessing, vector indexing, and query generation. Experience working with both open-weight and API-based large language models is also essential.
This role requires a practical mindset, a strong command of SQL and retrieval strategies over relational data, and the ability to experiment, evaluate, and iterate toward scalable, cost-effective, and trustworthy AI features. Required Skills: Mastery in Python, including experience with modern practices in structuring, testing, and maintaining codebases. Orchestrated Retrieval-Augmented Generation (RAG) systems, including document chunking, embedding, vector search, and grounded context construction.
Expertise with PostgreSQL and pgvector, including schema design and structured retrieval over relational data. Robust operational understanding with SQL query generation, particularly in the context of semantic or hybrid retrieval. Comprehensive background integrating and orchestrating LLMs, with a focus on prompt templating, tool usage, and response parsing.
Familiarity with Google ADK or equivalent frameworks for LLM scaffolding and orchestration. Proficient in utilizing unstructured and structured data, including ingestion from PDFs, DOCX, Markdown, HTML, and APIs. Experience deploying and debugging LLM systems, including containerization (Docker), API-based LLM integration (e.g., Ollama or vLLM), and environment configuration
Preferred Skills Background with graph-enhanced retrieval, using tools like Neo4j or ArangoDB, and an understanding of when and how to apply knowledge graphs to improve LLM grounding. Versed in model adaptation techniques, including LoRA, QLoRA, or PEFT approaches for fine-tuning or personalization. Expert in designing and implementing advanced prompt optimization frameworks, including developing automated evaluation systems and troubleshooting complex failure modes to enhance AI model performance and reliability.
Proven ability to design end-to-end hybrid search and reranking pipelines, such as ColBERT, BGE rerankers, or commercial tools like Cohere Rerank. Expertise with infrastructure optimizations, such as autoscaling (KEDA, HPA), Redis caching layers, or efficient streaming and batching. Demonstrated skill in safe deployment practices, including prompt injection mitigation and handling of sensitive or regulated data.
Clearance: Must be able to obtain/maintain a Secret clearance. Prefer holds an active Secret clearance. DUTIES & RESPONSIBILITIES Design and implement end-to-end RAG architectures, including document ingestion, chunking, embedding generation, vector indexing, query planning, retrieval, and response synthesis.
Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs, including prompt templating, tool invocation, evaluation harnesses, and safety guards. Develop data processing pipelines for structured and unstructured content (PDF, DOCX, HTML, Markdown, databases, APIs); implement normalization, deduplication, PII redaction, and metadata enrichment.
Implement and optimize retrieval strategies and context construction (citation, source attribution, grounding). Adapt retrieval and embedding strategies to domain-specific taxonomies, ontologies, or structured schemas; support contextual retrieval from hierarchical or relational sources. Productionize LLM-based systems: containerize components (Docker), deploy orchestration via Kubernetes or serverless platforms, implement observability (OpenTelemetry, logging, tracing), and manage configuration.
Measure and improve quality: define offline and online evals, golden datasets, A/B tests, hallucination detection, toxicity filters, and guardrails. Optimize performance and cost: batching, caching, streaming, and efficient context management. Implement security, privacy, and compliance best practices including access controls, injection defense, and safe data handling.
Develop solutions that can run entirely on-premise or in air-gapped environments, prioritizing data sovereignty and privacy. Various other duties in direct support of accomplishment of primary duties listed. SUPERVISORY/MANAGEMENT RESPONSIBILITY None.