1

Vector Databases Jobs in Memphis, TN (NOW HIRING)

Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...

Junior AI Developer

Memphis, TN · On-site +1

$60K - $78K/yr

Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs ...

Technical Program Manager

Memphis, TN · On-site

$125K - $162K/yr

... vector databases, and LLM-based retrieval systems is highly desirable About Us Since opening our first store in 1979, AutoZone has grown into a leading retailer and distributor of automotive parts ...

Systems Engineer - Cloud Ops

Memphis, TN · On-site

$54.25 - $72.50/hr

Build and maintain infrastructure for Retrieval-Augmented Generation (RAG) pipelines and vector databases * Configure GPU-enabled node pools and optimize resource allocation for AI/ML workloads

Technical Program Manager

Memphis, TN

$125K - $162K/yr

... vector databases, and LLM-based retrieval systems is highly desirable Program Leadership Lead end-to-end execution of search platform initiatives from concept through production Drive alignment ...

AI Sr. Engineer LLMOps & MLOps

Memphis, TN · On-site

$101K - $139K/yr

Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization. Legacy ...

Design and execute the infrastructure for Retrieval-Augmented Generation (RAG), including vector database management (OpenSearch, Pinecone, or Azure AI Search) and semantic index optimization.

Vector Databases information

What are vector databases?

Vector databases are specialized databases designed to store, manage, and search high-dimensional vector data, which is commonly generated from machine learning models, such as embeddings from natural language processing or image recognition. They enable efficient similarity search operations, such as finding the most similar items to a given query vector, which is essential for applications like recommendation systems, semantic search, and AI-powered search engines. Unlike traditional databases that handle structured or unstructured data, vector databases are optimized for fast and scalable similarity searches on large datasets of vectors.

What are some common challenges faced when working with vector databases, and how can they be addressed?

Professionals working with vector databases often encounter challenges such as efficiently scaling to handle large datasets, ensuring low-latency similarity searches, and integrating the database with machine learning pipelines. To address these, teams typically implement distributed architectures, fine-tune indexing strategies, and collaborate closely with data engineers and machine learning specialists. Staying updated with the latest developments in vector database technologies and maintaining clear communication with cross-functional teams are also key to overcoming these challenges.

What is the difference between Vector Databases vs Data Engineers?

AspectVector DatabasesData Engineers
Required SkillsDatabase management, data modeling, query optimizationData pipeline development, ETL processes, programming
Work EnvironmentData storage systems, AI/ML projects, cloud platformsData infrastructure, cloud environments, big data tools
Industry UsageAI, machine learning, recommendation systemsData integration, analytics, data architecture

While Vector Databases focus on storing and querying high-dimensional vector data for AI applications, Data Engineers build and maintain data pipelines and infrastructure to support data analysis and machine learning workflows. Both roles are essential in data-driven industries but serve different functions within the data ecosystem.

What are the key skills and qualifications needed to thrive as a Vector Database Engineer, and why are they important?

Success as a Vector Database Engineer requires a strong background in computer science, database management, and experience with machine learning or AI-driven data systems. Familiarity with vector database platforms (such as Pinecone, Milvus, or Weaviate), cloud infrastructure, and proficiency in languages like Python are typically expected. Strong problem-solving skills, effective communication, and the ability to work cross-functionally help engineers stand out. These competencies are vital to efficiently design, deploy, and maintain scalable vector search solutions that power modern AI applications.
What are popular job titles related to Vector Databases jobs in Memphis, TN? For Vector Databases jobs in Memphis, TN, the most frequently searched job titles are:

AI Developer

CTI

Memphis, TN • On-site, Remote

Full-time

Posted 4 days ago


Job description

PURPOSE OF POSITION Responsible for model integration, data pipelines, retrieval infrastructure, and the engineering scaffolding required to ship reliable, secure, and cost-effective Artificial Intelligence (AI) features. This role ensures the delivery of production-grade Large Language Model (LLM) systems that meet real-world demands for performance, cost-efficiency, and governance. MINIMUM QUALIFICATIONS Education: Master's degree preferred.

Bachelor's in Computer Science, Data Science, AI, or related field with equivalent experience considered, or related field or equivalent practical experience. Training and Experience: 3-7 years in backend development, AI systems, or related roles, with a focus on LLMs integration or retrieval systems. General Skills: Must have strong software engineering fundamentals and a deep understanding of working with LLMs in production environments.

The ideal candidate brings hands-on experience with Python and modern data tooling and is comfortable building robust pipelines that connect unstructured content, structured data, and retrieval systems to power context-aware LLM workflows. You should demonstrate fluency in the design and reasoning of data movement processes, including ingestion, preprocessing, vector indexing, and query generation. Experience working with both open-weight and API-based large language models is also essential.

This role requires a practical mindset, a strong command of SQL and retrieval strategies over relational data, and the ability to experiment, evaluate, and iterate toward scalable, cost-effective, and trustworthy AI features. Required Skills: Mastery in Python, including experience with modern practices in structuring, testing, and maintaining codebases. Orchestrated Retrieval-Augmented Generation (RAG) systems, including document chunking, embedding, vector search, and grounded context construction.

Expertise with PostgreSQL and pgvector, including schema design and structured retrieval over relational data. Robust operational understanding with SQL query generation, particularly in the context of semantic or hybrid retrieval. Comprehensive background integrating and orchestrating LLMs, with a focus on prompt templating, tool usage, and response parsing.

Familiarity with Google ADK or equivalent frameworks for LLM scaffolding and orchestration. Proficient in utilizing unstructured and structured data, including ingestion from PDFs, DOCX, Markdown, HTML, and APIs. Experience deploying and debugging LLM systems, including containerization (Docker), API-based LLM integration (e.g., Ollama or vLLM), and environment configuration

Preferred Skills Background with graph-enhanced retrieval, using tools like Neo4j or ArangoDB, and an understanding of when and how to apply knowledge graphs to improve LLM grounding. Versed in model adaptation techniques, including LoRA, QLoRA, or PEFT approaches for fine-tuning or personalization. Expert in designing and implementing advanced prompt optimization frameworks, including developing automated evaluation systems and troubleshooting complex failure modes to enhance AI model performance and reliability.

Proven ability to design end-to-end hybrid search and reranking pipelines, such as ColBERT, BGE rerankers, or commercial tools like Cohere Rerank. Expertise with infrastructure optimizations, such as autoscaling (KEDA, HPA), Redis caching layers, or efficient streaming and batching. Demonstrated skill in safe deployment practices, including prompt injection mitigation and handling of sensitive or regulated data.

Clearance: Must be able to obtain/maintain a Secret clearance. Prefer holds an active Secret clearance. DUTIES & RESPONSIBILITIES Design and implement end-to-end RAG architectures, including document ingestion, chunking, embedding generation, vector indexing, query planning, retrieval, and response synthesis.

Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation. Design and implement scaffolding and orchestration around LLMs, including prompt templating, tool invocation, evaluation harnesses, and safety guards. Develop data processing pipelines for structured and unstructured content (PDF, DOCX, HTML, Markdown, databases, APIs); implement normalization, deduplication, PII redaction, and metadata enrichment.

Implement and optimize retrieval strategies and context construction (citation, source attribution, grounding). Adapt retrieval and embedding strategies to domain-specific taxonomies, ontologies, or structured schemas; support contextual retrieval from hierarchical or relational sources. Productionize LLM-based systems: containerize components (Docker), deploy orchestration via Kubernetes or serverless platforms, implement observability (OpenTelemetry, logging, tracing), and manage configuration.

Measure and improve quality: define offline and online evals, golden datasets, A/B tests, hallucination detection, toxicity filters, and guardrails. Optimize performance and cost: batching, caching, streaming, and efficient context management. Implement security, privacy, and compliance best practices including access controls, injection defense, and safe data handling.

Develop solutions that can run entirely on-premise or in air-gapped environments, prioritizing data sovereignty and privacy. Various other duties in direct support of accomplishment of primary duties listed. SUPERVISORY/MANAGEMENT RESPONSIBILITY None.