1

Vector Databases Jobs in Allen, TX (NOW HIRING)

Production experience with vector databases and designing ingestion + embedding pipelines for both batch and streaming workloads. Hands-on with prompt design, evaluation, LLM orchestration, and RAG ...

Production experience with vector databases and designing ingestion + embedding pipelines for both batch and streaming workloads. * Hands-on with prompt design, evaluation, LLM orchestration, and RAG ...

Build and optimize Retrieval-Augmented Generation (RAG) pipelines using vector databases and embedding models. * Integrate and work with LLMs such as Anthropic Claude, Google Gemini, and other ...

Deep understanding of LLMs, embeddings, vector databases (e.g., FAISS, Pinecone, Weaviate). Experience with cloud platforms (AWS, Azure, GCP) and containerization (Docker, Kubernetes). Familiarity ...

Deep understanding of LLMs, embeddings, vector databases (e.g., FAISS, Pinecone, Weaviate). * Experience with cloud platforms (AWS, Azure, GCP) and containerization (Docker, Kubernetes)

Lead Gen AI Engineer

Plano, TX · On-site

$98K - $129K/yr

Deep understanding of LLMs, embeddings, vector databases (e.g., FAISS, Pinecone, Weaviate). Experience with cloud platforms (AWS, Azure, GCP) and containerization (Docker, Kubernetes). Familiarity ...

Sr. Data/GenAI Engineer

Irving, TX · On-site

$101K - $138K/yr

... vector database. o Serve as the subject matter expert on data sources, liaising with data owners to understand structures, access patterns, and semantics. o Ensure the data ingestion pipeline is ...

The ideal candidate will have a solid background in machine learning along with handson expertise in LLMs, RAG, embeddings, vector databases, and Agentic AI frameworks such as CrewAI, AutoGen ...

The ideal candidate will have a solid background in machine learning along with hands‐on expertise in LLMs, RAG, embeddings, vector databases, and Agentic AI frameworks such as CrewAI, AutoGen ...

Lead AI/ML Engineer

Plano, TX · On-site

$98K - $129K/yr

Vector Databases & Retrieval Pipelines: Implement andmaintainvector stores (OpenSearch, Pinecone, Milvus,Qdrant) and design efficient similarity search, retrieval workflows, and indexing strategies.

next page

Showing results 1-20

Vector Databases information

What are vector databases?

Vector databases are specialized databases designed to store, manage, and search high-dimensional vector data, which is commonly generated from machine learning models, such as embeddings from natural language processing or image recognition. They enable efficient similarity search operations, such as finding the most similar items to a given query vector, which is essential for applications like recommendation systems, semantic search, and AI-powered search engines. Unlike traditional databases that handle structured or unstructured data, vector databases are optimized for fast and scalable similarity searches on large datasets of vectors.

What are some common challenges faced when working with vector databases, and how can they be addressed?

Professionals working with vector databases often encounter challenges such as efficiently scaling to handle large datasets, ensuring low-latency similarity searches, and integrating the database with machine learning pipelines. To address these, teams typically implement distributed architectures, fine-tune indexing strategies, and collaborate closely with data engineers and machine learning specialists. Staying updated with the latest developments in vector database technologies and maintaining clear communication with cross-functional teams are also key to overcoming these challenges.

What is the difference between Vector Databases vs Data Engineers?

AspectVector DatabasesData Engineers
Required SkillsDatabase management, data modeling, query optimizationData pipeline development, ETL processes, programming
Work EnvironmentData storage systems, AI/ML projects, cloud platformsData infrastructure, cloud environments, big data tools
Industry UsageAI, machine learning, recommendation systemsData integration, analytics, data architecture

While Vector Databases focus on storing and querying high-dimensional vector data for AI applications, Data Engineers build and maintain data pipelines and infrastructure to support data analysis and machine learning workflows. Both roles are essential in data-driven industries but serve different functions within the data ecosystem.

What are the key skills and qualifications needed to thrive as a Vector Database Engineer, and why are they important?

Success as a Vector Database Engineer requires a strong background in computer science, database management, and experience with machine learning or AI-driven data systems. Familiarity with vector database platforms (such as Pinecone, Milvus, or Weaviate), cloud infrastructure, and proficiency in languages like Python are typically expected. Strong problem-solving skills, effective communication, and the ability to work cross-functionally help engineers stand out. These competencies are vital to efficiently design, deploy, and maintain scalable vector search solutions that power modern AI applications.
What are popular job titles related to Vector Databases jobs in Allen, TX? For Vector Databases jobs in Allen, TX, the most frequently searched job titles are:
What job categories do people searching Vector Databases jobs in Allen, TX look for? The top searched job categories for Vector Databases jobs in Allen, TX are:
What cities near Allen, TX are hiring for Vector Databases jobs? Cities near Allen, TX with the most Vector Databases job openings:
Data AI Engineer with Vector Databases

Data AI Engineer with Vector Databases

Apex Informatics

Plano, TX • On-site

$109K - $131K/yr

Other

Posted 3 days ago


Job description

Title: Data AI Engineer with Vector Databases
Location: Plano, TX
Job Description:
Key Responsibilities:
  1. Design and build ETL/ELT pipelines and data processing workflows
  2. Develop batch and real-time data pipelines using modern frameworks
  3. Work with Python and SQL for data transformation and analytics
  4. Implement GenAI data architectures, including RAG pipelines and vector indexing
  5. Manage and optimize Vector Databases for embedding storage and similarity search
  6. Build secure data solutions on AWS, ensuring data quality and compliance
  7. Support analytics, reporting, and data modernization initiatives

Required Skills:
  1. Strong experience in Python and SQL
  2. Hands-on experience with ETL/ELT and data pipelines
  3. Mandatory: Experience with Vector Databases
  4. Experience with GenAI / LLM frameworks (LangChain or LangGraph)
  5. Experience with Big Data frameworks (Apache Spark, Apache Kafka)
  6. Workflow orchestration using Apache Airflow
  7. Experience with data platforms like Databricks or Snowflake
  8. AWS services: S3, Glue, Redshift

Nice to Have:
  1. Experience with ML frameworks (Scikit-learn, PyTorch)
  2. Knowledge of RAG architectures and embedding pipelines
  3. Experience in financial services / fintech environments