... vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications. • Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional ...
... vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications. • Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional ...
... vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications. • Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional ...
... vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications. • Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional ...
ML Engineer
Indianapolis, IN · On-site +1
Vector databases such as pgvector, Chroma, Pinecone, Weaviate, or Qdrant. * Docker and containerized deployments. * Kubernetes orchestration. * Azure AI infrastructure and GPU environments. * CI/CD ...
Quick apply
ML Engineer
Indianapolis, IN · On-site +1
Vector databases such as pgvector, Chroma, Pinecone, Weaviate, or Qdrant. * Docker and containerized deployments. * Kubernetes orchestration. * Azure AI infrastructure and GPU environments. * CI/CD ...
Senior Computer Vision Engineer
Newburgh, IN · On-site
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Senior Computer Vision Engineer
Newburgh, IN · On-site
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Senior Computer Vision Engineer
Newburgh, IN · On-site
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Senior Computer Vision Engineer
Newburgh, IN · On-site
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Senior Computer Vision Engineer
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Senior Computer Vision Engineer
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Senior Computer Vision Engineer
Newburgh, IN · On-site
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Quick apply
Senior Computer Vision Engineer
Newburgh, IN · On-site
$99K - $136K/yr
Experience with SQL databases, vector embeddings, and vector databases such as Milvus, pgvector, Pinecone, Chroma, etc. * Experience with multimodal models such as CLIP, GPT-4V, Llama * Experience ...
Lead Data Engineer (Data Engineer)
Bloomington, IN · On-site
$105K - $127K/yr
Familiarity with vector databases (e.g., Marqo, Milvus, Pinecone) and embedding models for semantic search and retrieval-augmented generation (RAG). * Proficiency in building and maintaining RESTful ...
Lead Data Engineer (Data Engineer)
Bloomington, IN · On-site
$105K - $127K/yr
Familiarity with vector databases (e.g., Marqo, Milvus, Pinecone) and embedding models for semantic search and retrieval-augmented generation (RAG). * Proficiency in building and maintaining RESTful ...
AI Engineer/ML Engineer - Senior Developers - AI Training - Indianapolis, US
Indianapolis, IN · On-site +1
$80/hr
Vector Databases: familiarity with Pinecone, Milvus, or Weaviate for RAG evaluation. Why Prolific is a great platform to join as a Participant Joining our Expert Network will give you the chance to ...
Quick apply
AI Engineer/ML Engineer - Senior Developers - AI Training - Indianapolis, US
Indianapolis, IN · On-site +1
$80/hr
Vector Databases: familiarity with Pinecone, Milvus, or Weaviate for RAG evaluation. Why Prolific is a great platform to join as a Participant Joining our Expert Network will give you the chance to ...
Pinecone Vector Databases information
What is a Pinecone Vector Database?
What are the key skills and qualifications needed to thrive as a Pinecone Vector Database Engineer, and why are they important?
What are some common challenges faced by engineers working with Pinecone Vector Databases, and how can they be addressed?
What is the difference between Pinecone Vector Databases vs Data Engineers?
| Aspect | Pinecone Vector Databases | Data Engineers |
|---|---|---|
| Primary Role | Managing and deploying vector database solutions for AI/ML applications | Designing, building, and maintaining data pipelines and infrastructure |
| Skills & Certifications | Knowledge of vector databases, cloud platforms, programming (Python, SQL) | Data modeling, ETL processes, cloud services, programming (Python, Java) |
| Work Environment | Tech companies, AI startups, cloud providers | Data-driven organizations, tech firms, finance, healthcare |
While Pinecone Vector Databases specialists focus on deploying and managing vector database solutions for AI applications, Data Engineers build and maintain the data infrastructure that supports these systems. Both roles require programming skills and familiarity with cloud platforms, but their core responsibilities differ: one centers on database management, the other on data pipeline development.
- Home Based Electrical Engineer Student
- Remote Manual Testing Insurance Domain
- Generative Ai Developer
- Manager Prompt Engineering
- Remote Protocol Engineer
- Entry Level Remote Python
- Datacenter Engineer Salary
- Freelance Graduate Electrical Engineer
- Work From Home Avaya Support Engineer
- Contract Electrical Project Manager
Full-time
Posted 11 days ago
Eli Lilly and Company rating
8.8
Based on 62 frontline employees who took The Breakroom Quiz
10th of 73 rated pharmaceutical
Job description
Eli Lilly and Company is a global healthcare leader headquartered in Indianapolis, Indiana, focused on making life better for people around the world. They are seeking Data Architects to design and build the data infrastructure necessary for AI-native drug discovery, transforming raw scientific data into actionable insights for both scientists and AI agents.
Responsibilities:
• Design and implement data models, schemas, and ontologies for chemical, biological, and automation-generated data that serve discovery workflows across the portfolio.
• Define and maintain controlled vocabularies, metadata standards, and FAIR-compliant data frameworks in partnership with Preparedness4Insight.
• Implement semantic data standards (RDF, OWL, SPARQL) and ontology engineering practices to create interoperable, machine-readable scientific data.
• Design and implement data lakehouse architecture using modern platforms (Databricks, Snowflake, or equivalent), including data storage patterns, partitioning strategies, and query optimization.
• Build and optimize ETL/ELT pipelines using Spark, dbt, or similar tools to transform raw scientific data into analytical and ML-ready formats.
• Implement real-time and streaming data integration (Kafka, Kinesis, event-driven patterns) connecting LIMS, instruments, and lab automation systems to the data infrastructure.
• Design and implement knowledge graphs (Neo4j, Amazon Neptune, TigerGraph) that capture molecular, target, pathway, and experimental relationships across the discovery landscape.
• Architect specialized data solutions: array databases (TileDB) for genomics/imaging, document stores (MongoDB) for experimental records, and vector databases for embedding-based retrieval supporting ML and RAG workflows.
• Build query and traversal patterns that enable scientists and AI agents to ask relational questions across the entire data landscape.
• Partner with scientific software engineers to ensure data architectures are implementable, performant, and well-documented.
• Collaborate with Methods4Insight to design data structures that support analytical model training, deployment, and evaluation.
• Work with Tech@Lilly to define scaling strategies, ensure enterprise compliance, and transition data architectures to production-grade management.
• Contribute to build-versus-buy-versus-adopt decisions by evaluating commercial and open-source data platforms against Data Foundry requirements.
Qualifications:
Required:
• M.S. or PhD in Computer Science, Data Science, Bioinformatics, Computational Biology, Information Science, or related STEM field
• MS (with 6+ years) and PhD (with 2+ years) of data architecture, data engineering, or scientific informatics experience.
• Deep expertise in at least one of the focus areas: relational databases, data modeling and ontology engineering, data platform and lakehouse architecture (Databricks, Snowflake, Spark), or knowledge graph and specialized database systems (Neo4j, Neptune, MongoDB, TileDB)
Preferred:
• Working familiarity with multiple database paradigms — relational, graph, document, columnar, key-value — and strong SQL skills.
• Understanding of scientific data types and experimental workflows in life sciences or pharma (chemical, biological, HTE data).
• Strong communication skills with ability to translate data architecture concepts for both technical and scientific audiences.
• Familiarity with cloud platforms (AWS, Azure, or GCP) and modern data integration patterns.
• Pharmaceutical or biotech research industry experience, particularly in discovery data management or research informatics.
• Experience with semantic web technologies: RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools.
• Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific data.
• Data lakehouse architecture experience: Databricks (Delta Lake, Unity Catalog), Snowflake, or equivalent; ETL/ELT with Spark, dbt.
• Experience with streaming/real-time data platforms (Kafka, Kinesis, Flink) and event-driven architectures.
• Familiarity with LIMS, ELN systems (e.g., Benchling), and laboratory instrument data integration.
• Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications.
• Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional scientific data.
• FAIR data principles implementation experience and Data Readiness Level frameworks.
• Scientific data standards and controlled vocabularies in chemistry (InChI, SMILES) or biology (Gene Ontology, UniProt).
• Experience with C, C++, or Rust for performance-critical data processing; familiarity with HPC data I/O patterns for large-scale scientific computations.
Company:
We're a medicine company turning science into healing to make life better for people around the world. Founded in 1876, the company is headquartered in Indianapolis, USA, with a team of 10001+ employees. The company is currently Late Stage.
What Eli Lilly and Company employees say
Pay
Benefits
Hours and flexibility
Workplace
Get the full story on Breakroom
About Eli Lilly
Sourced by ZipRecruiter
Eli Lilly, based in Indianapolis, IN, US, is one of the pioneers in the pharmaceutical industry with a rich history dating back to 1876. This global pharmaceutical company focuses on discovering, developing, manufacturing and selling pharmaceutical products in approximately 120 countries. The company's product categories include endocrinology, oncology, cardiovascular, neuroscience, and immunology. Having invested over $9 billion in research and development in the past decade, Eli Lilly is also committed to creating high-quality medicines that meet real needs. As a recipient of several awards and recognitions, Eli Lilly is known for its focus on life-saving research and drug development. Their mission is to make medicines that help people live longer, healthier, and more active lives.
Industry
Pharmaceutical product wholesalers
Company size
10,000+ Employees
Headquarters location
Indianapolis, IN, US
Year founded
1876