D. valued for ontology and knowledge graph roles. • B.S. with 7+ years and M.S. with 5+ years of data architecture, data engineering, or scientific informatics' experience. • SQL skills and ...
D. valued for ontology and knowledge graph roles. • B.S. with 7+ years and M.S. with 5+ years of data architecture, data engineering, or scientific informatics' experience. • SQL skills and ...
Data Architect, Data Foundry
Louisville, CO · On-site
$66.50 - $85.75/hr
D. valued for ontology and knowledge graph roles. * B.S. with 7+ years and M.S. with 5+ years of data architecture, data engineering, or scientific informatics' experience. * SQL skills and ...
Data Architect, Data Foundry
Louisville, CO · On-site
$66.50 - $85.75/hr
D. valued for ontology and knowledge graph roles. * B.S. with 7+ years and M.S. with 5+ years of data architecture, data engineering, or scientific informatics' experience. * SQL skills and ...
RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools. • Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific ...
RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools. • Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific ...
ML Engineer
Denver, CO · On-site +1
Machine Learning Engineer (Llama AI Platform) Location: Remote (Preferred U.S. Time Zones ... Knowledge graph implementations. * Business intelligence and analytics platforms. Success Metrics ...
Quick apply
ML Engineer
Denver, CO · On-site +1
Machine Learning Engineer (Llama AI Platform) Location: Remote (Preferred U.S. Time Zones ... Knowledge graph implementations. * Business intelligence and analytics platforms. Success Metrics ...
... knowledge graph development, transformation, and vectorization to support Retrieval-Augmented ... D. in Computer Science, Software Engineering, Robotics, Computer Engineering, Data Science ...
... knowledge graph development, transformation, and vectorization to support Retrieval-Augmented ... D. in Computer Science, Software Engineering, Robotics, Computer Engineering, Data Science ...
Principal Software Engineer
Boulder, CO · On-site
$185K - $250K/yr
A modern, cloud-first infrastructure built on Knowledge Graph using Neo4J and Terraform and Kubernetes . * High-impact autonomy: engineers research, design, and deliver major systems end-to-end.
Quick apply
Principal Software Engineer
Boulder, CO · On-site
$185K - $250K/yr
A modern, cloud-first infrastructure built on Knowledge Graph using Neo4J and Terraform and Kubernetes . * High-impact autonomy: engineers research, design, and deliver major systems end-to-end.
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Knowledge graphs - Working on graph data management and knowledge discovery, and creating a natural ... Strong programming (Python, C++) and algorithmic skills. * Good communication skills. Curious ...
Staff AI Engineer
Boulder, CO · On-site +1
$200K - $275K/yr
We are seeking a Staff AI Engineer to architect, build, and operationalize these systems at scale ... Strong experience with multimodal LLMs, knowledge graph, data synthesis, LLM fine tuning ...
Staff AI Engineer
Boulder, CO · On-site +1
$200K - $275K/yr
We are seeking a Staff AI Engineer to architect, build, and operationalize these systems at scale ... Strong experience with multimodal LLMs, knowledge graph, data synthesis, LLM fine tuning ...
AI/ML Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · On-site +1
$144K - $246K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Engineer
Colorado Springs, CO · On-site +1
$144K - $246K/yr
... graph, hybrid). * Write clean, efficient Python code for data ingestion, feature engineering ... Working knowledge of vector databases (Milvus, Pinecone, Chroma, FAISS) and retrieval-based ...
AI/ML Software Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
This role is for a hands-on engineer and someone who loves writing code, building systems end-to ... Familiarity with databases such as PostgreSQL, MongoDB, or graph databases. * Knowledge of ...
AI/ML Software Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
This role is for a hands-on engineer and someone who loves writing code, building systems end-to ... Familiarity with databases such as PostgreSQL, MongoDB, or graph databases. * Knowledge of ...
Utilize knowledge graph databases to unify, contextualize, and activate industrial data, ensuring ... engineers, data scientists, and RPA developers. * Establish monitoring, cost optimization ...
Utilize knowledge graph databases to unify, contextualize, and activate industrial data, ensuring ... engineers, data scientists, and RPA developers. * Establish monitoring, cost optimization ...
AI/ML Software Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
FTI Defense is seeking an AI/ML Software Engineer to design, build, and deploy secure, scalable ... Familiarity with databases such as PostgreSQL, MongoDB, or graph databases. * Knowledge of ...
AI/ML Software Engineer
Colorado Springs, CO · Remote
$190K - $220K/yr
FTI Defense is seeking an AI/ML Software Engineer to design, build, and deploy secure, scalable ... Familiarity with databases such as PostgreSQL, MongoDB, or graph databases. * Knowledge of ...
Knowledge Graph Engineer information
Which IT job is the highest paid?
What engineers make 500,000?
What engineers make 300,000 a year?
Are knowledge graphs the future?
Full-time
Posted 13 days ago
Eli Lilly and Company rating
8.8
Based on 62 frontline employees who took The Breakroom Quiz
10th of 73 rated pharmaceutical
Job description
Eli Lilly and Company is a global healthcare leader headquartered in Indianapolis, Indiana, dedicated to making life better for people around the world. They are seeking Data Architects to design and build the data infrastructure that enables AI-native drug discovery, focusing on creating schemas, ontologies, data models, and platform architectures that transform scientific data into actionable insights.
Responsibilities:
• Design and implement data models, schemas, and ontologies for chemical, biological, and automation-generated data that serve discovery workflows across the portfolio.
• Define and maintain controlled vocabularies, metadata standards, and FAIR-compliant data frameworks in partnership with Preparedness4Insight.
• Implement semantic data standards (RDF, OWL, SPARQL) and ontology engineering practices to create interoperable, machine-readable scientific data.
• Design and implement data lakehouse architecture using modern platforms (Databricks, Snowflake, or equivalent), including data storage patterns, partitioning strategies, and query optimization.
• Build and optimize ETL/ELT pipelines using Spark, dbt, or similar tools to transform raw scientific data into analytical and ML-ready formats.
• Implement real-time and streaming data integration (Kafka, Kinesis, event-driven patterns) connecting LIMS, instruments, and lab automation systems to the data infrastructure.
• Design and implement knowledge graphs (Neo4j, Amazon Neptune, TigerGraph) that capture molecular, target, pathway, and experimental relationships across the discovery landscape.
• Architect specialized data solutions: array databases (TileDB) for genomics/imaging, document stores (MongoDB) for experimental records, and vector databases for embedding-based retrieval supporting ML and RAG workflows.
• Build query and traversal patterns that enable scientists and AI agents to ask relational questions across the entire data landscape.
• Partner with scientific software engineers to ensure data architectures are implementable, performant, and well-documented.
• Collaborate with Methods4Insight to design data structures that support analytical model training, deployment, and evaluation.
• Work with Tech@Lilly to define scaling strategies, ensure enterprise compliance, and transition data architectures to production-grade management.
• Contribute to build-versus-buy-versus-adopt decisions by evaluating commercial and open-source data platforms against Data Foundry requirements.
Qualifications:
Required:
• B.S. or M.S. in Computer Science, Data Science, Bioinformatics, Computational Biology, Information Science, or related STEM field; Ph.D. valued for ontology and knowledge graph roles.
• B.S. with 7+ years and M.S. with 5+ years of data architecture, data engineering, or scientific informatics' experience.
• SQL skills and experience in multiple database paradigms (relational, graph, document, columnar, key-value).
• Qualified applicants must be authorized to work in the United States on a full-time basis. Lilly will not provide support for or sponsor work authorization or visas for this role, including but not limited to F-1 CPT, F-1 OPT, F-1 STEM OPT, J-1, H-1B, TN, O-1, E-3, H-1B1, or L-1.
Preferred:
• Expertise in at least one of: data modeling/ontologies, data platform engineering (Databricks, Snowflake, Spark), or graph/specialized databases (Neo4j, Neptune, MongoDB).
• Familiarity with cloud platforms (AWS, Azure, or GCP) and modern data integration patterns.
• Understanding of scientific data types and experimental workflows in life sciences or pharma (chemical, biological, HTE data).
• Strong communication skills with ability to translate data architecture concepts for both technical and scientific audiences.
• Pharmaceutical or biotech research industry experience, particularly in discovery data management or research informatics.
• Experience with semantic web technologies: RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools.
• Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific data.
• Data lakehouse architecture experience: Databricks (Delta Lake, Unity Catalog), Snowflake, or equivalent; ETL/ELT with Spark, dbt.
• Experience with streaming/real-time data platforms (Kafka, Kinesis, Flink) and event-driven architectures.
• Familiarity with LIMS, ELN systems (e.g., Benchling), and laboratory instrument data integration.
• Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications.
• Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional scientific data.
• Experience with bioinformatics data formats (FASTA, BAM/CRAM, VCF) and biological sequence databases; familiarity with NGS data pipelines and proteomics data management.
• FAIR data principles implementation experience and Data Readiness Level frameworks.
• Scientific data standards and controlled vocabularies in chemistry (InChI, SMILES) or biology (Gene Ontology, UniProt, pathway databases such as Reactome or KEGG).
Company:
We're a medicine company turning science into healing to make life better for people around the world. Founded in 1876, the company is headquartered in Indianapolis, USA, with a team of 10001+ employees. The company is currently Late Stage.
What Eli Lilly and Company employees say
Pay
Benefits
Hours and flexibility
Workplace
Get the full story on Breakroom
About Eli Lilly
Sourced by ZipRecruiter
Eli Lilly, based in Indianapolis, IN, US, is one of the pioneers in the pharmaceutical industry with a rich history dating back to 1876. This global pharmaceutical company focuses on discovering, developing, manufacturing and selling pharmaceutical products in approximately 120 countries. The company's product categories include endocrinology, oncology, cardiovascular, neuroscience, and immunology. Having invested over $9 billion in research and development in the past decade, Eli Lilly is also committed to creating high-quality medicines that meet real needs. As a recipient of several awards and recognitions, Eli Lilly is known for its focus on life-saving research and drug development. Their mission is to make medicines that help people live longer, healthier, and more active lives.
Industry
Pharmaceutical product wholesalers
Company size
10,000+ Employees
Headquarters location
Indianapolis, IN, US
Year founded
1876