Bioinformatics Data Engineer Jobs in Colorado (NOW HIRING)

Postdoctoral Fellow/Trainee

Aurora, CO · On-site

$49K - $67K/yr

... Bioinformatics, Biomedical Informatics, Computer Science, Physics, Engineering, or an equivalent ... Proven experience in methods development for multi-omic data analysis * Proven experience in ...

Bioinformatics Data Engineer information

How do Bioinformatics Data Engineers typically collaborate with researchers and other teams in a biomedical organization?

Bioinformatics Data Engineers often work closely with biologists, data scientists, and software engineers to ensure the effective collection, processing, and analysis of complex biological data. They regularly participate in cross-functional meetings to understand research goals, develop data pipelines, and troubleshoot data-related issues. Collaboration is essential, as engineers must translate scientific requirements into technical solutions, provide data access and visualization tools, and support researchers in extracting meaningful insights from large datasets. This teamwork fosters a dynamic environment where communication and adaptability are key.

What is the difference between a Bioinformatics Data Engineer and a Bioinformatics Analyst?

Aspect | Bioinformatics Data Engineer | Bioinformatics Analyst
Required Credentials | Bachelor's or Master's in Bioinformatics, Computer Science, or related fields; programming skills | Bachelor's or Master's in Bioinformatics, Biology, or related fields; data analysis skills
Work Environment | Data pipelines, database management, software development | Data interpretation, report generation, biological data analysis
Employer & Industry Usage | Biotech companies, research labs, pharma | Research institutions, healthcare, biotech
Common Search & Comparison | Focuses on data infrastructure and pipelines | Focuses on biological data interpretation

The main difference between a Bioinformatics Data Engineer and a Bioinformatics Analyst lies in their focus areas. Data Engineers build and maintain data pipelines and infrastructure, while Analysts interpret biological data to generate insights. Both roles require strong bioinformatics knowledge, but Data Engineers emphasize programming and data management, whereas Analysts focus on biological interpretation and reporting.

What is a Bioinformatics Data Engineer?

A Bioinformatics Data Engineer is a professional who designs, develops, and maintains data infrastructure for managing and analyzing large-scale biological data, such as genomics or proteomics datasets. They build pipelines and tools to process, store, and retrieve complex biological information efficiently. Their work enables researchers and scientists to access and interpret data for discoveries in fields like medicine, genetics, and biotechnology. Often, they collaborate closely with bioinformaticians, data scientists, and software engineers to support research initiatives.

What are the key skills and qualifications needed to thrive as a Bioinformatics Data Engineer, and why are they important?

To thrive as a Bioinformatics Data Engineer, you need a strong background in computer science, biology, and statistics, often supported by a relevant degree and experience in data engineering. Proficiency with programming languages (such as Python, R, or SQL), bioinformatics tools, cloud platforms, and big data frameworks (like Hadoop or Spark) is typically required. Strong problem-solving, collaboration, and communication skills help you work effectively across interdisciplinary teams and convey complex findings. These skills ensure accurate analysis, efficient data pipeline development, and meaningful insights that advance biological research and healthcare solutions.

What cities in Colorado are hiring for Bioinformatics Data Engineer jobs?

Cities in Colorado with the most Bioinformatics Data Engineer job openings:

Data Architect, Data Foundry

Eli Lilly and Company

Louisville, CO • On-site

Full-time

This job post expired 1 day ago. Applications are no longer accepted.


Eli Lilly and Company rating

Company rating: 8.8 out of 10

Based on 62 frontline employees who took The Breakroom Quiz

11th of 71 rated pharmaceutical


Job description

Job Summary:
Eli Lilly and Company is a global healthcare leader headquartered in Indianapolis, Indiana, dedicated to making life better for people around the world. They are seeking Data Architects to design and build the data infrastructure that enables AI-native drug discovery, focusing on creating schemas, ontologies, data models, and platform architectures that transform scientific data into actionable insights.
Responsibilities:
• Design and implement data models, schemas, and ontologies for chemical, biological, and automation-generated data that serve discovery workflows across the portfolio.
• Define and maintain controlled vocabularies, metadata standards, and FAIR-compliant data frameworks in partnership with Preparedness4Insight.
• Implement semantic data standards (RDF, OWL, SPARQL) and ontology engineering practices to create interoperable, machine-readable scientific data.
• Design and implement data lakehouse architecture using modern platforms (Databricks, Snowflake, or equivalent), including data storage patterns, partitioning strategies, and query optimization.
• Build and optimize ETL/ELT pipelines using Spark, dbt, or similar tools to transform raw scientific data into analytical and ML-ready formats.
• Implement real-time and streaming data integration (Kafka, Kinesis, event-driven patterns) connecting LIMS, instruments, and lab automation systems to the data infrastructure.
• Design and implement knowledge graphs (Neo4j, Amazon Neptune, TigerGraph) that capture molecular, target, pathway, and experimental relationships across the discovery landscape.
• Architect specialized data solutions: array databases (TileDB) for genomics/imaging, document stores (MongoDB) for experimental records, and vector databases for embedding-based retrieval supporting ML and RAG workflows.
• Build query and traversal patterns that enable scientists and AI agents to ask relational questions across the entire data landscape.
• Partner with scientific software engineers to ensure data architectures are implementable, performant, and well-documented.
• Collaborate with Methods4Insight to design data structures that support analytical model training, deployment, and evaluation.
• Work with Tech@Lilly to define scaling strategies, ensure enterprise compliance, and transition data architectures to production-grade management.
• Contribute to build-versus-buy-versus-adopt decisions by evaluating commercial and open-source data platforms against Data Foundry requirements.
Qualifications:
Required:
• B.S. or M.S. in Computer Science, Data Science, Bioinformatics, Computational Biology, Information Science, or related STEM field; Ph.D. valued for ontology and knowledge graph roles.
• B.S. with 7+ years or M.S. with 5+ years of experience in data architecture, data engineering, or scientific informatics.
• SQL skills and experience in multiple database paradigms (relational, graph, document, columnar, key-value).
• Qualified applicants must be authorized to work in the United States on a full-time basis. Lilly will not provide support for or sponsor work authorization or visas for this role, including but not limited to F-1 CPT, F-1 OPT, F-1 STEM OPT, J-1, H-1B, TN, O-1, E-3, H-1B1, or L-1.
Preferred:
• Expertise in at least one of: data modeling/ontologies, data platform engineering (Databricks, Snowflake, Spark), or graph/specialized databases (Neo4j, Neptune, MongoDB).
• Familiarity with cloud platforms (AWS, Azure, or GCP) and modern data integration patterns.
• Understanding of scientific data types and experimental workflows in life sciences or pharma (chemical, biological, HTE data).
• Strong communication skills with ability to translate data architecture concepts for both technical and scientific audiences.
• Pharmaceutical or biotech research industry experience, particularly in discovery data management or research informatics.
• Experience with semantic web technologies: RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools.
• Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific data.
• Data lakehouse architecture experience: Databricks (Delta Lake, Unity Catalog), Snowflake, or equivalent; ETL/ELT with Spark, dbt.
• Experience with streaming/real-time data platforms (Kafka, Kinesis, Flink) and event-driven architectures.
• Familiarity with LIMS, ELN systems (e.g., Benchling), and laboratory instrument data integration.
• Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications.
• Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional scientific data.
• Experience with bioinformatics data formats (FASTA, BAM/CRAM, VCF) and biological sequence databases; familiarity with NGS data pipelines and proteomics data management.
• FAIR data principles implementation experience and Data Readiness Level frameworks.
• Scientific data standards and controlled vocabularies in chemistry (InChI, SMILES) or biology (Gene Ontology, UniProt, pathway databases such as Reactome or KEGG).
Company:
We're a medicine company turning science into healing to make life better for people around the world. Founded in 1876, the company is headquartered in Indianapolis, USA, with a team of more than 10,000 employees. The company is currently Late Stage.

About Eli Lilly

Sourced by ZipRecruiter

Eli Lilly, based in Indianapolis, IN, US, is one of the pioneers in the pharmaceutical industry with a rich history dating back to 1876. This global pharmaceutical company focuses on discovering, developing, manufacturing and selling pharmaceutical products in approximately 120 countries. The company's product categories include endocrinology, oncology, cardiovascular, neuroscience, and immunology. Having invested over $9 billion in research and development in the past decade, Eli Lilly is also committed to creating high-quality medicines that meet real needs. As a recipient of several awards and recognitions, Eli Lilly is known for its focus on life-saving research and drug development. Their mission is to make medicines that help people live longer, healthier, and more active lives.

Industry

Pharmaceutical product wholesalers

Company size

10,000+ Employees

Headquarters location

Indianapolis, IN, US

Year founded

1876