1

Biological Data Science Jobs (NOW HIRING)

You'll work at the intersection of computational biology, machine learning, and drug development ... Lead and execute complex data science projects that directly advance our drug development portfolio

Strong experience in data science, bioinformatics, computational biology, or a related field * Experience working with biological knowledgebases, public datasets, or ontology driven systems

Strong experience in data science, bioinformatics, computational biology, or a related field * Experience working with biological knowledgebases, public datasets, or ontology driven systems

next page

Showing results 1-20

Biological Data Science information

See salary details

$32.5K

$47.3K

$70K

How much do biological data science jobs pay per year?

As of May 28, 2026, the average yearly pay for biological data science in the United States is $47,326.00, according to ZipRecruiter salary data. Most workers in this role earn between $37,500.00 and $52,000.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Biological Data Scientist, and why are they important?

To thrive as a Biological Data Scientist, you need a strong background in biology, statistics, and data analysis, often supported by an advanced degree in bioinformatics, computational biology, or a related field. Familiarity with programming languages like Python or R, experience with bioinformatics tools (e.g., BLAST, Bioconductor), and knowledge of databases such as GenBank are typically required. Strong problem-solving, collaboration, and communication skills help you interpret complex data and effectively share findings with interdisciplinary teams. These skills are crucial for deriving meaningful biological insights from large datasets and driving innovation in research or healthcare.

What are some common challenges faced by professionals in Biological Data Science, and how can they be addressed?

Professionals in Biological Data Science often encounter challenges such as managing large and complex datasets, integrating diverse data types (e.g., genomics, proteomics, clinical data), and staying current with rapidly evolving analytical tools. Addressing these challenges typically involves strong collaboration with interdisciplinary teams, continuous learning of new computational methods, and implementing robust data management practices. Building effective communication skills is also important, as conveying complex findings to non-specialist stakeholders is a frequent part of the role.

What is Biological Data Science?

Biological Data Science is an interdisciplinary field that combines biology, computer science, statistics, and mathematics to analyze and interpret complex biological data. Professionals in this field work with large datasets generated from genomics, proteomics, and other biological experiments to uncover meaningful patterns and insights. They use computational tools and algorithms to advance research in areas such as medicine, agriculture, and environmental science. The goal is to translate raw biological data into actionable knowledge that can drive scientific discovery and innovation.

What is the difference between Biological Data Science vs Bioinformatics?

AspectBiological Data ScienceBioinformatics
Required CredentialsDegree in Data Science, Biology, or related fields; programming skillsDegree in Bioinformatics, Biology, or related fields; computational skills
Work EnvironmentResearch labs, biotech companies, healthcareResearch labs, healthcare, academic institutions
Employer & Industry UsageTech companies, pharmaceuticals, research institutionsAcademic, healthcare, biotech firms
Common Search & ComparisonYesYes

Biological Data Science focuses on analyzing biological data using advanced data analytics, machine learning, and statistical methods. Bioinformatics emphasizes developing algorithms and software for understanding biological data, often with a stronger focus on computational biology. While both roles require programming skills and biological knowledge, Biological Data Science is more data analytics-oriented, whereas Bioinformatics centers on computational tool development.

More about Biological Data Science jobs
What cities are hiring for Biological Data Science jobs? Cities with the most Biological Data Science job openings:
What are the most commonly searched types of Biological Data Science jobs? The most popular types of Biological Data Science jobs are:
What states have the most Biological Data Science jobs? States with the most job openings for Biological Data Science jobs include:
Infographic showing various Biological Data Science job openings in the United States as of May 2026, with employment types broken down into 1% As Needed, 82% Full Time, 13% Part Time, and 4% Contract. Highlights an 95% Physical, 4% Hybrid, and 1% Remote job distribution, with an average salary of $47,326 per year, or $22.8 per hour.

Sr Staff Data Scientist, Virtual Biology Initiative

Biohub

Redwood City, CA โ€ข On-site, Remote

Other

Retirement, PTO

This job post hasย expired today.ย Applications are no longer accepted.


Job description

Sr Staff Data Scientist, Virtual Biology Initiative, AI Research

New York, NY (Hybrid); Redwood City, CA (Hybrid)

Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose system to accelerate scientific discovery, integrating frontier AI models, biological foundation models, and lab capabilities, with the ultimate goal of curing disease. Our technology powers scientists around the world, translating AI capabilities into tools that accelerate research everywhere.

The Opportunity

In April 2026, Biohub launched the Virtual Biology Initiativeโ€”a $500 million, five-year commitment to galvanize a global effort to build predictive models of the human cell. This initiative will bring together leading institutions to generate the multi-modal biological data, at unprecedented scale, that will power the next generation of AI models for biology while producing datasets of unprecedented size.

Our data science team defines the algorithms and processing approaches that turn raw biological measurements into rich representations models can actually learn from. That includes designing data formats and representations optimized for AI use cases, building cost-aware processing pipelines that balance expressiveness with efficiency, developing scalable QC and validation frameworks across modalities, creating agent-augmented curation tools for metadata extraction and ontology mapping, and building the cross-modal entity resolution and semantic infrastructure that ties it all together.

Both the scale and domain are active research areas. How do you tokenize a cell image? How do you represent a perturbation experiment? How do you combine transcriptomics with imaging in a way that preserves biological meaning? These questions don't have established answers. We need scientific leaders who can work at this frontier: people who understand biological measurement deeply, think creatively about data representations, sampling, and tokenization strategies, and can translate that thinking into data representations that enable novel training architectures.

You'll work directly with scientists, computational biologists, data engineers, and AI researchers to define model input and biological evaluations. You will operate with broad scope and high autonomy, influencing roadmap decisions across teams while mentoring senior individual contributors. Success in this role means creating and implementing data systems that are not only large, but adaptive, interpretable, and scientifically groundedโ€”accelerating progress toward robust biological frontier models and ultimately advancing human health.

What You'll Do
  • Set technical vision and strategy for the design of data representations and tokenization strategies across biological data typesโ€”including imaging, sequencing, and multimodal dataโ€”that enable novel model architectures
  • Develop, deploy and validate approaches for combining heterogeneous data modalities into unified training frameworks, designing for robustness to noise, bias, and batch effects
  • Evaluate model performance, identifying which biological signals are captured or lost and iterating to improve
  • Partner deeply with ML engineers and AI researchers to co-design datasets and optimize model training, evaluation, and generalization
  • Lead cross-functional initiatives spanning data engineering, infrastructure, science, and product, aligning technical execution with long-term scientific goals
  • Identify and drive new data acquisition and generation opportunities, from consortium partnerships to internal experimental pipelines
  • Serve as a technical mentor and leader, raising the bar for data science and ML rigor across the organization
What You'll Bring
  • 12+ years of experience (or PhD + 7 years) working with large-scale biological datasets, including ownership of end-to-end data products
  • Deep expertise in at least one of: (a) imaging dataโ€”microscopy, cell phenotyping, spatial biology, and the data characteristics of image-based biological measurement; or (b) genomics dataโ€”bulk and single-cell sequencing, functional genomics, epigenomics, transcriptomics, spatial biology, and/or multi-omics
  • Understanding of how to transform raw biological data into AI-ready datasets, including familiarity with scientific best practices, noise characteristics, batch effects, and quality assessment specific to your domain
  • Experience with tokenization strategies for non-text data (images, sequences, graphs, time series) or with creating data representations and feature engineering for machine learning in scientific or biological contexts
  • Strong expertise in data science and statistical modeling; familiarity with modern ML architectures (transformers, diffusion models, or similar) and how data representation choices affect learning
  • Strong computational skills; demonstrated ability to design robust, extensible data architectures
  • Excellent communication and leadership skills, with the ability to translate between biology, ML, and engineering audiences and align teams to deliver complex projects
  • Creative, first-principles thinking about how to structure data for learning
Compensation

The Redwood City, CA & New York City, NY base pay range for a new hire in this role is $241,000.00 - $331,100.00. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.

Better Together

As we grow, we're excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team's manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.

Benefits for the Whole You

We're thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.

  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving

If you're interested in a role but your previous experience doesn't perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.