1

Data Curator Jobs (NOW HIRING)

We are a Data Curation company collaborating with some of the most renowned pharmaceutical organizations in the world. Our team of scientists, curators, computational biologists, data scientists ...

Staff Data Scientist

San Francisco, CA · On-site

$220K - $280K/yr

Advanced Data Curation & Management: Develop comprehensive data curation strategies for improvements to AI models. Partner with MLOps to architect data management systems for algorithm training and ...

Working within a cross-functional team and reporting to a technical lead, you will operate across the machine learning development lifecycle, from data curation and synthetic data generation to model ...

Perform detailed data cleaning and curation activities including transformation, formatting, and resolving data inconsistencies or discrepancies. * Manage data queries and discrepancies, working with ...

next page

Showing results 1-20

Data Curator information

See salary details

$25.5K

$72.6K

$119.5K

How much do data curator jobs pay per year?

As of Jul 1, 2026, the average yearly pay for data curator in the United States is $72,627.00, according to ZipRecruiter salary data. Most workers in this role earn between $50,000.00 and $94,000.00 per year, depending on experience, location, and employer.

What does a Data Curator do?

A Data Curator is responsible for collecting, organizing, and maintaining datasets to ensure they are accurate, accessible, and usable for analysis. They work with data scientists, analysts, and other stakeholders to standardize metadata, establish data governance policies, and improve data quality. Data Curators often clean and enrich datasets, manage data repositories, and ensure compliance with regulatory standards. Their role is essential in making data more discoverable and valuable for decision-making and research.

What are the key skills and qualifications needed to thrive in the Data Curator position, and why are they important?

Success as a Data Curator requires strong analytical abilities, attention to detail, and a background in information management, data science, or a related field. Familiarity with data management tools, metadata standards (such as Dublin Core), and systems like relational databases and data repositories is highly advantageous, and certifications in data governance or data management can be beneficial. Excellent organizational skills, effective communication, and a collaborative mindset are key soft skills for this role. Mastery of these skills ensures data assets are accurately cataloged, accessible, and usable for stakeholders, supporting effective decision-making and research outcomes.

How to become a data curator?

To become a data curator, you typically need a bachelor's degree in information science, library science, computer science, or a related field. Developing skills in data management, metadata standards, and data curation tools such as Excel, SQL, or specialized software is essential, along with experience in data organization and quality control. Certifications in data management or related areas can enhance job prospects.

How much does a data curator make?

The average salary for a data curator typically ranges from $50,000 to $85,000 per year, depending on experience, education, and industry. Entry-level roles may start around $45,000, while experienced professionals with specialized skills in data management and metadata standards can earn higher salaries.

What qualifications do I need to be a curator?

A data curator typically needs a bachelor's degree in information science, library science, computer science, or a related field. Strong skills in data management, metadata standards, and familiarity with data curation tools are important, along with attention to detail and organizational abilities.

What does a data curator do?

A data curator is responsible for collecting, organizing, and maintaining data to ensure its quality, accuracy, and accessibility. They often use data management tools and follow standards to prepare data for analysis or sharing within organizations.

What does a typical day look like for a Data Curator, and who do they usually collaborate with?

A typical day for a Data Curator involves collecting, organizing, and annotating datasets, maintaining data integrity, and updating records to comply with quality standards. Data Curators frequently work alongside data analysts, researchers, and IT professionals to ensure data is properly structured and easily accessible for various projects. You may also interact with data stewards or subject matter experts to validate metadata or resolve inconsistencies. Collaboration and communication are integral to the role, as you help bridge technical teams and end users, ensuring that data systems meet the evolving needs of the organization.

More about Data Curator jobs
What cities are hiring for Data Curator jobs? Cities with the most Data Curator job openings:
What are the most commonly searched types of Data Curator jobs? The most popular types of Data Curator jobs are:
What states have the most Data Curator jobs? States with the most job openings for Data Curator jobs include:
Senior Biological Data Architect

$70 - $90/hr

Contractor

This job post has expired 1 day ago. Applications are no longer accepted.


Job description

About Rancho BioSciences, LLC
Rancho BioSciences is a fully remote, US-based provider of biomedical data curation and data science services for pharma and biotech, spanning drug discovery through translational research. Our teams of scientists, data engineers, and software experts deliver end-to-end solutions across data curation, management, mining, and analysis to help customers accelerate R&D. We partner long-term with blue-chip clients and emerging biotechs, bringing scientific rigor, quality, and a customer-first mindset to every engagement.
About the role
  • We are seeking a full-time contractor for a Senior Biological Data Architect to design, harmonize, and govern complex biomedical data models on behalf of our pharmaceutical, academic, and institutional clients. The successful candidate will be an expert problem solver with deep expertise in conceptual, logical, and canonical data modeling for biomedical and scientific domains, including disease biology, genetics, translational research, and drug development. You will play a central role in client initiatives that deliver FAIR-aligned data products enabling rapid query and decision-making by R&D scientists.
  • We are a Data Curation company collaborating with some of the most renowned pharmaceutical organizations in the world. Our team of scientists, curators, computational biologists, data scientists, knowledge engineers, and solution developers is distributed across the country; we support talented people living where they choose, working collaboratively on projects that have real impact on human health.
  • While fully remote, candidates will be expected to spend the majority of time overlapping East Coast US or UK working hours.

What you'll do
  • Partner with scientific and technical stakeholders to elicit requirements and propose canonical data models that represent the full breadth of biomedical concepts relevant to target discovery, disease understanding, and translational research, along with the evidence and provenance that support them.
  • Design and lead source-to-canonical harmonization activities, covering vocabulary alignment, persistent identifier assignment, and lineage and provenance capture.
  • Define schemas, controlled vocabularies, identifier strategies, and ontology bindings in collaboration with knowledge engineering, curation, data engineering, and platform teams.
  • Design models that power data pipelines, APIs, knowledge graphs, analytical workflows, and downstream R&D query use cases.
  • Establish validation rules and data quality checks covering ontology term validation, range and cardinality checks, required-field enforcement, ID and label consistency, cross-field consistency, and provenance completeness.
  • Manage the full schema lifecycle: repository management (e.g., GitHub-based), semantic versioning, changelogs, tagged releases, data dictionaries, metadata catalogs, and downstream impact assessments.
  • Drive schema review, approval, and publication processes; identify modeling risks early, such as metadata gaps, ontology conflicts, source data quality issues, lineage gaps, and compatibility risks.
  • Lead modeling strategy spanning harmonization, pipeline validation, knowledge graphs, and FAIR data product delivery.
  • Translate ambiguous scientific requirements into clear, durable canonical models and make defensible, documented decisions on ontology reuse, extension, and mapping.
  • Design modular, reusable, future-proof models aligned with FAIR and enterprise standards, with consistent persistent identifier and provenance conventions across data assets.
  • Communicate strategies, trade-offs, and progress clearly to clients and internal teams.

Qualifications
Required:
  • PhD in Life Sciences (or equivalent demonstrated expertise) with first-hand experience working with biomedical or research data.
  • Strong conceptual, logical, and canonical data modeling experience for complex scientific or biomedical domains.
  • Hands-on experience with LinkML or equivalent schema modeling frameworks, comfortable defining classes, slots, ranges, identifiers, required fields, constraints, cardinality, descriptions, and ontology bindings.
  • Working knowledge of YAML-based schema authoring.
  • Solid grasp of FAIR principles (findability, accessibility, interoperability, reusability), including persistent identifiers, metadata standards, provenance, and schema versioning.
  • Experience with biomedical ontologies and controlled vocabularies, including familiarity with public ontology resources covering genes, diseases, phenotypes, anatomy, cell types, assays, units, and evidence.
  • Familiarity with semantic web technologies such as RDF, OWL, JSON-LD, SHACL, ShEx, and SPARQL, and with knowledge graph modeling.
  • Proven experience designing Entity Relationship Diagrams and Conceptual and Logical Data Models.
  • Experience with schema and model registries, data catalogs, metadata registries, and data dictionary management.
  • Proficiency in Python, R, or SQL for model conformance testing, ontology mapping, or data quality validation (notebook-based workflows a plus).
  • Experience with SDLC methodologies, unit and integration testing, and documentation practices.
  • AI awareness: comfortable evaluating how and AI-driven curation and mapping tools can accelerate modeling, harmonization, and validation workflows.

Nice to Have:
  • Experience working with modern cloud data platforms and data lake environments such as Snowflake or Databricks.
  • Hands-on use of AI-powered coding assistants and established collaboration workflows that incorporate them into day-to-day modeling, documentation, or validation work.

The pay range for this role is:
70 - 90 USD per hour (United States)