Data Curation Jobs (NOW HIRING)

Working within a cross-functional team and reporting to a technical lead, you will operate across the machine learning development lifecycle, from data curation and synthetic data generation to model ...

Data Engineer III

Menlo Park, CA · On-site +1

$134K - $162K/yr

Our team develops comprehensive data curation and evaluation solutions for image generation models across quality dimensions including visual quality, prompt adherence, identity preservation ...

Hands-on experience with data curation techniques for overhead imagery (optical or SAR) and computer vision model development * Experience building and maintaining ETL or data processing pipelines

Data engineer

Irving, TX

$109K - $132K/yr

Solid experience with schema design, ETL setup, batch job setup and custom scripting, and data curation and aggregation. * Postgres, MongoDB; strong knowledge of CI/CD pipelines for automated deployment.

Platform Engineer, Data

Austin, TX

$113K - $136K/yr

You will build and maintain large-scale image and video pipelines, but with a focus on data curation strategies such as coreset selection, embedding-based filtering, and automated complexity scoring.

Data Curation information

How much do data curation jobs pay per hour?

As of Jun 12, 2026, the average hourly pay for data curation in the United States is $44.82, according to ZipRecruiter salary data. Most workers in this role earn between $29.81 and $58.89 per hour, depending on experience, location, and employer.

What are some typical challenges faced by professionals in Data Curation roles?

Professionals in Data Curation often manage large, complex datasets from diverse sources, which can present challenges in ensuring consistency, accuracy, and proper documentation. They may encounter incomplete or poorly formatted data that requires significant cleaning and standardization. Collaborating with researchers, data engineers, and subject matter experts is common to clarify requirements and maintain data integrity. Staying current with evolving data standards and technologies is also key to success, making adaptability important in daily work.

What skills are needed for data curation?

Data curation requires strong organizational skills, attention to detail, and proficiency with data management tools such as databases and spreadsheets. Knowledge of data standards, metadata creation, and basic programming or scripting skills can also enhance effectiveness in maintaining and validating data quality.

Do curators make a lot of money?

Data curator salaries vary by experience and industry, with entry-level positions starting around $40,000 annually and experienced professionals earning over $70,000. Salaries can increase with specialized skills, certifications, and work in high-demand sectors such as technology or healthcare.

How much do data curators make in the US?

Data curators in the US typically earn a median annual salary of around $60,000 to $80,000, depending on experience, education, and industry. Entry-level positions may start lower, while experienced professionals with specialized skills or certifications can earn higher salaries. The role often involves working with data management tools and maintaining data quality standards.

What are the key skills and qualifications needed to thrive in the Data Curation position, and why are they important?

To excel in Data Curation, you need a strong background in data management, information science, and database technologies, often supported by a degree in a related field. Familiarity with data wrangling tools, metadata standards, programming languages (such as Python or R), and data management systems is highly valuable. Attention to detail, analytical thinking, and collaborative communication are standout soft skills in this position. These abilities ensure data integrity, usability, and accessibility, which are essential for supporting robust decision-making and research outcomes.

What does a data curator do?

A data curator is responsible for organizing, maintaining, and ensuring the quality of data within a database or repository. They review, categorize, and annotate data to make it accessible and useful for analysis, often using tools like data management software and following data standards. Their work supports accurate data retrieval and reliable research or decision-making processes.

What is a Data Curation job?

A Data Curation job involves collecting, organizing, maintaining, and ensuring the quality of data for accuracy and accessibility. Data curators clean and structure datasets, manage metadata, and ensure compliance with data governance standards. They work closely with data scientists, analysts, and engineers to support data-driven decision-making. This role is essential in industries like research, healthcare, finance, and technology, where high-quality data is crucial for insights and innovation.

Infographic showing Data Curation job openings in the United States as of June 2026. Employment types break down into 97% Full Time and 3% Contract. The job distribution is 83% Physical, 4% Hybrid, and 13% Remote, with an average salary of $93,230 per year, or $44.8 per hour.

Research Scientist - Model Capability Boundary Exploration and AI Data Flywheel System Developmen...

ByteDance

San Jose, CA • On-site

Full-time

Posted 23 days ago


Job description

Job Summary:
ByteDance is a leading tech company known for innovative products such as TikTok and CapCut. The company is seeking a Research Scientist to develop and operate Large Language Model (LLM) service platforms, with a focus on building a next-generation large-model-as-a-service platform and managing GPU resources efficiently.
Responsibilities:
• Build a next-generation large-model-as-a-service platform that serves hundreds of LLM-based applications;
• Develop and maintain the platform, including offline training and fine-tuning, online inference, model management, and resource orchestration;
• Manage a large pool of GPU resources and provide computing power efficiently.
Qualifications:
Required:
• Currently pursuing or recently completed a Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, Data Science, or a related technical field.
• Research experience in one or more of the following areas: LLM post-training and alignment, model evaluation, test-time scaling, agent systems, or large-scale data curation and optimization.
• Demonstrated research ability through publications, substantial research projects, or internships.
• Ability to work independently on open-ended research problems, from problem formulation to experimental execution.
Preferred:
• Strong interest in foundation models and data-centric AI, particularly in how large models can improve over time through better data, feedback, and system design. Relevant directions include data flywheels, continual learning, data curation and valuation, and the co-design of algorithms and infrastructure.
• A strong publication record with multiple first-author papers in machine learning, NLP, data mining, or related fields.
• Internship or research experience in similar fields, ideally with scalable ML systems, especially those involving real-world deployment, feedback loops, or human-in-the-loop pipelines.
• Strong motivation to connect research with practice, and to build end-to-end AI systems spanning modeling, data, evaluation, and infrastructure.
Company:
ByteDance is a technology company that develops content creation platforms and services. Founded in 2012, the company is headquartered in Beijing, China, and has more than 10,000 employees. The company is currently late stage.