1

Data Curation Ai Machine Learning Jobs (NOW HIRING)

Terra AI's mission is to define the new global standard for data-driven critical resource ... Data Curation Skills * Hands-on experience in creating, cleaning, and maintaining high-quality ...

... model lifecycle: data curation → training/fine-tuning → evaluation → deployment → ... Required : • 7+ years of experience in Applied AI / Machine Learning / Generative AI • Proven ...

Senior Data Engineer / Data Curator

San Jose, CA · On-site

$124K - $168K/yr

... curation, particularly in machine learning or AI-driven environments. • Strong proficiency in Python (Pandas, NumPy) and SQL for data manipulation and querying. • Familiarity with cloud-based ...

Senior Machine Learning Engineer

San Francisco, CA · On-site

$144K - $190K/yr

... data curation, active learning strategies, and statistical quality metrics to optimize the signal ... Atoms is a robotics startup that develops industrial robotics and physical AI systems to automate ...

The ideal candidate brings a strong foundation in machine learning, data engineering, and MLOps ... Deploy AI/ML models into production environments, integrating with existing enterprise systems ...

Machine Learning Data Engineer

Cupertino, CA · On-site

$141K - $169K/yr

Experience designing data quality control processes, data curation workflows, or Human-in-the-Loop ... Experience with prompt engineering, machine learning tools, and fine-tuning Large Language Models ...

Senior Machine Learning Engineer

San Francisco, CA · On-site

$144K - $190K/yr

... data curation, active learning strategies, and statistical quality metrics to optimize the signal ... applying AI Transformers to robotics, physical actuation, or spatial-temporal data. • Proven ...

Senior Machine Learning Engineer

San Francisco, CA · On-site

$144K - $190K/yr

... data curation, active learning strategies, and statistical quality metrics to optimize the signal ... Atoms is a robotics startup that develops industrial robotics and physical AI systems to automate ...

... machine learning and AI conferences, publish technical blog posts, and contribute to the open-source AI community! In this role, you will: * Develop scalable data curation and processing methods that ...

next page

Showing results 1-20

Data Curation Ai Machine Learning information

See salary details

$44.5K

$129.7K

$177.5K

How much do data curation ai machine learning jobs pay per year?

As of Jul 4, 2026, the average yearly pay for data curation ai machine learning in the United States is $129,716.00, according to ZipRecruiter salary data. Most workers in this role earn between $114,500.00 and $137,500.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Data Curation AI Machine Learning Specialist, and why are they important?

To thrive as a Data Curation AI Machine Learning Specialist, you need strong data management skills, a background in computer science or data science, and experience with machine learning principles. Familiarity with programming languages like Python or R, data labeling tools, and database systems, as well as certifications in machine learning or data engineering, are typically required. Attention to detail, critical thinking, and effective communication stand out as essential soft skills for managing complex datasets and collaborating with cross-functional teams. These skills ensure high-quality, well-organized data that drives accurate machine learning models and reliable AI outcomes.

What is a Data Curation AI/Machine Learning specialist?

A Data Curation AI/Machine Learning specialist is a professional who manages, organizes, and prepares large datasets to be used in artificial intelligence and machine learning projects. They ensure that data is accurate, relevant, and accessible, often cleaning and labeling data so it can be effectively used to train machine learning models. Their role bridges the gap between raw data sources and the teams building AI solutions, enabling more reliable and efficient model development. They may also work with data governance, privacy, and compliance issues to ensure data quality and security.

What are some common challenges faced by data curation professionals working in AI and machine learning projects?

One of the key challenges data curation specialists encounter in AI and machine learning is ensuring the quality and consistency of large, diverse datasets. This often involves dealing with missing, incomplete, or biased data, which can impact model performance. Additionally, data curators must navigate evolving data privacy regulations and work closely with data scientists, engineers, and domain experts to align data preparation with project goals. Effective communication and a meticulous approach are crucial for maintaining data integrity and supporting robust machine learning outcomes.

What is the difference between Data Curation Ai Machine Learning vs Data Analyst?

AspectData Curation Ai Machine LearningData Analyst
Primary FocusPreparing and managing data for AI and ML modelsAnalyzing data to generate business insights
Skills RequiredData management, programming, understanding of AI/ML algorithmsStatistical analysis, data visualization, Excel, SQL
Tools UsedPython, R, SQL, data cleaning toolsExcel, Tableau, SQL, statistical software
Work EnvironmentData science teams, AI/ML projects, tech companiesBusiness departments, analytics teams, consulting firms

While Data Curation Ai Machine Learning specialists focus on preparing data for AI and machine learning models, Data Analysts interpret data to support business decisions. Both roles require strong data skills but differ in their primary objectives and tools used.

More about Data Curation Ai Machine Learning jobs
What cities are hiring for Data Curation Ai Machine Learning jobs? Cities with the most Data Curation Ai Machine Learning job openings:
What states have the most Data Curation Ai Machine Learning jobs? States with the most job openings for Data Curation Ai Machine Learning jobs include:
Infographic showing various Data Curation Ai Machine Learning job openings in the United States as of June 2026, with employment types broken down into 1% Internship, 59% Full Time, 39% Part Time, and 1% Temporary. Highlights an 66% Physical, 3% Hybrid, and 31% Remote job distribution, with an average salary of $129,716 per year, or $62.4 per hour.

Senior/Staff Machine Learning Researcher

Terra AI

Redwood City, CA • Remote

Full-time

Posted 21 days ago


Job description

Terra AI is building a new category at the intersection of artificial intelligence, geoscience, and critical resource development.
As global demand for copper, lithium, nickel, rare earth elements, geothermal energy, and other strategic resources accelerates, the mining and subsurface industries face a growing challenge: traditional exploration methods remain slow, expensive, and highly uncertain. Terra AI was founded to help solve this problem by redefining how critical resources are discovered, evaluated, and developed.
By combining advanced machine learning, probabilistic modeling, and deep geoscience expertise, Terra AI helps exploration and mining companies make faster, more informed subsurface decisions with greater confidence and capital efficiency. The company’s platform integrates geological, geophysical, and drilling data into intelligent systems designed to improve targeting accuracy, accelerate discovery timelines, and reduce exploration risk.
Backed by leading investors including Khosla Ventures and working alongside strategic industry partners including Rio Tinto, Ero Copper, and Ramaco Resources, Terra AI is emerging as one of the more closely watched AI-native companies operating within the mining and critical minerals sector.
Terra AI’s mission is to define the new global standard for data-driven critical resource development — breaking the cost and time curve required to support electrification, energy security, and the global energy transition.
The company operates with a strong partnership mentality, combining technical rigor, candid communication, continual learning, and environmental stewardship to help modern exploration teams solve some of the world’s most important resource challenges.

Role description

In the same way image generators have shown the remarkable ability to produce a diverse set of realistic pictures conditioned on a text prompt (and other inputs), we are developing a generative model that produces 3D geological models conditioned on geophysical surveys, borehole measurements, and other forms of physical observation. The outputs of the generative model capture what we know and don’t know about the state of the subsurface, allowing explorers to make maximally informed decisions about how and where to explore for critical resources.

We are looking for a talented deep learning engineer or scientist to lead the development of this model that will revolutionize decision-making in the earth subsurface for a wide range of clean energy applications.

Role Responsibilities
  • Design, train, test, and iterate on diffusion models for 3D geological models

  • Design, train, test, and iterate on an approach for conditioning generation on geophysical data and other observations

  • Inform the generation of synthetic data to improve model performance

  • Adapt diffusion modeling approach to specific real-world projects in collaboration with project teams.

Qualifications

Required Qualifications:

  • Extensive PyTorch Experience

    • Deep understanding of PyTorch, including writing custom modules, optimizing training, and debugging issues in large-scale models.

  • Expertise in Developing Large Deep Learning Models from Scratch

    • Proven ability to design, implement, and train complex deep learning architectures from the ground up.

  • Data Curation Skills

    • Hands-on experience in creating, cleaning, and maintaining high-quality datasets tailored for machine learning applications.

  • Strong Software Engineering and Design Experience

    • Proficient in software development best practices, including version control, testing, and code optimization.

    • Familiarity with designing scalable and maintainable systems.

Nice-to-haves:

  • Experience with Generative Models

    • Familiarity with generative architectures, particularly diffusion models, and an emphasis on posterior sampling methods.

  • Knowledge of Transformer Architectures

    • Experience building and training transformers, especially in applications involving 3D data.

  • Scaling Models Across Large GPU Clusters

    • Expertise in parallelizing models across multiple GPUs and optimizing distributed training pipelines.

  • Cloud Infrastructure Expertise

    • Experience setting up, managing, and optimizing cloud environments for machine learning workloads, including provisioning resources and managing costs.