Build LLM-based methods for synthetic data generation, privacy, and context-aware anonymization, with automated evaluation across multilingual text, documents, and multimodal content. * Optimize task ...
We are seeking a Simulation Engineer to join our AI robotics research team, focusing on high-fidelity synthetic data generation. In this role, you will leverage classic game engine architecture, 3D ...
Senior Scientist: Synthetic Data & Privacy for LLMs (Santa Clara)
Santa Clara, CA · On-site
$168K - $264K/yr
Nvidia Corporation in Santa Clara, California, is seeking a Senior Scientist to drive advancements in synthetic data generation and AI privacy. This role involves building LLM-based methods ...
Senior Scientist - Synthetic Data & Privacy for LLMs (New York)
Manhattan, NY · On-site
$192K - $304K/yr
NVIDIA AI is seeking a Senior Scientist to help advance the capabilities in synthetic data generation and privacy-preserving AI. You will contribute to open-source libraries, combining software ...
NVIDIA is seeking a Senior Scientist to advance the capabilities in synthetic data generation and privacy-preserving AI in Santa Clara, California. You'll build methods based on large language models ...
Data Scientist
Herndon, VA · On-site +1
Working within a cross-functional team and reporting to a technical lead, you will operate across the machine learning development lifecycle, from data curation and synthetic data generation to model ...
Synthetic Data Generation: Develop and maintain synthetic data generation pipelines to augment evaluation coverage, stress-test safety boundaries, and support evaluation in low-resource languages.
Member of Technical Staff - Simulation (Synthetic Data Generation), Frontier AI & Robotics (FAR)
San Francisco, CA · On-site
We are seeking a Simulation Engineer to join our AI robotics research team, focusing on high-fidelity synthetic data generation. In this role, you will leverage classic game engine architecture, 3D ...
Member of Engineering (Synthetic Data Research)
$117K - $140K/yr
Staying in sync with the latest state-of-the-art research in synthetic data generation and LLM training is key to success in this role. You will constantly lead original research initiatives through ...
Test Data Management Consultant
Little Rock, AR · On-site +1
... or synthetic data generation as needed. Ensuring compliance with IT security guidelines and data compliance regulations Provisioning data for QA testing, user acceptance testing, and performance ...
Data Scientist with Security Clearance
Herndon, VA · On-site
$106K - $180K/yr
Working within a cross-functional team and reporting to a technical lead, you will operate across the machine learning development lifecycle, from data curation and synthetic data generation to model ...
This role focuses on building scalable pipelines for image and video datasets, ensuring ethical data collection, and leveraging simulation tools for synthetic data generation. Ideal candidates will ...
Innovate and experiment with new approaches for synthetic data generation to improve the diversity, realism, and representativeness of datasets. Collaborate with multi-functional teams to understand ...
Data Scientist
Herndon, VA · On-site
$106K - $180K/yr
Working within a cross-functional team and reporting to a technical lead, you will operate across the machine learning development lifecycle, from data curation and synthetic data generation to model ...
AI Engineer
Leawood, KS · On-site
$111K - $133K/yr
Support post-training data workflows such as SFT, instruction tuning, preference data, RLHF/DPO-style data, reward model data, and synthetic data generation. * Use modern annotation tools and AWS ...
Research Scientist - Vision Data Infrastructure (San Francisco)
San Francisco, CA · On-site
$250K - $600K/yr
Synthetic & Simulation‑Based Data Generation * Use simulation tools (Unreal Engine 5, Isaac Sim, Unity) to generate high-quality synthetic vision data. * Create specialized datasets for VLM ...
Senior Robotics Data Engineer - Only W2
Warren, MI · On-site
$99K - $135K/yr
... and synthetic data generation. · Manage data versioning, metadata, and dataset governance to support model training, evaluation, and regression testing. · Collaborate with Robotics Perception ...
Synthetic Data Generation information
| Salary range | Share of jobs |
|---|---|
| $31K - $43.5K | 12% |
| $43.5K - $56.1K | 41% |
| $56.1K - $68.6K | 4% |
| $68.6K - $81.2K | 3% |
| $81.2K - $93.7K | 2% |
| $93.7K - $106.3K | 0% |
| $106.3K - $118.8K | 0% |
| $118.8K - $131.4K | 5% |
| $131.4K - $143.9K | 11% |
| $143.9K - $156.5K | 11% |
| $156.5K - $169K | 11% |
$47.6K is the 25th percentile. Wages below this are outliers.
$139.5K is the 75th percentile. Wages above this are outliers.
Chart axis labels: $31K, $93.2K, $169K.
What is the difference between Synthetic Data Generation vs Data Analyst?
| Aspect | Synthetic Data Generation | Data Analyst |
|---|---|---|
| Required Credentials | Knowledge of data science, programming, and data privacy | Degree in statistics, data science, or related field |
| Work Environment | Data science teams, research labs, tech companies | Business environments, analytics teams, consulting firms |
| Industry Usage | AI development, machine learning, data privacy | Business insights, reporting, decision-making |
| Search & Comparison Intent | Understanding data generation techniques, privacy solutions | Analyzing data, generating reports, insights |
While Synthetic Data Generation focuses on creating artificial data for privacy and model training, Data Analysts interpret existing data to provide business insights. Both roles require data-related skills but serve different purposes within the data ecosystem.

Full-time
Posted 4 days ago
Job description
NVIDIA is at the forefront of the AI revolution, and our research is shaping the future of large language models. We are looking for a Senior Scientist to join our team and help advance our capabilities in synthetic data generation and privacy-preserving AI. You will contribute to open-source libraries within the NVIDIA NeMo ecosystem that enable high-quality synthetic data generation and data privacy at scale, including context‑aware anonymization. This role combines hands‑on software engineering with applied research in LLMs and privacy‑enhancing methods, and you will collaborate with research, engineering, and product teams as well as external labs.
What You’ll Be Doing
- Build LLM-based methods for synthetic data generation, privacy, and context-aware anonymization, with automated evaluation across multilingual text, documents, and multimodal content.
- Optimize task-specific LLMs for low‑latency, high‑throughput inference (distillation, quantization), and scale our frameworks to run in real time.
- Design and maintain open-source libraries and SDKs with clean APIs and strong documentation.
- Drive software excellence with modern tooling, configuration-driven architecture, and professional Git and CI/CD practices.
- Publish original research at top machine learning and AI conferences to maintain NVIDIA’s technical leadership.
- Mentor interns and junior researchers to support their technical growth within the team.
- PhD in Computer Science, Machine Learning, Statistics, or a related field, or equivalent experience.
- 2+ years of applied research and engineering experience in LLMs/NLP, synthetic data generation, anonymization and PII detection, or related areas. Comparable experience is also considered.
- Proven track record of developing or maintaining software libraries used by a broad developer community.
- Strong publication record at premier venues such as NeurIPS, ICML, ICLR, ACL or similar.
- Active contributions to open‑source projects, particularly in ML, security, or privacy domains.
- Deep technical understanding of LLMs and inference optimization (quantization, distillation, latency/throughput tuning), with frameworks such as vLLM or TGI.
- Ability to build and optimize scalable data processing pipelines for large‑scale models.
- Functional knowledge of global privacy regulations such as GDPR or CCPA.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 264,500 USD for Level 3, and 192,000 USD - 304,750 USD for Level 4. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until June 14, 2026.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.