Responsibilities : • Use Python, Django and Django Rest Framework (DRF) to implement cloud backend APIs • Use Python, Django, and Celery to implement highly parallelized large dataset processing ...
Responsibilities : • Use Python, Django and Django Rest Framework (DRF) to implement cloud backend APIs • Use Python, Django, and Celery to implement highly parallelized large dataset processing ...
This is a hands-on technical leadership position focused on complex incident resolution, data platform troubleshooting, large dataset support, and root-cause analysis across modern Azure data ...
This is a hands-on technical leadership position focused on complex incident resolution, data platform troubleshooting, large dataset support, and root-cause analysis across modern Azure data ...
Geophysicist - AI Trainer
Manhattan, NY · On-site +1
You'll draw on your hands‐on experience with scientific visualization, fluid dynamics simulation, or large dataset analysis to evaluate AI-generated content and provide feedback that helps AI ...
Geophysicist - AI Trainer
Manhattan, NY · On-site +1
You'll draw on your hands‐on experience with scientific visualization, fluid dynamics simulation, or large dataset analysis to evaluate AI-generated content and provide feedback that helps AI ...
Handle analyses around a very large dataset, identifying outliers, and utilization rate calculations across different geographies. * Perform predictive data analysis work to determine forecast ...
Handle analyses around a very large dataset, identifying outliers, and utilization rate calculations across different geographies. * Perform predictive data analysis work to determine forecast ...
Gastroenterology Opening in Rochester, MN
Rochester, MN · On-site
$406K/yr
... Large dataset analysis Academic appointment at Mayo Clinic College of Medicine (rank commensurate with experience) Top-tier fellowship programs in multiple GI subspecialties Compensation & Benefits:
Gastroenterology Opening in Rochester, MN
Rochester, MN · On-site
$406K/yr
... Large dataset analysis Academic appointment at Mayo Clinic College of Medicine (rank commensurate with experience) Top-tier fellowship programs in multiple GI subspecialties Compensation & Benefits:
SAP BO Reports Developer
Newark, NJ · On-site
Desirable skills BW Query Design • Dash boarding • Understanding Speed in reporting/large dataset • Report Troubleshooting • Job creation in Promotion Management • Lead Unit test • Lead ...
SAP BO Reports Developer
Newark, NJ · On-site
Desirable skills BW Query Design • Dash boarding • Understanding Speed in reporting/large dataset • Report Troubleshooting • Job creation in Promotion Management • Lead Unit test • Lead ...
... Large dataset analysis Academic appointment at Mayo Clinic College of Medicine (rank commensurate with experience) Top-tier fellowship programs in multiple GI subspecialties Compensation & Benefits:
... Large dataset analysis Academic appointment at Mayo Clinic College of Medicine (rank commensurate with experience) Top-tier fellowship programs in multiple GI subspecialties Compensation & Benefits:
Anaplan Architect
Santa Clara, CA · Hybrid
$80/hr
Experience with large dataset handling and performance tuning * Strong stakeholder communication * Experience integrating with ERP / CRM systems preferred
Quick apply
Anaplan Architect
Santa Clara, CA · Hybrid
$80/hr
Experience with large dataset handling and performance tuning * Strong stakeholder communication * Experience integrating with ERP / CRM systems preferred
Senior Workday Payroll Analyst - Payroll Operations & Compliance- 1612
Culver City, CA · On-site
$80/hr
Advanced Excel (large dataset analysis) Responsibilities: * Lead payroll audits, reconciliations, and validation of payroll, tax, and GL data * Ensure compliance with federal, state, and local ...
Senior Workday Payroll Analyst - Payroll Operations & Compliance- 1612
Culver City, CA · On-site
$80/hr
Advanced Excel (large dataset analysis) Responsibilities: * Lead payroll audits, reconciliations, and validation of payroll, tax, and GL data * Ensure compliance with federal, state, and local ...
Technical Program Manager, Dataset Operations
$151K - $196K/yr
Own the lifecycle of large-scale datasets across modalities: * Ego-centric video (AR/VR, human ... Dataset versioning and traceability * Vendor performance benchmarking * Cost vs. quality ...
Technical Program Manager, Dataset Operations
$151K - $196K/yr
Own the lifecycle of large-scale datasets across modalities: * Ego-centric video (AR/VR, human ... Dataset versioning and traceability * Vendor performance benchmarking * Cost vs. quality ...
Able to design innovative solutions for large dataset processing. * hands on Performance tuning - Spark/Scala code optimization. * Adding and removing hosts to cluster/ cluster level parameter change ...
Able to design innovative solutions for large dataset processing. * hands on Performance tuning - Spark/Scala code optimization. * Adding and removing hosts to cluster/ cluster level parameter change ...
Technical Program Manager, Dataset Operations
Santa Clara, CA · On-site
$151K - $196K/yr
Own the lifecycle of large-scale datasets across modalities: * Ego-centric video (AR/VR, human ... Dataset versioning and traceability * Vendor performance benchmarking * Cost vs. quality ...
Technical Program Manager, Dataset Operations
Santa Clara, CA · On-site
$151K - $196K/yr
Own the lifecycle of large-scale datasets across modalities: * Ego-centric video (AR/VR, human ... Dataset versioning and traceability * Vendor performance benchmarking * Cost vs. quality ...
This is a hands-on technical leadership position focused on complex incident resolution, data platform troubleshooting, large dataset support, and root-cause analysis across modern Azure data ...
This is a hands-on technical leadership position focused on complex incident resolution, data platform troubleshooting, large dataset support, and root-cause analysis across modern Azure data ...
Handle analyses around a very large dataset, identifying outliers, and utilization rate calculations across different geographies. * Perform predictive data analysis work to determine forecast ...
Handle analyses around a very large dataset, identifying outliers, and utilization rate calculations across different geographies. * Perform predictive data analysis work to determine forecast ...
Machine Learning Scientist
Waltham, MA · On-site
Apply supervised and semi-supervised machine learning techniques on a variety of problems using our large dataset of spontaneous real-world face data * Experiment with deep architectures for the ...
Machine Learning Scientist
Waltham, MA · On-site
Apply supervised and semi-supervised machine learning techniques on a variety of problems using our large dataset of spontaneous real-world face data * Experiment with deep architectures for the ...
LLM Dataset Engineer
San Francisco, CA · On-site
$155K - $210K/yr
Foundation Dataset Strategy: Own the end-to-end creation of pre-training datasets for LLMs. This ... Experience building large-scale image or video datasets from scratch (e.g., LAION-style pipelines)
LLM Dataset Engineer
San Francisco, CA · On-site
$155K - $210K/yr
Foundation Dataset Strategy: Own the end-to-end creation of pre-training datasets for LLMs. This ... Experience building large-scale image or video datasets from scratch (e.g., LAION-style pipelines)
Senior Infrastructure Engineer
$118K - $161K/yr
Large Dataset Management Expertise * SAN Management Experience * Disaster Recovery and Capacity Planning Comp & Benefits * Competitive comp based on experience level * Healthcare HMO & PPO * Stock ...
Senior Infrastructure Engineer
$118K - $161K/yr
Large Dataset Management Expertise * SAN Management Experience * Disaster Recovery and Capacity Planning Comp & Benefits * Competitive comp based on experience level * Healthcare HMO & PPO * Stock ...
Assistant Research Scientist (Part-Time)
New York, NY · On-site
$22/hr
The goal of the project is to develop a large dataset of transcripts of parent-child interaction. The assistant research scientist will be involved in creating and editing transcripts. The research ...
Assistant Research Scientist (Part-Time)
New York, NY · On-site
$22/hr
The goal of the project is to develop a large dataset of transcripts of parent-child interaction. The assistant research scientist will be involved in creating and editing transcripts. The research ...
Full Stack Developer
Sunnyvale, CA · On-site
Have worked with time-series or streaming data and understand the performance implications of large dataset rendering. * Learn quickly and enjoy working at the intersection of software, data, and ...
Full Stack Developer
Sunnyvale, CA · On-site
Have worked with time-series or streaming data and understand the performance implications of large dataset rendering. * Learn quickly and enjoy working at the intersection of software, data, and ...
Senior Data Analyst Lead
$85K - $107K/yr
Required Skills & Experience • 8-10 years of experience as a Data Analyst, with strong exposure to Banking. • Advanced proficiency in SQL (query optimization and large dataset handling). • ...
New
Senior Data Analyst Lead
$85K - $107K/yr
Required Skills & Experience • 8-10 years of experience as a Data Analyst, with strong exposure to Banking. • Advanced proficiency in SQL (query optimization and large dataset handling). • ...
New
Large Dataset information
See salary details
$47K - $55.5K
1% of jobs
$55.5K - $64K
3% of jobs
$64K - $72.5K
7% of jobs
$72.5K - $81K
9% of jobs
$82.7K is the 25th percentile. Wages below this are outliers.
$81K - $89.5K
20% of jobs
The median wage is $95.1K / yr.
$89.5K - $98K
14% of jobs
$98K - $106.5K
13% of jobs
$110.9K is the 75th percentile. Wages above this are outliers.
$106.5K - $115K
15% of jobs
$115K - $123.5K
8% of jobs
$123.5K - $132K
5% of jobs
$132K - $140.5K
4% of jobs
$47K
$98.6K
$140.5K
How much do large dataset jobs pay per year?
What are the key skills and qualifications needed to thrive as a Data Scientist working with large datasets, and why are they important?
What is the difference between Large Dataset vs Data Analyst?
| Aspect | Large Dataset | Data Analyst |
|---|---|---|
| Required Credentials | Often no formal degree, but knowledge of data management | Bachelor's degree in data science, statistics, or related field |
| Work Environment | Data storage, database management, data processing | Data interpretation, reporting, visualization |
| Industry Usage | Used across industries for storing and managing data | Applied in business, finance, healthcare for analysis |
| Search & Comparison Intent | Understanding data volume management | Analyzing data to generate insights |
Large Dataset refers to the volume of data stored and managed, often requiring data engineering skills. Data Analysts focus on interpreting and visualizing data to support decision-making. While large datasets are the raw material, data analysts turn that data into actionable insights.
What are large datasets?
What are some common challenges when working with large datasets, and how can professionals overcome them?

Job description
Lumafield is a company founded to upgrade manufacturing through innovative engineering solutions. The Backend Software Engineer will work on the core of the cloud platform, implementing APIs and processing large datasets while collaborating with research teams and product managers.
Responsibilities:
• Use Python, Django and Django Rest Framework (DRF) to implement cloud backend APIs
• Use Python, Django, and Celery to implement highly parallelized large dataset processing tasks ranging up to 100s of GBs of data
• Design for and deploy your code to our AWS environment leveraging EKS, S3, CloudFront, and other AWS technologies
• Work closely with our research and algorithms team to incorporate cutting edge algorithms into our production codebases
• Collaborate closely with product managers and engineering leadership to align technical objectives with business deliverables.
• Get your hands dirty and build – expect to be hands-on and building regularly
Qualifications:
Required:
• Bachelor's Degree in Engineering or related field
• 2+ years of experience with Python in a production backend setting using major web frameworks (Flask, Django, FastAPI, etc.)
• Experience with large dataset processing using numpy
• Strong software engineering fundamentals including git, unit testing, pull request reviews, module/interface design, and applications using parallelism and concurrency.
• Strong team collaboration, communication and interpersonal skills
• Experience with Linux server administration, network troubleshooting, docker deployments, productionizing systems
Preferred:
• Experience with Agile Development practices
• Experience with AWS including EKS, S3, CloudFront, or similar
• Experience with image processing pipelines and/or image acquisition
• Experience in configuring Linux systems, applying best practices, and automating workflows with Ansible and scripting.
Company:
Lumafield develops industrial computed tomography (CT) solutions for non-destructive testing and inspection for engineers. Founded in 2019, the company is headquartered in Cambridge, USA, with a team of 51-200 employees. The company is currently Growth Stage.
About Lumafield
Sourced by ZipRecruiter
Industry
Computer and peripheral equipment manufacturing
Company size
51 - 200 Employees
Headquarters location
Cambridge, MA, US
Year founded
2019