1

Big Data Scientist Jobs (NOW HIRING)

This role requires strong expertise in Big Data processing, modern ML tools, and the ability to build scalable, production-ready data science solutions. Key Responsibilities Structured Data Machine ...

Data Scientist Company Description Newton Research is a fast-growing software start-up founded by ... Familiarity with big data technologies or cloud computing platforms is a plus. * Experience in ...

Big Data Engineer

Sunnyvale, CA · On-site

$65.50 - $86.50/hr

Collaborate with cross-functional teams including data scientists, analysts, and DevOps engineers ... big data engineering and backend development . * Expertise in Hadoop ecosystem (HDFS, MapReduce ...

Company Description SRP is a big data startup company located in Princeton, NJ focused on Dynamic Pricing, run by seasoned alumni from Stanford University and Wharton. Senior Data Scientist: someone ...

Company Description SRP is a big data startup company located in Princeton, NJ focused on Dynamic Pricing, run by seasoned alumni from Stanford University and Wharton. Senior Data Scientist: someone ...

Data Scientist Location: Houston, TX (Fulltime) Environment: Standard, 5-days onsite : Must-Have ... Experience with big data technologies and cloud platforms * Knowledge of optimization techniques ...

Big data processing tools * Version control systems (Git) * Jupyter notebooks * Data preprocessing ... Scientific computing * Geospatial data formats and standards * Publications in peer-reviewed ...

About the team At Roku, as a Senior Data Scientists you will leverage Big Data to generate insights that fuel Roku's most significant business decisions. As a Senior Data Scientist, you will also use ...

Senior Data Scientist Job Summary The Senior Data Scientist plays a crucial role in transforming unstructured big data into actionable insights through advanced algorithms and innovative data models.

Are you a Data Scientist that likes to perform research-level data analytics and are willing to ... Work on small projects analyzing a variety of big data covering national security, cyber security, ...

next page

Showing results 1-20

Big Data Scientist information

See salary details

$46K

$165K

$243.5K

How much do big data scientist jobs pay per year?

As of Jun 16, 2026, the average yearly pay for big data scientist in the United States is $165,018.00, according to ZipRecruiter salary data. Most workers in this role earn between $133,500.00 and $170,000.00 per year, depending on experience, location, and employer.

What are Big Data Scientists?

Big Data Scientists are professionals who analyze and interpret large, complex data sets to uncover patterns, trends, and insights that help organizations make data-driven decisions. They use advanced analytics, machine learning, and statistical modeling techniques to process and extract value from massive volumes of structured and unstructured data. Their work often involves programming, data mining, and working with big data technologies such as Hadoop and Spark.

What are some common challenges Big Data Scientists face when working with large datasets, and how can they be addressed?

Big Data Scientists often encounter challenges such as handling data quality issues, ensuring data security, and managing the complexity of distributed computing environments. Large datasets frequently contain inconsistencies or missing values, requiring robust data cleaning and preprocessing techniques. Additionally, working with distributed systems like Hadoop or Spark introduces complexities around data storage, processing speed, and coordination with engineering teams. To address these challenges, it’s important to stay updated on best practices, leverage automation tools, and maintain close collaboration with data engineers and IT teams.

What are the key skills and qualifications needed to thrive as a Big Data Scientist, and why are they important?

To thrive as a Big Data Scientist, you need a strong background in statistics, machine learning, programming (typically Python or R), and a relevant degree in computer science, mathematics, or a related field. Expertise with big data technologies such as Hadoop, Spark, and experience with cloud platforms and relevant certifications (like AWS Certified Data Analytics) are highly valuable. Analytical thinking, problem-solving, and effective communication are crucial soft skills for translating complex data insights to stakeholders. These capabilities are essential for extracting actionable, high-impact insights from massive datasets to drive business decisions.

What is the difference between Big Data Scientist vs Data Analyst?

AspectBig Data ScientistData Analyst
Required CredentialsAdvanced degrees in Data Science, Computer Science, or related fields; certifications like Cloudera, HortonworksBachelor's or Master's in Statistics, Data Analysis, or related fields; certifications like Microsoft Data Analyst
Work EnvironmentBig data platforms, cloud environments, programming in Python, R, ScalaExcel, SQL, visualization tools, basic scripting
Employer & Industry UsageTech companies, finance, healthcare, e-commerce handling large datasetsRetail, marketing, small to medium businesses analyzing customer data

Big Data Scientists focus on developing models and algorithms to analyze vast datasets using advanced tools and programming. Data Analysts interpret data, generate reports, and provide insights primarily through visualization and SQL queries. Both roles are essential but differ in complexity, tools, and scope of data handled.

What cities are hiring for Big Data Scientist jobs? Cities with the most Big Data Scientist job openings:

Data Scientist - INDIA

Vytwo

Prosper, TX • On-site, Remote

Full-time

Posted 16 days ago


Job description

Role: Data Scientist - INDIA
Location: Hyderabad / Noida, INDIA

*Consultants local to INDIA are eligible.Category: Data Science Structured Data / Text Data (NLP & GenAI)
About the Role

We are seeking a highly skilled Data Scientist (37 years of experience) to join our team and work across two major data science domains:
  1. Structured Data (8090%) Predictive analytics, forecasting, cost estimation, likelihood modeling, and batchoriented machine learning pipelines.
  2. Text / Unstructured Data (NLP & GenAI) Building lowlatency realtime systems using deep learning, LLMs, prompt engineering, and agentic AI frameworks.
This role requires strong expertise in Big Data processing, modern ML tools, and the ability to build scalable, production-ready data science solutions.
Key Responsibilities

Structured Data Machine Learning & Analytics

  • Build, deploy, and optimize ML models for predictive analytics, forecasting, classification, and regression.
  • Perform large-scale feature engineering using PySpark and Big Data tools.
  • Work on batch pipelines, model versioning, and experiment tracking.
  • Develop cost estimation and risk/likelihood models using statistical and ML techniques.
Text Data / NLP / GenAI

  • Build NLP pipelines using deep learning frameworks such as PyTorch, TensorFlow, or similar.
  • Develop realtime, lowlatency inference systems for text classification, embeddings, semantic search, summarization, and retrieval.
  • Create prompts, context graphs, and agentic workflows for LLM-based systems.
  • Apply knowledge of prompt engineering, context engineering, and autonomous agent frameworks to production systems.
Core Data Science Engineering & MLOps

  • Work in Databricks for ETL, feature engineering, ML training, and orchestration.
  • Use Azure services for model deployment, data pipelines, and infrastructure.
  • Collaborate using Git-based workflows; leverage tools like GitHub Copilot, Claude Code, etc.
  • Implement model monitoring, observability, drift detection, and performance tracking.
Required Skills & Experience

Core Skills

  • Strong hands-on experience with Databricks (Delta Lake, MLflow, Job Orchestration).
  • Excellent PySpark skills for large-scale distributed data processing.
  • Proficiency in Azure cloud services (ADF, Azure ML, AKS, Databricks on Azure).
  • Strong understanding of ML algorithms, statistical methods, and data analysis.
  • Experience with deep learning frameworks:
    • PyTorch
    • TensorFlow
    • Transformers (HuggingFace)
  • Experience with model monitoring and ML observability.
  • Ability to write clean, optimized code and leverage AI code assistants.
NLP / GenAI Specific Skills

  • Prompt engineering (task prompts, chain of thought, tool calling, retrieval prompts).
  • Context engineering (retrieval pipelines, RAG, memory management, context structuring).
  • Knowledge of LLM-based agentic frameworks (LangChain, Semantic Kernel, CrewAI, AutoGen, etc.).
  • Experience with vector databases and embedding models is a plus.
Good to Have Skills

  • Experience with containerization (Docker, Kubernetes, AKS).
  • Experience deploying models to production (REST APIs, real-time endpoints).
  • Knowledge of streaming technologies (Kafka, EventHub, Spark Streaming).
  • Understanding of CI/CD for ML (Azure DevOps / GitHub Actions).
Who You Are

  • A problem solver who is comfortable working with both structured and unstructured data.
  • Someone who enjoys using modern AI tools to accelerate development.
  • A data scientist who writes clean, production-grade code.
  • A collaborator who thrives in cross-functional teams and fast-paced environments.

Flexible work from home options available.