1

Applied Data Scientist Jobs (NOW HIRING)

Microsoft AI is seeking a highly skilled and experienced Applied Data Scientist to join their dynamic team. The role focuses on elevating the quality of User Identity graph and involves designing and ...

New

Applied Data Scientist

New York, NY ยท On-site

$160K - $200K/yr

About the Role We are looking for a skilled Applied Data Scientist with a strong foundation in data engineering principles to play a key role in developing and implementing data-driven solutions. A ...

Senior (Applied) Data Scientist

New York, NY ยท On-site

$144K - $181K/yr

Learn more about us at getclair.com/about About the Role As an Applied Data Scientist on Clair's Data Science team, you'll be responsible for developing and maintaining the next generation of credit ...

Senior / Lead Applied Data Scientist 100% Remote (U.S. Based Only, Select States - See Below) About the role We're looking for a product-minded, AI-native Data Scientist who operates like a mini ...

We're seeking an Applied Data Scientist III to turn our backlog of AI use cases into scalable, production-ready solutions. This role is both hands-on technical and strategy, combining model ...

Data Scientist II

Camden, NJ ยท On-site

$100K - $120K/yr

This role is ideal for a hands-on, applied data scientist who enjoys working on complex systems, translating ambiguous business problems into analytical models, and seeing their work put into ...

Data Scientist II

Camden, NJ ยท On-site

$100K - $120K/yr

This role is ideal for a hands-on, applied data scientist who enjoys working on complex systems, translating ambiguous business problems into analytical models, and seeing their work put into ...

Top of market salary + equity As a Staff Applied Data Scientist: You will play a key technical role on our Engineering team, identifying and evaluating trends, insights across large data sets and ...

Top of market salary + equity As a Staff Applied Data Scientist: You will play a key technical role on our Engineering team, identifying and evaluating trends, insights across large data sets and ...

Track experiments using MLflow, Weights & Biases Required Qualifications: * 3-5 years in applied data science or machine learning * Experience with Snowflake SQL , Python ML ecosystem, healthcare ...

next page

Showing results 1-20

Applied Data Scientist information

See salary details

$37.5K

$122.7K

$196.5K

How much do applied data scientist jobs pay per year?

As of Jun 10, 2026, the average yearly pay for applied data scientist in the United States is $122,738.00, according to ZipRecruiter salary data. Most workers in this role earn between $98,500.00 and $136,000.00 per year, depending on experience, location, and employer.

What are applied data scientists?

Applied data scientists are professionals who use statistical analysis, machine learning, and programming to extract insights from data and solve real-world business or research problems. Unlike theoretical data scientists, applied data scientists focus on implementing practical solutions that directly impact organizations, such as improving processes, predicting trends, or optimizing operations. They work closely with stakeholders to understand requirements, develop predictive models, and communicate results in a clear and actionable way. Their expertise typically spans data wrangling, model building, and deployment of data-driven applications.

What are the key skills and qualifications needed to thrive as an Applied Data Scientist, and why are they important?

To thrive as an Applied Data Scientist, you need a strong background in statistics, machine learning, and programming languages such as Python or R, often supported by a degree in a quantitative field. Familiarity with tools like TensorFlow, PyTorch, SQL, and data visualization platforms, as well as experience deploying models in production, is typically required. Critical thinking, effective communication, and problem-solving skills help translate complex data insights into actionable business solutions. These competencies are essential for extracting value from data and driving impactful, data-informed decisions within organizations.

What is the difference between Applied Data Scientist vs Data Analyst?

AspectApplied Data ScientistData Analyst
Required SkillsStatistical modeling, machine learning, programming (Python, R)Data visualization, basic statistics, Excel, SQL
Work EnvironmentDeveloping predictive models, advanced analytics, researchReporting, data cleaning, descriptive analysis
Common ToolsPython, R, SQL, cloud platformsExcel, Tableau, Power BI, SQL
Industry UsageTech, finance, healthcare, e-commerceRetail, marketing, finance, healthcare

Applied Data Scientists focus on building predictive models and advanced analytics, requiring programming and statistical skills. Data Analysts primarily handle data reporting and visualization, with a focus on descriptive insights. Both roles are essential in data-driven organizations but differ in complexity and scope.

What is the 80 20 rule in data science?

In data science, the 80/20 rule, also known as the Pareto principle, suggests that roughly 80% of results come from 20% of the efforts or features. Applied by applied data scientists, it helps prioritize data cleaning, feature selection, and model tuning to focus on the most impactful variables and tasks.

What are the typical collaboration dynamics between Applied Data Scientists and other teams within an organization?

Applied Data Scientists frequently work cross-functionally, partnering with engineering, product management, and business analytics teams. They translate complex data insights into actionable recommendations, often presenting findings to non-technical stakeholders. Regular interaction with software engineers is common for model deployment, while close alignment with business teams ensures data solutions address real-world challenges. Effective communication and adaptability are key to thriving in these collaborative environments.
More about Applied Data Scientist jobs
What cities are hiring for Applied Data Scientist jobs? Cities with the most Applied Data Scientist job openings:
What states have the most Applied Data Scientist jobs? States with the most job openings for Applied Data Scientist jobs include:
Infographic showing various Applied Data Scientist job openings in the United States as of June 2026, with employment types broken down into 2% Internship, 2% As Needed, 86% Full Time, 5% Part Time, and 5% Contract. Highlights an 90% Physical, 3% Hybrid, and 7% Remote job distribution, with an average salary of $122,738 per year, or $59 per hour.

Applied Data Scientist, LLM Evaluation

Driver AI Inc.

Austin, TX โ€ข Remote

Other

Medical, Dental, Vision, Life, Retirement

Posted 17 days ago


Job description

Applied Data Scientist, LLM Evaluation Introduction

At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily asynchronous/distributed backend server, and a frontend web application that provides a rich user experience.

About Driver

We're an early-stage startup backed by Y Combinator and Google Ventures that combines first principles technical approaches and applied LLM expertise to tackle context engineering at scale. Driver builds the context layer for employees and AI agents alike to use in developing software.

Working at Driver

Driver is an early-stage but fast-growing startup. As such, we take advantage of that which startups can excel: delivery speed, flexibility, and enjoying working with a small close-knit team.

Organizational and engineering values at Driver include first-principles thinking, correct by construction, writing things down, experimentation and iteration, pragmatism, commitment to effective communication and transparency, autonomy, and ambition.

Job Overview

Title: Applied Data Scientist, LLM Evaluation

Location: Remote or Austin, Tx

Our value is directly tied to the quality of our content at scale. The platform generates technical documentation across a complex, multi-stage pipeline - producing multiple content types at different levels of abstraction, from individual code elements up to high-level summaries. Today, changes to models, context strategies, or pipeline architecture are evaluated largely through manual review and intuition. There is no systematic way to answer: "Did this change make our output better, worse, or the same - and for which languages, repo sizes, and content types?"

This is a hard problem. LLM outputs are non-deterministic - identical inputs produce different outputs across runs, and small variations at early pipeline stages compound into meaningfully different end-user content downstream. Evaluating quality requires methodology that accounts for this: statistical reasoning over multiple runs, understanding of cascade effects through the pipeline, and rubrics that balance human judgment with automated signals.

This role builds the evaluation function from scratch. You'll define what "good" means for our generated content, build the infrastructure to measure it, and create the experimental framework that lets the team ship changes with confidence.

What You'll Do

You'll own the LLM evaluation strategy at Driver - from first principles to production infrastructure. This is a foundational role: you're not joining an existing eval team, you're building it. As the function matures, you'll seed and grow a team around it.

Define quality metrics and build evaluation datasets. Establish what "good" looks like for each content type across the pipeline. Build and curate gold-standard evaluation datasets across languages and repo archetypes (monorepos, microservices, libraries, applications). Design rubrics that capture accuracy, completeness, usefulness, and readability.

Build benchmarking and experimentation infrastructure. Create automated evaluation pipelines that score output against reference datasets. Instrument the content generation pipeline to support A/B comparisons - run the same codebase through two strategies and compare results. Build tooling for LLM-as-judge evaluation and regression detection. Integrate evaluation into CI so pipeline changes come with quality evidence.

Develop automated quality signals at scale. Build quality checks that flag degraded output without requiring human review of every document. Monitor content quality trends over time. Design sampling strategies for human review that maximize signal with minimal annotation effort.

Quantify tradeoffs and inform decisions. Run experiments on model selection, context strategies, and pipeline architecture changes. Quantify cost/quality/latency tradeoffs. Partner with the engineering team to turn evaluation insights into shipped improvements.

Qualifications

Education: Bachelor's, Master's, or PhD in Statistics, Machine Learning, Data Science, Computational Linguistics, or a related quantitative field.

Experience: Minimum 3 - 5 years in applied science, ML engineering, or data science roles with a focus on evaluation, NLP, or generative AI. 7+ years experience preferred.

Required Technical Skills

  • Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis.
  • Experience designing and running evaluations for LLM or NLP systems - you've thought carefully about what "better" means when outputs are open-ended text.
  • Proficient in Python and the scientific/data stack (pandas, NumPy, scipy, sklearn).
  • Comfortable working in Jupyter notebooks for exploration and prototyping, and turning that work into automated pipelines.
  • Experience with LLM-as-judge approaches, inter-annotator agreement, and rubric design for subjective quality assessment.
  • Familiarity with the practical challenges of non-deterministic systems: variance decomposition, multi-run methodology, distinguishing signal from noise at scale.
  • Strong data storytelling - you can turn experiment results into clear recommendations that drive engineering and product decisions.

Preferred and Nice-to-Have Technical Skills

  • Experience with LLM APIs and prompt engineering across multiple providers.
  • Familiarity with evaluation frameworks (e.g., RAGAS, DeepEval, custom harnesses).
  • Experience building data pipelines or ETL workflows (Airflow, Dagster, or similar).
  • Comfort with SQL and working directly against production data stores.
  • Experience with visualization tools (Matplotlib, Plotly, Streamlit) for building internal dashboards and reports.
  • Background in code understanding, developer tools, or technical documentation.
  • Experience building or managing annotation pipelines and human evaluation workflows.
Benefits
  • Competitive Compensation Packages - Cash & Equity
  • Flexible Work Culture
  • Unlimited Time Off + 12 Paid Company Holidays
  • Insurance - Health, Dental, & Vision
  • Life Insurance & FSA Accounts
  • 401(k) Retirement Accounts - Traditional, Roth, or Both
  • Quarterly Team Offsites

Driver is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.