Scientist Llm Jobs (NOW HIRING)

Applied Data Scientist, LLM Evaluation

Applied Data Scientist, LLM Evaluation Introduction At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily ...

Driver AI Inc.

Applied Data Scientist, LLM Evaluation

Austin, TX · Remote

Applied Data Scientist, LLM Evaluation Introduction At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily ...

Driver AI Inc.

Applied Data Scientist, LLM Evaluation

Austin, TX · On-site +1

$175K - $275K/yr

Applied Data Scientist, LLM Evaluation Introduction At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily ...

Driver AI Inc.

Applied Data Scientist, LLM Evaluation

Austin, TX · On-site +1

$175K - $275K/yr

Applied Data Scientist, LLM Evaluation Introduction At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily ...

Centific

Research Scientist, LLM Evaluation & Post-Training

About Job Research Scientist, LLM Evaluation & Post-Training Company: Centific Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote) Type: Full-time Role Overview As a Research Scientist, LLM ...

Centific

Research Scientist, LLM Evaluation & Post-Training

Rethink Recruit

Research Scientist - LLM

Redwood City, CA · On-site

They are seeking a Research Scientist specializing in LLMs to explore new techniques, develop ... Required : • Strong ML research background with experience on advanced problems such as LLM pre ...

Rethink Recruit

Research Scientist - LLM

Redwood City, CA · On-site

Innodata Inc.

Applied Research Scientist, LLM Evaluation & Post-Training

As an Applied Research Scientist, LLM Evaluation & Post-Training, you will lead research and experimentation on how evaluation design, measurement strategies, and feedback signals influence model ...

Innodata Inc.

Applied Research Scientist, LLM Evaluation & Post-Training

AceStack LLC

Senior Applied AI Research Scientist - LLM / Generative AI

Seattle, WA · On-site

$112K - $142K/yr

Senior Applied AI Research Scientist - LLM / Generative AI Location: Seattle, WA FTE * 5+ years of building models for business application experience * PhD, or Master's degree and 6+ years of CS, CE ...

AceStack LLC

Senior Applied AI Research Scientist - LLM / Generative AI

Seattle, WA · On-site

$112K - $142K/yr

Propio

AI Data Strategy Engineer / Applied Scientist, LLM Data

Leawood, KS · On-site

They are seeking an AI Data Strategy Engineer / Applied Scientist, LLM Data to manage the data strategy and workflows that enhance their multilingual AI systems. The role involves overseeing the ...

Propio

AI Data Strategy Engineer / Applied Scientist, LLM Data

Leawood, KS · On-site

Apple

Research Scientist/Engineer - LLM Efficiency

Santa Clara, CA

$184K - $324K/yr

We're seeking research scientists and engineers to create breakthrough innovations in machine learning. We are particularly interested in efficiency of frontier models, including LLMs and diffusion ...

Apple

Research Scientist/Engineer - LLM Efficiency

Santa Clara, CA

$184K - $324K/yr

Capital One

Principal Associate, Data Scientist - LLM Customization Team

New York, NY · On-site

$64K - $65K/yr

Principal Associate, Data Scientist - LLM Customization Team At Capital One, we think big and do big things. We are not just a nationally recognized credit card issuer, a top 10 bank by deposit, but ...

Capital One

Principal Associate, Data Scientist - LLM Customization Team

New York, NY · On-site

$64K - $65K/yr

Capital One

Principal Associate, Data Scientist - LLM Customization Team

Mclean, VA · On-site

$59K - $60K/yr

Capital One

Principal Associate, Data Scientist - LLM Customization Team

Mclean, VA · On-site

$59K - $60K/yr

Capital One

Principal Associate, Data Scientist - LLM Customization Team

New York, NY · On-site

$64K - $65K/yr

Capital One

Principal Associate, Data Scientist - LLM Customization Team

New York, NY · On-site

$64K - $65K/yr

Propio

AI Data Strategy Engineer / Applied Scientist, LLM Data

Leawood, KS · On-site

Propio is hiring an AI Data Strategy Engineer / Applied Scientist, LLM Data to own the data strategy, curation pipelines, annotation workflows, and evaluation datasets that power our multilingual AI ...

Quick apply

Propio

AI Data Strategy Engineer / Applied Scientist, LLM Data

Leawood, KS · On-site

Propio Language Services

AI Data Strategy Engineer / Applied Scientist, LLM Data

Overland Park, KS · On-site

Propio Language Services

AI Data Strategy Engineer / Applied Scientist, LLM Data

Overland Park, KS · On-site

Propio

AI Data Strategy Engineer / Applied Scientist, LLM Data

Overland Park, KS · On-site

Propio

AI Data Strategy Engineer / Applied Scientist, LLM Data

Overland Park, KS · On-site

Amazon

Senior Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA

$107K - $146K/yr

Amazon Web Services (AWS) is assembling an elite team of world-class scientists and engineers to ... Join the Amazon Kiro LLM-Training team and help create groundbreaking generative AI technologies ...

Amazon

Senior Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA

$107K - $146K/yr

Amazon

Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA · On-site

Amazon

Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA · On-site

Amazon

Senior Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA

$107K - $146K/yr

Amazon

Senior Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA

$107K - $146K/yr

Amazon

Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA

Amazon

Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA

Amazon

Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA · On-site

Amazon

Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA · On-site

Amazon

Senior Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA · On-site

$107K - $146K/yr

Amazon

Senior Applied Scientist, LLM Code Agents, Kiro Science

Santa Clara, CA · On-site

$107K - $146K/yr

Showing results 1-20

Scientist Llm Jobs

Scientist Llm information

How do Scientists specializing in large language models (LLMs) typically collaborate with engineering and product teams?

Scientists focusing on large language models often work closely with engineering teams to translate research breakthroughs into scalable, production-ready systems. They regularly participate in cross-functional meetings to align model development with product requirements and user needs. This collaboration ensures that scientific advancements are effectively integrated into real-world applications, and scientists frequently provide technical guidance on model deployment, optimization, and evaluation. These interactions foster a dynamic environment where ideas are rapidly prototyped and iterated upon, contributing directly to impactful product features.

What is a Scientist LLM?

A Scientist LLM is a professional who specializes in applying large language models (LLMs) to scientific research and problem-solving. They combine expertise in machine learning, natural language processing, and a specific scientific domain to develop, fine-tune, and evaluate LLMs for tasks such as literature analysis, hypothesis generation, and data interpretation. Scientist LLMs often work in multidisciplinary teams to advance scientific discovery using state-of-the-art AI technologies.

What are the key skills and qualifications needed to thrive as a Scientist LLM, and why are they important?

To thrive as a Scientist LLM (Large Language Model Scientist), you need a strong background in machine learning, natural language processing, and advanced programming skills, typically supported by a relevant PhD or master’s degree. Familiarity with tools like Python, TensorFlow, PyTorch, and experience working with large-scale data sets and cloud computing platforms is essential. Strong problem-solving abilities, collaboration, and clear communication are valuable soft skills for innovating and sharing complex ideas. These competencies ensure the effective development, deployment, and advancement of large language models in real-world applications.

What is the difference between Scientist Llm vs Data Scientist?

Aspect	Scientist Llm	Data Scientist
Required Credentials	Advanced degree in law, AI, or related fields; knowledge of legal and AI principles	Degree in computer science, statistics, or related fields; strong programming skills
Work Environment	Research labs, AI companies, legal tech firms	Tech companies, finance, healthcare, consulting
Industry Usage	Legal AI applications, compliance, legal research	Data analysis, predictive modeling, business insights

Scientist Llm professionals focus on applying AI within legal contexts, requiring legal and AI expertise, while Data Scientists analyze data across various industries, emphasizing statistical and programming skills. Both roles involve research and technical work but serve different industry needs.

More about Scientist Llm jobs

The 10 Top Types Of Scientist Llm Jobs

What cities are hiring for Scientist Llm jobs? Cities with the most Scientist Llm job openings:

What states have the most Scientist Llm jobs? States with the most job openings for Scientist Llm jobs include:

What job categories do people searching Scientist Llm jobs look for? The top searched job categories for Scientist Llm jobs are:

Scientist Llm jobs near you

Infographic showing various Scientist Llm job openings in the United States as of July 2026, with employment types broken down into 1% As Needed, 96% Full Time, 1% Part Time, and 2% Contract. Highlights an 78% Physical, 5% Hybrid, and 17% Remote job distribution.

Applied Data Scientist, LLM Evaluation

Driver AI Inc.

Austin, TX • Remote

Apply

Other

Medical, Dental, Vision, Life, Retirement

Re-posted 9 days ago

Job description

Applied Data Scientist, LLM Evaluation Introduction

At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily asynchronous/distributed backend server, and a frontend web application that provides a rich user experience.

About Driver

We're an early-stage startup backed by Y Combinator and Google Ventures that combines first principles technical approaches and applied LLM expertise to tackle context engineering at scale. Driver builds the context layer for employees and AI agents alike to use in developing software.

Working at Driver

Driver is an early-stage but fast-growing startup. As such, we take advantage of that which startups can excel: delivery speed, flexibility, and enjoying working with a small close-knit team.

Organizational and engineering values at Driver include first-principles thinking, correct by construction, writing things down, experimentation and iteration, pragmatism, commitment to effective communication and transparency, autonomy, and ambition.

Job Overview

Title: Applied Data Scientist, LLM Evaluation

Location: Remote or Austin, Tx

Our value is directly tied to the quality of our content at scale. The platform generates technical documentation across a complex, multi-stage pipeline - producing multiple content types at different levels of abstraction, from individual code elements up to high-level summaries. Today, changes to models, context strategies, or pipeline architecture are evaluated largely through manual review and intuition. There is no systematic way to answer: "Did this change make our output better, worse, or the same - and for which languages, repo sizes, and content types?"

This is a hard problem. LLM outputs are non-deterministic - identical inputs produce different outputs across runs, and small variations at early pipeline stages compound into meaningfully different end-user content downstream. Evaluating quality requires methodology that accounts for this: statistical reasoning over multiple runs, understanding of cascade effects through the pipeline, and rubrics that balance human judgment with automated signals.

This role builds the evaluation function from scratch. You'll define what "good" means for our generated content, build the infrastructure to measure it, and create the experimental framework that lets the team ship changes with confidence.

What You'll Do

You'll own the LLM evaluation strategy at Driver - from first principles to production infrastructure. This is a foundational role: you're not joining an existing eval team, you're building it. As the function matures, you'll seed and grow a team around it.

Define quality metrics and build evaluation datasets. Establish what "good" looks like for each content type across the pipeline. Build and curate gold-standard evaluation datasets across languages and repo archetypes (monorepos, microservices, libraries, applications). Design rubrics that capture accuracy, completeness, usefulness, and readability.

Build benchmarking and experimentation infrastructure. Create automated evaluation pipelines that score output against reference datasets. Instrument the content generation pipeline to support A/B comparisons - run the same codebase through two strategies and compare results. Build tooling for LLM-as-judge evaluation and regression detection. Integrate evaluation into CI so pipeline changes come with quality evidence.

Develop automated quality signals at scale. Build quality checks that flag degraded output without requiring human review of every document. Monitor content quality trends over time. Design sampling strategies for human review that maximize signal with minimal annotation effort.

Quantify tradeoffs and inform decisions. Run experiments on model selection, context strategies, and pipeline architecture changes. Quantify cost/quality/latency tradeoffs. Partner with the engineering team to turn evaluation insights into shipped improvements.

Qualifications

Education: Bachelor's, Master's, or PhD in Statistics, Machine Learning, Data Science, Computational Linguistics, or a related quantitative field.

Experience: Minimum 3 - 5 years in applied science, ML engineering, or data science roles with a focus on evaluation, NLP, or generative AI. 7+ years experience preferred.

Required Technical Skills

Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis.
Experience designing and running evaluations for LLM or NLP systems - you've thought carefully about what "better" means when outputs are open-ended text.
Proficient in Python and the scientific/data stack (pandas, NumPy, scipy, sklearn).
Comfortable working in Jupyter notebooks for exploration and prototyping, and turning that work into automated pipelines.
Experience with LLM-as-judge approaches, inter-annotator agreement, and rubric design for subjective quality assessment.
Familiarity with the practical challenges of non-deterministic systems: variance decomposition, multi-run methodology, distinguishing signal from noise at scale.
Strong data storytelling - you can turn experiment results into clear recommendations that drive engineering and product decisions.

Preferred and Nice-to-Have Technical Skills

Experience with LLM APIs and prompt engineering across multiple providers.
Familiarity with evaluation frameworks (e.g., RAGAS, DeepEval, custom harnesses).
Experience building data pipelines or ETL workflows (Airflow, Dagster, or similar).
Comfort with SQL and working directly against production data stores.
Experience with visualization tools (Matplotlib, Plotly, Streamlit) for building internal dashboards and reports.
Background in code understanding, developer tools, or technical documentation.
Experience building or managing annotation pipelines and human evaluation workflows.

Benefits

Competitive Compensation Packages - Cash & Equity
Flexible Work Culture
Unlimited Time Off + 12 Paid Company Holidays
Insurance - Health, Dental, & Vision
Life Insurance & FSA Accounts
401(k) Retirement Accounts - Traditional, Roth, or Both
Quarterly Team Offsites

Driver is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Apply

Scientist Llm Jobs (NOW HIRING)

Applied Data Scientist, LLM Evaluation

Applied Data Scientist, LLM Evaluation

Applied Data Scientist, LLM Evaluation

Applied Data Scientist, LLM Evaluation

Research Scientist, LLM Evaluation & Post-Training

Research Scientist, LLM Evaluation & Post-Training

Research Scientist - LLM

Research Scientist - LLM

Applied Research Scientist, LLM Evaluation & Post-Training

Applied Research Scientist, LLM Evaluation & Post-Training

Senior Applied AI Research Scientist - LLM / Generative AI

Senior Applied AI Research Scientist - LLM / Generative AI

AI Data Strategy Engineer / Applied Scientist, LLM Data

AI Data Strategy Engineer / Applied Scientist, LLM Data

Research Scientist/Engineer - LLM Efficiency

Research Scientist/Engineer - LLM Efficiency

Principal Associate, Data Scientist - LLM Customization Team

Principal Associate, Data Scientist - LLM Customization Team

Principal Associate, Data Scientist - LLM Customization Team

Principal Associate, Data Scientist - LLM Customization Team

Principal Associate, Data Scientist - LLM Customization Team

Principal Associate, Data Scientist - LLM Customization Team

AI Data Strategy Engineer / Applied Scientist, LLM Data

AI Data Strategy Engineer / Applied Scientist, LLM Data

AI Data Strategy Engineer / Applied Scientist, LLM Data

AI Data Strategy Engineer / Applied Scientist, LLM Data

AI Data Strategy Engineer / Applied Scientist, LLM Data

AI Data Strategy Engineer / Applied Scientist, LLM Data

Senior Applied Scientist, LLM Code Agents, Kiro Science

Senior Applied Scientist, LLM Code Agents, Kiro Science

Applied Scientist, LLM Code Agents, Kiro Science

Applied Scientist, LLM Code Agents, Kiro Science

Senior Applied Scientist, LLM Code Agents, Kiro Science

Senior Applied Scientist, LLM Code Agents, Kiro Science

Applied Scientist, LLM Code Agents, Kiro Science

Applied Scientist, LLM Code Agents, Kiro Science

Applied Scientist, LLM Code Agents, Kiro Science

Applied Scientist, LLM Code Agents, Kiro Science

Senior Applied Scientist, LLM Code Agents, Kiro Science

Senior Applied Scientist, LLM Code Agents, Kiro Science

Scientist Llm information

How do Scientists specializing in large language models (LLMs) typically collaborate with engineering and product teams?

What is a Scientist LLM?

What are the key skills and qualifications needed to thrive as a Scientist LLM, and why are they important?

What is the difference between Scientist Llm vs Data Scientist?

Applied Data Scientist, LLM Evaluation

Share this job

Job description

Share this job