1

Llm Jobs (NOW HIRING)

LLM Engineer[Onsite]

Houston, TX · Remote

$170K/yr

LLM Engineer 6-month contract Houston,TX (Onsite) USC/GC Required Skills & Experience -3+ years of large language modeling experience -5+ years of python experience -Strong problem solving skills and ...

As an AI/LLM Engineer, you will lead the design and implementation of advanced systems centered on large language models and natural language understanding. Your work will involve techniques such as ...

As an AI/LLM Engineer, you will lead the design and implementation of advanced systems centered on large language models and natural language understanding. Your work will involve techniques such as ...

next page

Showing results 1-20

Llm information

See salary details

$85K

$142.7K

$182K

How much do llm jobs pay per year?

As of Jun 11, 2026, the average yearly pay for llm in the United States is $142,663.00, according to ZipRecruiter salary data. Most workers in this role earn between $126,000.00 and $157,500.00 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as an LLM (Master of Laws) graduate, and why are they important?

To thrive as an LLM graduate, you need advanced knowledge of legal principles, strong research and analytical skills, and a prior law degree such as an LLB or JD. Familiarity with legal databases, research tools like Westlaw or LexisNexis, and sometimes bar admission or certification in specific jurisdictions is advantageous. Exceptional written and verbal communication, attention to detail, and cross-cultural competence are standout soft skills in this field. These abilities are crucial for interpreting complex legal issues, advising clients, and succeeding in global or specialized legal practice.

What is a $900,000 AI job?

A $900,000 AI job typically refers to high-level roles in artificial intelligence, such as senior machine learning engineers, AI research directors, or chief AI officers, often requiring advanced skills in deep learning, data science, and programming. These positions usually involve leadership, strategic planning, and extensive experience, and may include stock options or bonuses that contribute to the total compensation package.

What can you do with an LLM degree?

An LLM degree qualifies individuals for advanced legal roles such as legal analyst, compliance officer, or law professor. It can also enhance expertise in specialized areas like international law or intellectual property, often requiring strong research, writing, and analytical skills. The degree may lead to opportunities in law firms, corporate legal departments, or government agencies.

What are LLMs (Large Language Models)?

LLMs, or Large Language Models, are advanced artificial intelligence systems designed to understand and generate human-like text based on vast amounts of data. These models, such as OpenAI's GPT series, are trained on diverse datasets and can perform a range of tasks, including answering questions, writing content, translating languages, and more. LLMs work by predicting the next word in a sequence, allowing them to create coherent and contextually relevant responses. They are widely used in applications like chatbots, virtual assistants, and automated content generation.

What jobs can you do with LLM?

With an LLM (Master of Laws), you can pursue careers in legal practice, such as lawyer, legal advisor, or corporate counsel. It also qualifies you for roles in academia, legal research, compliance, and policy analysis, often requiring strong analytical and research skills.

What job makes $10,000 a month without a degree?

A machine learning engineer or AI specialist can earn $10,000 or more per month through expertise in developing and deploying AI models, often requiring strong programming skills in Python and knowledge of frameworks like TensorFlow or PyTorch. Success in such roles depends on experience, project complexity, and the ability to work independently or in high-demand environments, often without formal degrees but with relevant skills and certifications.

What Is an LL.M.?

A Master of Laws, or LL.M., is an advanced legal degree designed for lawyers or legal scholars who want to demonstrate their expertise in a specific area of law after law school. While a juris doctoris (JD) is the most common degree people receive at law school, an LL.M. is a secondary degree that typically takes an additional year to complete. The LL.M. curriculum includes coursework in U.S., Canadian, and international law, and is meant for an attorney wanting to specialize in a specific area of law. An LL.M. also provides foreign attorneys with the necessary skills and background to practice law in the United States.

What is the difference between Llm vs Paralegal?

AspectLlmParalegal
Required CredentialsLaw degree (JD or equivalent), possibly an LLM for specializationAssociate's degree or certificate in paralegal studies
Work EnvironmentLaw firms, corporate legal departments, academiaLaw firms, corporate legal departments, government agencies
Industry UsageLegal practice, academia, researchLegal support, case preparation, client communication

The main difference is that an Llm is an advanced law degree for specialization or academic purposes, while a paralegal provides legal support and case assistance without being licensed to practice law. Both roles work closely within legal environments, but the Llm is more focused on legal expertise and research, whereas paralegals handle administrative and preparatory tasks.

What are some common challenges faced by professionals working with large language models (LLMs) and how can they be addressed?

Professionals working with large language models often encounter challenges such as managing computational resource demands, ensuring data privacy, and mitigating biases in model outputs. Collaboration with data engineers and IT teams is essential to optimize infrastructure and streamline model deployment. Staying updated on best practices and regulatory guidelines helps address ethical concerns and improve model performance. Continuous monitoring and iteration are key to maintaining accuracy and relevance in real-world applications.
What cities are hiring for Llm jobs? Cities with the most Llm job openings:
What are the most commonly searched types of Llm jobs? The most popular types of Llm jobs are:
What states have the most Llm jobs? States with the most job openings for Llm jobs include:

Applied Data Scientist, LLM Evaluation

Driver AI Inc.

Austin, TX • On-site, Remote

$175K - $275K/yr

Full-time

Medical, Dental, Vision, Life, Retirement

Posted 18 days ago


Job description

Applied Data Scientist, LLM Evaluation
Introduction
At Driver, we're building systems that turn source code into human language. The tech stack includes a core compiler-like engine, a heavily asynchronous/distributed backend server, and a frontend web application that provides a rich user experience.
About Driver
We're an early-stage startup backed by Y Combinator and Google Ventures that combines first principles technical approaches and applied LLM expertise to tackle context engineering at scale. Driver builds the context layer for employees and AI agents alike to use in developing software.
Working at Driver
Driver is an early-stage but fast-growing startup. As such, we take advantage of that which startups can excel: delivery speed, flexibility, and enjoying working with a small close-knit team.
Organizational and engineering values at Driver include first-principles thinking, correct by construction, writing things down, experimentation and iteration, pragmatism, commitment to effective communication and transparency, autonomy, and ambition.
Job Overview
Title: Applied Data Scientist, LLM Evaluation
Location: Remote or Austin, Tx
Our value is directly tied to the quality of our content at scale. The platform generates technical documentation across a complex, multi-stage pipeline - producing multiple content types at different levels of abstraction, from individual code elements up to high-level summaries. Today, changes to models, context strategies, or pipeline architecture are evaluated largely through manual review and intuition. There is no systematic way to answer: "Did this change make our output better, worse, or the same - and for which languages, repo sizes, and content types?"
This is a hard problem. LLM outputs are non-deterministic - identical inputs produce different outputs across runs, and small variations at early pipeline stages compound into meaningfully different end-user content downstream. Evaluating quality requires methodology that accounts for this: statistical reasoning over multiple runs, understanding of cascade effects through the pipeline, and rubrics that balance human judgment with automated signals.
This role builds the evaluation function from scratch. You'll define what "good" means for our generated content, build the infrastructure to measure it, and create the experimental framework that lets the team ship changes with confidence.
What You'll Do
You'll own the LLM evaluation strategy at Driver - from first principles to production infrastructure. This is a foundational role: you're not joining an existing eval team, you're building it. As the function matures, you'll seed and grow a team around it.
Define quality metrics and build evaluation datasets. Establish what "good" looks like for each content type across the pipeline. Build and curate gold-standard evaluation datasets across languages and repo archetypes (monorepos, microservices, libraries, applications). Design rubrics that capture accuracy, completeness, usefulness, and readability.
Build benchmarking and experimentation infrastructure. Create automated evaluation pipelines that score output against reference datasets. Instrument the content generation pipeline to support A/B comparisons - run the same codebase through two strategies and compare results. Build tooling for LLM-as-judge evaluation and regression detection. Integrate evaluation into CI so pipeline changes come with quality evidence.
Develop automated quality signals at scale. Build quality checks that flag degraded output without requiring human review of every document. Monitor content quality trends over time. Design sampling strategies for human review that maximize signal with minimal annotation effort.
Quantify tradeoffs and inform decisions. Run experiments on model selection, context strategies, and pipeline architecture changes. Quantify cost/quality/latency tradeoffs. Partner with the engineering team to turn evaluation insights into shipped improvements.
Qualifications
Education: Bachelor's, Master's, or PhD in Statistics, Machine Learning, Data Science, Computational Linguistics, or a related quantitative field.
Experience: Minimum 3 - 5 years in applied science, ML engineering, or data science roles with a focus on evaluation, NLP, or generative AI. 7+ years experience preferred.
Required Technical Skills
  • Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis.
  • Experience designing and running evaluations for LLM or NLP systems - you've thought carefully about what "better" means when outputs are open-ended text.
  • Proficient in Python and the scientific/data stack (pandas, NumPy, scipy, sklearn).
  • Comfortable working in Jupyter notebooks for exploration and prototyping, and turning that work into automated pipelines.
  • Experience with LLM-as-judge approaches, inter-annotator agreement, and rubric design for subjective quality assessment.
  • Familiarity with the practical challenges of non-deterministic systems: variance decomposition, multi-run methodology, distinguishing signal from noise at scale.
  • Strong data storytelling - you can turn experiment results into clear recommendations that drive engineering and product decisions.

Preferred and Nice-to-Have Technical Skills
  • Experience with LLM APIs and prompt engineering across multiple providers.
  • Familiarity with evaluation frameworks (e.g., RAGAS, DeepEval, custom harnesses).
  • Experience building data pipelines or ETL workflows (Airflow, Dagster, or similar).
  • Comfort with SQL and working directly against production data stores.
  • Experience with visualization tools (Matplotlib, Plotly, Streamlit) for building internal dashboards and reports.
  • Background in code understanding, developer tools, or technical documentation.
  • Experience building or managing annotation pipelines and human evaluation workflows.
Benefits
  • Competitive Compensation Packages - Cash & Equity
  • Flexible Work Culture
  • Unlimited Time Off + 12 Paid Company Holidays
  • Insurance - Health, Dental, & Vision
  • Life Insurance & FSA Accounts
  • 401(k) Retirement Accounts - Traditional, Roth, or Both
  • Quarterly Team Offsites

Driver is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.