LLM Testing Jobs (NOW HIRING)

SilverEdge

LLM Security Evaluation Expert

Columbia, MD · On-site

In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...

Apple

Machine Learning Engineer - LLM

Cupertino, CA

$147K - $220K/yr

Experience with LLM and LMM development and fine-tuning ... Experience applying ML techniques in manufacturing, testing, or hardware optimization. Proficiency ...

Noblesoft Technologies

Test Lead - Accessibility Testing

Oregon City, OR · Remote

$48.50 - $66.25/hr

Experience in designing LLM/RAG test automation solutions * Experience in testing for bias, drift, and fairness. * Familiarity with performance metrics (precision, recall, F1, ROC-AUC). * Knowledge ...

Purple Drive Technologies

AI Full Stack Developer (GenAI / LLM Focus)

Minneapolis, MN · On-site

... LLM/GenAI capabilities into real-world business workflows. This role is focused on AI-driven ... testing

Saviance

Generative AI Engineer (LLM Expert - AWS Focus)

... and testing. • Ensure secure, containerized deployments using Docker and integrate SSO and role ... Preferred : • Experience with LangGraph or other LLM orchestration frameworks. • Familiarity ...

Horizon3.ai

Staff Attack Engineer, AI/LLM

Solid penetration testing fundamentals and understanding of common attack chains. * Familiarity with AI/LLM security frameworks (e.g., OWASP Top 10 for LLMs, MITRE ATLAS). * Experience in a security ...

Datadog

Product Solutions Architect - LLM Observability

Boston, MA · On-site

$151K - $222K/yr

Experience with LLM application evaluation and testing frameworks. Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That ...

Brightplan

Applied AI Engineer

$125K - $155K/yr

Familiarity with prompt evaluation or LLM testing approaches * Experience integrating AI into production SaaS platforms * Experience working in fintech or B2B SaaS Benefits Why Join BrightPlan

Request Technology, LLC

Principal Artificial Intelligence LLM Engineer

Chicago, IL · Hybrid

... Implement testing, evaluation, and monitoring frameworks for AI systems including hallucination detection and bias assessment Establish safety guardrails and responsible AI practices for LLM ...

Driver AI Inc.

Applied Data Scientist, LLM Evaluation

Austin, TX · Remote

Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis. * Experience designing and running evaluations for LLM or NLP systems ...

Innova Solutions, Inc

Cyber Security PenTester - GenAI & LLM

Charlotte, NC · Remote

$75 - $80/hr

Perform penetration testing on LLM and ML-based applications * Review and analyze complex multi-faceted larger scale or longer-term Information Security Analysis challenges. * Use Burp Suite heavily ...

TMS

Principal AI Engineer - Agentic Systems and LLM Platforms

Irvine, CA · On-site

... testing. Technical Skills: Agent frameworks (LangGraph, Semantic Kernel, similar orchestration ... LLM outputs. Key Responsibilities: 1. Agentic AI System Design Engineering: Design and implement ...

Driver AI Inc.

Applied Data Scientist, LLM Evaluation

Austin, TX · On-site +1

$175K - $275K/yr

Internet Brands

Data Scientist AI

El Segundo, CA · Hybrid

Prior work with model evaluation benchmarks and large-scale LLM testing frameworks. * Experience with prompt fine-tuning techniques and LLM optimization techniques (pre-training, post-training, etc.)

Noblesoft Technologies

Golang Developer with Devops/LLM exp

San Francisco, CA · Remote

Golang Developer with Devops/LLM exp Location: California (Remote) We are looking for devs with ... testing, release, and operations. * Build tooling and observability to monitor system health, and ...

Internet Brands

Data Scientist AI

El Segundo, CA · On-site

Prior work with model evaluation benchmarks and large-scale LLM testing frameworks. * Experience with prompt fine-tuning techniques and LLM optimization techniques (pre-training, post-training, etc.)

WorkNovas LLC

Full Stack Developer (AI/LLM Focus)

Rockville, MD · On-site

Full Stack Developer (AI/LLM Focus) Location: 9509 key west drive Rockville, MD 20850 need local ... Implement automated testing using tools like Selenium, Cypress, or Playwright * Optimize ...

smart folks inc

ML OPS and LLM OPS (Architect)

Austin, TX · On-site

ML OPS and LLM OPS (Architect) Location: Remote Duration: Full-time Position: ML OPS and LLM OPS ... testing ML models, Git and version control, frameworks like Flask, FastAPI. - Hands-on experience ...

LLM Testing information

How much do LLM testing jobs pay per hour?

As of Jun 6, 2026, the average hourly pay for LLM testing jobs in the United States is $39.41, according to ZipRecruiter salary data. Most workers in this role earn between $30.05 and $47.84 per hour, depending on experience, location, and employer.

Infographic showing LLM testing job openings in the United States as of May 2026, with employment types broken down into 5% As Needed, 20% Full Time, 56% Part Time, 2% Temporary, 15% Contract, and 2% Nights. It highlights a job distribution of 76% Physical, 5% Hybrid, and 19% Remote, with an average salary of $81,981 per year, or $39.41 per hour.

LLM Security Evaluation Expert

SilverEdge

Columbia, MD • On-site

Full-time

Posted 14 days ago

Job description

Overview
SilverEdge Government Solutions is seeking a highly skilled LLM Security Evaluation Expert to join our team. In this role, you will be responsible for rigorously testing the security and integrity of Large Language Models (LLMs). Your primary focus will be on designing and executing sophisticated adversarial prompt attacks to identify potential vulnerabilities, assess the model's resistance to exploitation, and ensure it maintains consistent, secure behavior. This is a critical role in safeguarding our AI systems and ensuring they operate responsibly.
Required Qualifications

TS/SCI clearance with polygraph
Strong knowledge of how LLMs work, including their architecture, training processes, capabilities, and inherent limitations.
Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common characteristics.
Proven experience in crafting and refining prompts to elicit specific behaviors or bypass restrictions in LLMs.
Demonstrable understanding of techniques like jailbreaking, prompt injection, role-playing attacks, and exploiting model biases.
Strong understanding of cybersecurity principles and common attack vectors, particularly as they apply to AI/ML systems.
Ability to think like an attacker and anticipate potential exploits.
Excellent ability to analyze complex systems, identify subtle vulnerabilities, and systematically test hypotheses.
Clear and concise written and verbal communication skills, with the ability to document technical findings thoroughly.
Understanding of the ethical implications of AI security and commitment to responsible testing practices.

About SilverEdge
SilverEdge Government Solutions was founded on the belief that nurturing talent and collaborating closely with our customers enables us to think big and deliver the best for our country. Our mission is to bring top technology talent together to solve the world's most challenging problems while protecting the United States and our allies.

SilverEdge Government Solutions, LLC is an Equal Opportunity Employer and applicants receive lawful consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.
