In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...
In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...
In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...
In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...
Machine Learning Engineer - LLM
$147K - $220K/yr
Experience with LLM and LMM development and fine-tuning ... Experience applying ML techniques in manufacturing, testing, or hardware optimization. Proficiency ...
Machine Learning Engineer - LLM
$147K - $220K/yr
Experience with LLM and LMM development and fine-tuning ... Experience applying ML techniques in manufacturing, testing, or hardware optimization. Proficiency ...
In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...
In this role, you will be responsible for rigorously testing the security and integrity of Large ... Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common ...
Test Lead - Accessibility Testing
Oregon City, OR · Remote
$48.50 - $66.25/hr
Experience in designing LLM/RAG test automation solutions * Experience in testing for bias, drift, and fairness. * Familiarity with performance metrics (precision, recall, F1, ROC-AUC). * Knowledge ...
Quick apply
Test Lead - Accessibility Testing
Oregon City, OR · Remote
$48.50 - $66.25/hr
Experience in designing LLM/RAG test automation solutions * Experience in testing for bias, drift, and fairness. * Familiarity with performance metrics (precision, recall, F1, ROC-AUC). * Knowledge ...
... LLM/GenAI capabilities into real-world business workflows. This role is focused on AI-driven ... testing
... LLM/GenAI capabilities into real-world business workflows. This role is focused on AI-driven ... testing
... and testing. • Ensure secure, containerized deployments using Docker and integrate SSO and role ... Preferred : • Experience with LangGraph or other LLM orchestration frameworks. • Familiarity ...
New
... and testing. • Ensure secure, containerized deployments using Docker and integrate SSO and role ... Preferred : • Experience with LangGraph or other LLM orchestration frameworks. • Familiarity ...
New
Solid penetration testing fundamentals and understanding of common attack chains. * Familiarity with AI/LLM security frameworks (e.g., OWASP Top 10 for LLMs, MITRE ATLAS). * Experience in a security ...
Solid penetration testing fundamentals and understanding of common attack chains. * Familiarity with AI/LLM security frameworks (e.g., OWASP Top 10 for LLMs, MITRE ATLAS). * Experience in a security ...
Product Solutions Architect - LLM Observability
Boston, MA · On-site
$151K - $222K/yr
Experience with LLM application evaluation and testing frameworks Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That ...
Product Solutions Architect - LLM Observability
Boston, MA · On-site
$151K - $222K/yr
Experience with LLM application evaluation and testing frameworks Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That ...
Applied AI Engineer
$125K - $155K/yr
Familiarity with prompt evaluation or LLM testing approaches * Experience integrating AI into production SaaS platforms * Experience working in fintech or B2B SaaS Benefits Why Join BrightPlan
Applied AI Engineer
$125K - $155K/yr
Familiarity with prompt evaluation or LLM testing approaches * Experience integrating AI into production SaaS platforms * Experience working in fintech or B2B SaaS Benefits Why Join BrightPlan
... Implement testing, evaluation, and monitoring frameworks for AI systems including hallucination detection and bias assessment Establish safety guardrails and responsible AI practices for LLM ...
... Implement testing, evaluation, and monitoring frameworks for AI systems including hallucination detection and bias assessment Establish safety guardrails and responsible AI practices for LLM ...
Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis. * Experience designing and running evaluations for LLM or NLP systems ...
Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis. * Experience designing and running evaluations for LLM or NLP systems ...
Cyber Security PenTester - GenAI & LLM
Charlotte, NC · Remote
$75 - $80/hr
Perform penetration testing on LLM and ML-based applications * Review and analyze complex multi-faceted larger scale or longer-term Information Security Analysis challenges. * Use Burp Suite heavily ...
Cyber Security PenTester - GenAI & LLM
Charlotte, NC · Remote
$75 - $80/hr
Perform penetration testing on LLM and ML-based applications * Review and analyze complex multi-faceted larger scale or longer-term Information Security Analysis challenges. * Use Burp Suite heavily ...
... testing. Technical Skills: Agent frameworks Lang Graph Semantic Kernel similar orchestration ... LLM outputs Key Responsibilities: 1. Agentic AI System Design Engineering Design and implement ...
Quick apply
... testing. Technical Skills: Agent frameworks Lang Graph Semantic Kernel similar orchestration ... LLM outputs Key Responsibilities: 1. Agentic AI System Design Engineering Design and implement ...
Applied Data Scientist, LLM Evaluation
Austin, TX · On-site +1
$175K - $275K/yr
Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis. * Experience designing and running evaluations for LLM or NLP systems ...
Applied Data Scientist, LLM Evaluation
Austin, TX · On-site +1
$175K - $275K/yr
Strong statistical foundations: experimental design, hypothesis testing, confidence intervals, effect sizes, power analysis. * Experience designing and running evaluations for LLM or NLP systems ...
Data Scientist AI
El Segundo, CA · Hybrid
Prior work with model evaluation benchmarks and large-scale LLM testing frameworks. * Experience with prompt fine-tuning techniques, LLMs optimization techniques(pre-training, post-training etc)
Data Scientist AI
El Segundo, CA · Hybrid
Prior work with model evaluation benchmarks and large-scale LLM testing frameworks. * Experience with prompt fine-tuning techniques, LLMs optimization techniques(pre-training, post-training etc)
Golang Developer with Devops/LLM exp Location: California (Remote) We are looking for devs with ... testing, release, and operations. * Build tooling and observability to monitor system health, and ...
Quick apply
Golang Developer with Devops/LLM exp Location: California (Remote) We are looking for devs with ... testing, release, and operations. * Build tooling and observability to monitor system health, and ...
Data Scientist AI
El Segundo, CA · On-site
Prior work with model evaluation benchmarks and large-scale LLM testing frameworks. * Experience with prompt fine-tuning techniques, LLMs optimization techniques(pre-training, post-training etc)
Data Scientist AI
El Segundo, CA · On-site
Prior work with model evaluation benchmarks and large-scale LLM testing frameworks. * Experience with prompt fine-tuning techniques, LLMs optimization techniques(pre-training, post-training etc)
Full Stack Developer (AI/LLM Focus) Location: 9509 key west drive Rockville, MD 20850 need local ... Implement automated testing using tools like Selenium, Cypress, or Playwright * Optimize ...
Quick apply
Full Stack Developer (AI/LLM Focus) Location: 9509 key west drive Rockville, MD 20850 need local ... Implement automated testing using tools like Selenium, Cypress, or Playwright * Optimize ...
ML OPS and LLM OPS (Architect)
Austin, TX · On-site
ML OPS and LLM OPS (Architect) Location: Remote Duration: Full-time Position: ML OPS and LLM OPS ... testing ML models, Git and version control, frameworks like Flask, FastAPI. - Hands-on experience ...
Quick apply
ML OPS and LLM OPS (Architect)
Austin, TX · On-site
ML OPS and LLM OPS (Architect) Location: Remote Duration: Full-time Position: ML OPS and LLM OPS ... testing ML models, Git and version control, frameworks like Flask, FastAPI. - Hands-on experience ...
Llm Testing information
See salary details
$11.78 - $16.22
4% of jobs
$16.22 - $20.65
3% of jobs
$20.65 - $25.09
7% of jobs
$25.09 - $29.52
8% of jobs
$30.82 is the 25th percentile. Wages below this are outliers.
$29.52 - $33.96
6% of jobs
$33.96 - $38.40
13% of jobs
The median wage is $40.15 / hr.
$38.40 - $42.83
20% of jobs
$47.01 is the 75th percentile. Wages above this are outliers.
$42.83 - $47.27
14% of jobs
$47.27 - $51.70
12% of jobs
$51.70 - $56.14
7% of jobs
$56.14 - $60.58
5% of jobs
$11
$39
$60
How much do llm testing jobs pay per hour?

Full-time
Posted 14 days ago
Job description
SilverEdge Government Solutions is seeking a highly skilled LLM Security Evaluation Expert to join our team. In this role, you will be responsible for rigorously testing the security and integrity of Large Language Models (LLMs). Your primary focus will be on designing and executing sophisticated adversarial prompt attacks to identify potential vulnerabilities, assess the model's resistance to exploitation, and ensure it maintains consistent, secure behavior. This is a critical role in safeguarding our AI systems and ensuring they operate responsibly.
Required Qualifications
- TS/SCI with Polygraph level Clearance
- Strong knowledge of how LLMs work, including their architecture, training processes, capabilities, and inherent limitations.
- Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common characteristics.
- Proven experience in crafting and refining prompts to elicit specific behaviors or bypass restrictions in LLMs.
- Demonstrable understanding of techniques like jailbreaking, prompt injection, role-playing attacks, and exploiting model biases.
- Strong understanding of cybersecurity principles and common attack vectors, particularly as they apply to AI/ML systems.
- Ability to think like an attacker and anticipate potential exploits.
- Excellent ability to analyze complex systems, identify subtle vulnerabilities, and systematically test hypotheses.
- Clear and concise written and verbal communication skills, with the ability to document technical findings thoroughly.
- Understanding of the ethical implications of AI security and commitment to responsible testing practices.
About SilverEdge
SilverEdge Government Solutions was founded on the belief that nurturing talent and collaborating closely with our customers enables us to think big and deliver the best for our country. Our mission is to bring top technology talent together to solve the world's most challenging problems while protecting the United States and our allies.SilverEdge Government Solutions, LLC is an Equal Opportunity Employer and applicants receive lawful consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.