Full-Time AI Tester Jobs (NOW HIRING)

AI Testing Tools: Familiarity with AI evaluation frameworks such as LangSmith, DeepEval, RAGAS, or ... full time employees. This position is not available for independent contractors No applications ...

Penetration Tester

Herndon, VA · On-site

$86K - $198K/yr

Knowledge of tools, tactics, and techniques targeting Artificial Intelligence (AI) systems and ... Full-time and part-time employees working at least 20 hours a week on a regular basis are eligible ...

AI Quality Engineer

Alpharetta, GA · On-site

$70K - $90K/yr

Employ automated testing tools for RESTful services connected to marketplace integrations ... Coffee bar with cold brew on tap and a full time barista * Standing desk (if you're into that sort ...

Washington, DC or Chandler, AZ Terms: Full-time Clearance: Active Secret Required Travel: 0-20 ... Experience with AI/ML system security testing or emerging attack surfaces * Active TS/SCI clearance ...

Full-Time AI Tester information

Hourly pay scale (from salary chart): $10, $38, $62

How much do full-time AI tester jobs pay per hour?

As of Jun 17, 2026, the average hourly pay for a full-time AI tester in the United States is $38.36, according to ZipRecruiter salary data. Most workers in this role earn between $21.39 and $50.72 per hour, depending on experience, location, and employer.

What are some common challenges faced by Full Time AI Testers when validating machine learning models?

Full Time AI Testers often encounter challenges related to the complexity and unpredictability of AI systems, such as ensuring models perform accurately across diverse real-world data and edge cases. Testing for bias, fairness, and explainability can also be demanding, as these require specialized tools and a deep understanding of both the data and the algorithms. Additionally, AI Testers must frequently collaborate with data scientists and developers to clarify requirements and reproduce issues, making strong communication skills essential. Keeping up with rapidly evolving AI technologies and best practices is another important aspect of the role.

What is a $900,000 AI job?

A $900,000 AI job typically refers to high-level roles in artificial intelligence, such as senior AI researchers, machine learning directors, or AI executives, often requiring advanced skills, extensive experience, and sometimes specialized certifications. These positions usually involve leadership, strategic planning, and development of cutting-edge AI technologies, and they tend to be found in large tech companies or innovative startups. Compensation at this level reflects the complexity and impact of the work, as well as the value placed on AI expertise in the industry.

What are the key skills and qualifications needed to thrive as a Full Time AI Tester, and why are they important?

Thriving as a Full Time AI Tester requires a solid understanding of software testing methodologies, programming fundamentals, and experience with AI/ML concepts, usually supported by a degree in computer science or a related field. Familiarity with test automation tools (like Selenium or PyTest), version control systems (such as Git), and platforms for AI model deployment is typically necessary. Strong analytical thinking, attention to detail, and effective communication help testers identify issues and collaborate with development teams. These competencies ensure the reliability, accuracy, and fairness of AI systems, which is critical for their safe and effective deployment.

What job makes $10,000 a month without a degree?

A full-time AI tester can potentially earn $10,000 a month through freelance or contract work, especially with specialized skills in AI tools, programming, and data analysis. High earnings often depend on experience, reputation, and the complexity of projects, rather than formal degrees alone.

How do I become an AI tester?

To become an AI tester, you typically need a background in computer science, software testing, or related fields, along with knowledge of machine learning and AI concepts. Skills in programming languages like Python, experience with data annotation, and familiarity with testing tools are important. Earning certifications in software testing or AI can also enhance your qualifications.

How much do AI testers get paid?

AI testers in full-time roles typically earn between $50,000 and $100,000 annually, depending on experience, location, and company size. Entry-level positions may start lower, while experienced testers with specialized skills can earn higher salaries, often with benefits and opportunities for advancement.

What does a Full Time AI Tester do?

A Full Time AI Tester is responsible for evaluating and validating artificial intelligence systems to ensure they function as intended. Their duties include designing and executing test cases, identifying bugs or issues, and collaborating with developers to improve AI models. They may also assess the fairness, accuracy, and reliability of AI algorithms, and ensure compliance with ethical standards. This role typically requires strong analytical skills, attention to detail, and familiarity with machine learning concepts.
More about Full-Time AI Tester jobs
Infographic showing Full-Time AI Tester job openings in the United States as of June 2026. Employment types: 3% internship, 5% full time, 2% part time, and 90% contract. Work arrangement: 59% physical, 1% hybrid, and 40% remote. Average salary: $79,791 per year, or $38.4 per hour.
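The annual and hourly averages quoted above line up if a full-time year is taken to be 2,080 hours (40 hours a week for 52 weeks). The page does not say which divisor it uses, so the constant below is an assumption, and the function names are illustrative only. A minimal sketch of the conversion:

```python
# Assumption: a full-time year is 40 hours/week * 52 weeks = 2,080 hours.
# The source page does not state its divisor; this is a common convention.
HOURS_PER_YEAR = 40 * 52


def annual_to_hourly(annual_salary: float) -> float:
    """Convert an annual salary to an hourly rate under the 2,080-hour assumption."""
    return annual_salary / HOURS_PER_YEAR


def hourly_to_annual(hourly_rate: float) -> float:
    """Convert an hourly rate to an annual salary under the same assumption."""
    return hourly_rate * HOURS_PER_YEAR


if __name__ == "__main__":
    # Average annual salary quoted in the infographic.
    print(f"Average hourly rate: ${annual_to_hourly(79_791):.2f}")

    # Typical hourly range quoted in the salary FAQ, annualized.
    low = hourly_to_annual(21.39)
    high = hourly_to_annual(50.72)
    print(f"Typical annual range: ${low:,.0f} to ${high:,.0f}")
```

Under this assumption, $79,791 per year works out to about $38.36 per hour, which matches the quoted hourly average, and the typical hourly range of $21.39 to $50.72 corresponds to roughly $44,500 to $105,500 per year.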

Test Engineer-AI/LLM

OPPO US Research Center

Palo Alto, CA • On-site

Full-time

Posted 21 days ago


Job description

OPPO US Research Center is seeking a meticulous and innovative full-time AI/LLM Test Engineer to join our cutting-edge AI team. In this critical role, you will evaluate the performance, reliability, and safety of Large Language Models (LLMs) in real-world product scenarios and test end-to-end generative AI solutions. Your work will directly shape how users experience AI-powered features by ensuring robustness, accuracy, and alignment with product goals. This is a unique opportunity to pioneer testing methodologies for next-generation AI systems at the forefront of technology.

We are also seeking a contract-based LLM Evaluation & QA Engineer to support the testing and validation of large language model (LLM)-powered applications. You will help implement test strategies, execute evaluation workflows, and assist in model performance validation across diverse generative AI use cases.

This contract role is ideal for someone with hands-on experience in AI/ML evaluation, QA engineering, or data analysis who wants to deepen their exposure to generative AI systems.

Requirements

Full-time position requirements:

Core Testing & Evaluation

  • Design and execute performance tests for LLMs across diverse product use cases (e.g., chatbots, content generation).
  • Develop automated test frameworks to evaluate LLM outputs for accuracy, bias, safety, and coherence.
  • Conduct end-to-end testing of integrated generative AI solutions, including APIs, data pipelines, and user interfaces.

Optimization & Validation

  • Collaborate with ML engineers to validate fine-tuned models and optimize prompts for target scenarios.
  • Analyze model failures, edge cases, and adversarial inputs to identify risks and improvement areas.
  • Benchmark LLM performance against industry standards and product-specific KPIs.

Collaboration & Quality Assurance

  • Partner with product, engineering, and research teams to define test requirements and acceptance criteria.
  • Document defects, performance metrics, and test results to drive data-driven improvements.
  • Advocate for AI ethics and safety through rigorous testing of fairness, bias mitigation, and content moderation.

Innovation & Tooling

  • Build scalable tools for synthetic test data generation, prompt variation testing, and automated evaluation workflows.
  • Stay current with advancements in generative AI testing, including red-teaming techniques and evaluation frameworks (e.g., HELM, Dynabench).
  • Propose novel testing strategies for emerging challenges (e.g., hallucinations, context drift).

Basic Qualifications:

  • Bachelor’s degree in Computer Science, Data Science, Engineering, or a related technical field, or equivalent practical experience.
  • 1+ years of experience in software testing, data science, or ML validation, with exposure to AI/ML systems.
  • Proficiency in Python and testing frameworks (e.g., PyTest, Selenium).
  • Hands-on experience evaluating LLMs in production environments (e.g., GPT, Claude, Llama, Gemini).
  • Strong analytical skills for dissecting model behavior, statistical performance, and failure modes.
  • Familiarity with cloud platforms (GCP, Azure, or AWS) and MLOps tooling (e.g., MLflow, Weights & Biases).
  • Experience with version control (Git) and agile development methodologies.

Preferred Qualifications:

  • Master’s degree in AI, Machine Learning, or a related field.
  • Expertise in prompt engineering, LLM fine-tuning (e.g., LoRA, RLHF), or optimization techniques.
  • Experience with automated evaluation tools (e.g., LangChain, TruLens) or LLM-specific test suites.
  • Knowledge of data pipelines, SQL/NoSQL databases, and API testing (e.g., Postman).
  • Background in statistics, quantitative analysis, or data visualization for test insights.
  • Contributions to AI safety/ethics initiatives or open-source LLM evaluation projects.
  • Experience testing mobile-integrated AI solutions (Android/iOS).

Contractor position requirements:

Testing & Evaluation Support:

  • Execute pre-defined performance tests for LLMs across various tasks (e.g., summarization, Q&A, chatbot flows).
  • Run scripted evaluations to assess outputs for factuality, coherence, and safety.
  • Perform manual and automated test execution on APIs and LLM-integrated user interfaces.

Prompt & Model Validation:

  • Assist ML engineers in evaluating prompt variations and prompt-tuning outcomes.
  • Log and analyze failure cases, anomalies, and edge cases based on provided guidelines.

Collaboration & Documentation:

  • Work with QA leads, product managers, and ML engineers to understand test goals and criteria.
  • Report defects, compile evaluation summaries, and maintain testing logs.

Tooling & Automation:

  • Use existing internal tools or frameworks to automate test runs and result collection.
  • Contribute to prompt generation, input templating, or result tagging processes.

Basic Qualifications:

  • Bachelor's degree or equivalent work experience in a technical field (e.g., Computer Science, Engineering, Data Science).
  • 6+ months experience in software QA, data labeling, LLM evaluation, or ML testing projects.
  • Basic Python proficiency, especially for data processing and automation tasks.
  • Familiarity with LLMs (e.g., GPT, Claude, Gemini) and prompt-based outputs.
  • Comfortable working with tools like Jupyter, Postman, or testing dashboards.
  • Detail-oriented with good documentation habits.

Contractor Details:

  • Duration: Long term
  • Rate: Commensurate with experience
  • Conversion Opportunity: High-performing contractors may be considered for full-time roles

Benefits

OPPO is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

The US base salary range for this full-time position is $100,000-$200,000, plus bonus, long-term incentives, and benefits. Our salary ranges are determined by role, level, and location.