You'll write eval sets against both, and you'll validate the surfaces our users actually touch: iOS ... Drive contract testing across LLM providers (Bedrock, Anthropic, OpenAI) to catch parity drift. CI ...
You'll write eval sets against both, and you'll validate the surfaces our users actually touch: iOS ... Drive contract testing across LLM providers (Bedrock, Anthropic, OpenAI) to catch parity drift. CI ...
Manages vendor relationships, service contracts, and provides budgetary input for equipment ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
New
Manages vendor relationships, service contracts, and provides budgetary input for equipment ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
New
What You'll Do Contract Processing & Validation * Review and process a high volume of contracts to ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
What You'll Do Contract Processing & Validation * Review and process a high volume of contracts to ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
Financial reporting AI Expert
$100 - $120/hr
... Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a detail-oriented and ... A core part of your role will involve preparing, reviewing, and validating journal entries ...
Financial reporting AI Expert
$100 - $120/hr
... Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a detail-oriented and ... A core part of your role will involve preparing, reviewing, and validating journal entries ...
What You'll Do Contract Processing & Validation * Review and process a high volume of contracts to ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
What You'll Do Contract Processing & Validation * Review and process a high volume of contracts to ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
What You'll Do Contract Processing & Validation * Review and process a high volume of contracts to ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
Quick apply
What You'll Do Contract Processing & Validation * Review and process a high volume of contracts to ... We may use artificial intelligence (AI) tools to support parts of the hiring process, such as ...
Insurance Policy Administration AI Expert
$100 - $120/hr
... Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a skilled Insurance ... Build intelligent systems for document processing, policy validation, and exception handling using ...
Insurance Policy Administration AI Expert
$100 - $120/hr
... Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a skilled Insurance ... Build intelligent systems for document processing, policy validation, and exception handling using ...
Strategic finance AI Expert
$100 - $120/hr
... Part-time / Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a ... Collaborate with cross-functional teams (product, operations, data science) to validate assumptions ...
Strategic finance AI Expert
$100 - $120/hr
... Part-time / Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a ... Collaborate with cross-functional teams (product, operations, data science) to validate assumptions ...
Senior Systems Validation & Integration Engineer - Modular Data Centers [Hybrid/Houston, TX -USA]
Houston, TX · On-site
ASAP Type of Contract: Full Time / Permanent Travel Requirements: 10% About Submer Submer designs ... We help organizations scale AI beyond the limits of traditional datacenters by enabling higher ...
Senior Systems Validation & Integration Engineer - Modular Data Centers [Hybrid/Houston, TX -USA]
Houston, TX · On-site
ASAP Type of Contract: Full Time / Permanent Travel Requirements: 10% About Submer Submer designs ... We help organizations scale AI beyond the limits of traditional datacenters by enabling higher ...
AI Quality Infrastructure Engineer
Mountain View, CA · On-site
$126K - $166K/yr
MTV, CA or San Diego, CA or NYC, NY or Remote Job Duration: Long Term Contract Overview: As an AI ... support tool call validations, and the "LLM-as-a-judge" scoring engines. * Build Production ...
Quick apply
AI Quality Infrastructure Engineer
Mountain View, CA · On-site
$126K - $166K/yr
MTV, CA or San Diego, CA or NYC, NY or Remote Job Duration: Long Term Contract Overview: As an AI ... support tool call validations, and the "LLM-as-a-judge" scoring engines. * Build Production ...
Financial modeling AI Expert
$100 - $120/hr
... Part-time / Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a ... validate model assumptions. * Create dashboards, reports, and visualizations that communicate ...
Financial modeling AI Expert
$100 - $120/hr
... Part-time / Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a ... validate model assumptions. * Create dashboards, reports, and visualizations that communicate ...
Capital allocation AI Expert
$100 - $120/hr
... Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a Capital Allocation AI ... Partner with finance, product, engineering, and leadership teams to gather inputs, validate ...
Capital allocation AI Expert
$100 - $120/hr
... Contract Location: US, UK, Canada, France, Portugal (remote) We are seeking a Capital Allocation AI ... Partner with finance, product, engineering, and leadership teams to gather inputs, validate ...
Contract Administrator
New Johnsonville, TN · On-site
In key sectors such as clean energy, advanced electronics, high-performance computing and AI ... Validate contractor incident reports and lead or support investigations of all contractor-related ...
Contract Administrator
New Johnsonville, TN · On-site
In key sectors such as clean energy, advanced electronics, high-performance computing and AI ... Validate contractor incident reports and lead or support investigations of all contractor-related ...
Contract Role Summary We are looking for an AI Native Development Architect to design and guide the ... validation/testing). Preferred Qualifications * Experience designing GenAI solutions (RAG, tool ...
Quick apply
Contract Role Summary We are looking for an AI Native Development Architect to design and guide the ... validation/testing). Preferred Qualifications * Experience designing GenAI solutions (RAG, tool ...
Manager of Contract Compliance
Reston, VA · Hybrid
$100K - $150K/yr
Partner with finance to validate contract terms related to revenue recognition, billing and ... AI, and Enterprise Applications. These capabilities are backed by Gridiron IT's experienced ...
Manager of Contract Compliance
Reston, VA · Hybrid
$100K - $150K/yr
Partner with finance to validate contract terms related to revenue recognition, billing and ... AI, and Enterprise Applications. These capabilities are backed by Gridiron IT's experienced ...
AI Engineer - Decision & Optimization Systems
El Segundo, CA · On-site
$80K - $210K/yr
We build AI systems that determine how logistics decisions are made - not just how they're executed ... Define and enforce data contracts, normalization, and validation pipelines. * Build robustness ...
AI Engineer - Decision & Optimization Systems
El Segundo, CA · On-site
$80K - $210K/yr
We build AI systems that determine how logistics decisions are made - not just how they're executed ... Define and enforce data contracts, normalization, and validation pipelines. * Build robustness ...
Technical Product Manager (AI & Systems)
$150K - $180K/yr
... contracts, and performance, reliability, and scalability considerations * Partner closely with AI ... Own deployment readiness and launch execution, including validation in customer environments, risk ...
Technical Product Manager (AI & Systems)
$150K - $180K/yr
... contracts, and performance, reliability, and scalability considerations * Partner closely with AI ... Own deployment readiness and launch execution, including validation in customer environments, risk ...
About Us Nucs AI is a pioneering MedTech startup focused on transforming prostate cancer care ... As we scale, we are strengthening our clinical annotation and validation efforts to ensure our ...
About Us Nucs AI is a pioneering MedTech startup focused on transforming prostate cancer care ... As we scale, we are strengthening our clinical annotation and validation efforts to ensure our ...
New Venture Advisor, Agentic AI Systems (LG NOVA) *part-time contract*
Santa Clara, CA · Remote
$150 - $250/hr
... the creation and validation of new venture opportunities. The role focuses on AI agent ... Contract Rate of Pay: this is a part-time contract position. Depending on the experience and ...
New Venture Advisor, Agentic AI Systems (LG NOVA) *part-time contract*
Santa Clara, CA · Remote
$150 - $250/hr
... the creation and validation of new venture opportunities. The role focuses on AI agent ... Contract Rate of Pay: this is a part-time contract position. Depending on the experience and ...
Contract Administrator
New Johnsonville, TN · On-site
In key sectors such as clean energy, advanced electronics, high-performance computing and AI ... Validate contractor incident reports and lead or support investigations of all contractor-related ...
Contract Administrator
New Johnsonville, TN · On-site
In key sectors such as clean energy, advanced electronics, high-performance computing and AI ... Validate contractor incident reports and lead or support investigations of all contractor-related ...
Contract Ai Validation information
See salary details
$22.60 - $27.64
2% of jobs
$27.64 - $32.69
6% of jobs
$32.69 - $37.74
13% of jobs
$39.32 is the 25th percentile. Wages below this are outliers.
$37.74 - $42.79
13% of jobs
$42.79 - $47.84
11% of jobs
The median wage is $50.36 / hr.
$47.84 - $52.88
12% of jobs
$52.88 - $57.93
9% of jobs
$61.82 is the 75th percentile. Wages above this are outliers.
$57.93 - $62.98
13% of jobs
$62.98 - $68.03
13% of jobs
$68.03 - $73.08
6% of jobs
$73.08 - $78.13
3% of jobs
$22
$51
$78
How much do contract ai validation jobs pay per hour?
What is the difference between Contract Ai Validation vs Data Analyst?
| Aspect | Contract Ai Validation | Data Analyst |
|---|---|---|
| Required Credentials | Typically certifications in AI, machine learning, or data science; sometimes a degree in computer science or related fields | Bachelor's or master's in statistics, data science, or related fields; certifications like Microsoft Data Analyst or Tableau are common |
| Work Environment | Project-based, often remote, focused on AI model validation and testing | Office or remote, analyzing data sets, creating reports, and providing insights |
| Industry Usage | Used in AI development, tech companies, and industries deploying AI solutions | Used across finance, healthcare, marketing, and other sectors for data-driven decision making |
Contract Ai Validation professionals focus on testing and validating AI models, ensuring accuracy and compliance, often requiring AI-specific certifications. Data Analysts interpret data to inform business decisions, using statistical tools and data visualization. While both roles handle data, Contract Ai Validation is specialized in AI model validation, whereas Data Analysts work broadly with data analysis and reporting.

$137K - $181K/yr
Full-time
Posted 3 days ago
Job description
The Organization Â
At TWG Group Holdings, LLC ("TWG Global"), we drive innovation and business transformation across a range of industries-including financial services, insurance, technology, media, and sports-by leveraging data and AI as core assets. Our AI-first, cloud-native approach delivers real-time intelligence and interactive business applications, empowering informed decision-making for both customers and employees.Â
We prioritize responsible data and AI practices, ensuring ethical standards and regulatory compliance. Our decentralized structure enables each business unit to operate autonomously, supported by a central AI Solutions Group, while strategic partnerships with leading data and AI vendors fuel game-changing efforts in marketing, operations, and product development.Â
You will collaborate with management to advance our data and analytics transformation, enhance productivity, and enable agile, data-driven decisions. By leveraging relationships with top tech startups and universities, you will help create competitive advantages and drive enterprise innovation.Â
At TWG Global, your contributions will support our goal of sustained growth and superior returns, as we deliver rare value and impact across our businesses.Â
The RoleÂ
TWG Global is seeking a Senior or Staff AI Software Engineer in Test to join our AI Engineering team building commercial-grade AI products. This is a software engineering role focused on test automation. You won't just write test cases, you'll design and build the frameworks, harnesses, evaluation infrastructure, and tooling that make testing AI agents and LLM-powered applications possible at scale.Â
Our agents are written in LangGraph and run on Azure on the TWG side, with a parallel Vercel-based stack on the Palantir side. You'll write eval sets against both, and you'll validate the surfaces our users actually touch: iOS apps, plugins, and Chrome extensions, not just the model layer.Â
You'll work shoulder-to-shoulder with AI engineers and data scientists, contributing production-quality code to shared repositories. The ideal candidate is a strong coder, fluent in Python and Java - who has shipped automated test infrastructure in a production environment and has hands-on experience evaluating LLM and agentic systems.Â
Key Responsibilities:Â
Framework and harness engineeringÂ
- Design and build scalable, reusable test automation frameworks for AI agents, LLM-powered applications, and underlying APIs.Â
- Write clean, maintainable Python for test harnesses, eval pipelines, synthetic data generation utilities, and internal tooling.Â
- Treat test code as production code: code review, type hints, documentation, library design.Â
Evaluation infrastructureÂ
- Build evaluation infrastructure for benchmarking agent performance against SOTA LLMs, competitors, and internal baselines.Â
- Own regression suites, golden datasets, rubric-based evals, and metric dashboards.Â
- Build tooling for synthetic test data generation, edge-case discovery, and adversarial testing.Â
Resilience and loadÂ
- Design and run release, system, performance, and load tests against streaming, stateful, and async systems.Â
- Build chaos and fault injection tooling for token expiry, connection pool exhaustion, provider failover, and cache pressure scenarios.Â
- Drive contract testing across LLM providers (Bedrock, Anthropic, OpenAI) to catch parity drift.Â
CI/CD and observabilityÂ
- Integrate automated tests into CI/CD so every model, prompt, and code change is validated before it ships.Â
- Build trace-based assertions on LangGraph state, tool calls, and agent decisions - debugging an agent failure means replaying graph state, not re-running a prompt.Â
- Make observability a first-class testing surface (LangSmith, audit logs).Â
Human-in-the-loop and partnershipÂ
- Implement HIL review workflows where automation alone cannot validate quality, then push the automation boundary outward.Â
- Partner with AI engineers and data scientists on model evaluation, training and eval data prep, and root-cause debugging of complex end-to-end failures.Â
- Champion quality engineering practices across the team: code review, coverage standards, observability, reproducibility.Â
- Ensure user-centric validation so AI outputs are accurate, reliable, and meet real-world application needs.Â
Requirements
Qualifications:
- 3-7 years of software engineering experience, with a meaningful portion focused on test automation, SDET, or software engineering in test roles.Â
- Expert-level Python. You write Python every day, design libraries other engineers use, and apply OOP and clean-code practices.Â
- Hands-on Java experience, enough to read, write, and test Java services, not just touch them.Â
- Working understanding of the LangGraph or Vercel frameworks: graph state, nodes, edges, tool calls, and how to write evals against agentic flows.Â
- Demonstrated experience building eval sets for LLM models (this is critical to the role).Â
- Experience testing across multiple client surfaces: iOS apps, plugins, and Chrome extensions.Â
- Hands-on experience building automated test suites with frameworks such as pytest, Selenium, Playwright, Cypress, or similar.Â
- Proven experience integrating test automation into CI/CD systems (GitHub Actions, Jenkins, CircleCI, GitLab CI, or similar).Â
- Strong skills in data manipulation, test data preparation, and SQL.Â
- Bachelor's degree or higher in Computer Science, Engineering, or a related field.Â
Preferred Qualifications:Â
- Experience with Azure (our primary cloud) and containerization (Docker).Â
- Experience testing RAG pipelines, agentic workflows, or multi-step tool-calling systems.
Benefits
Position Location:
This position is located in Santa Monica, CA (on-site).Â
Compensation:
The base pay for this position is $190,000-250,000. A bonus will be provided as part of the compensation package, in addition to a full range of medical, financial, and/or other benefits.
TWG is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.