Machine Learning Engineer, ML/GenAI Evaluation Work Locations (3) Submit Resume Would you like to ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
Machine Learning Engineer, ML/GenAI Evaluation Work Locations (3) Submit Resume Would you like to ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
Director, AI & Machine Learning
Flower Mound, TX · On-site
$220K - $240K/yr
Oversee the design, development, testing, and deployment of AI and automation solutions ... Advanced knowledge of artificial intelligence, machine learning, and generative AI technologies.
Director, AI & Machine Learning
Flower Mound, TX · On-site
$220K - $240K/yr
Oversee the design, development, testing, and deployment of AI and automation solutions ... Advanced knowledge of artificial intelligence, machine learning, and generative AI technologies.
Lead Machine Learning Engineer
Plano, TX · On-site +1
$98K - $129K/yr
Lead Machine Learning Engineer As a Capital One Machine Learning Engineer (MLE), you'll be part of ... Solve complex problems by writing and testing application code, developing and validating ML models ...
Lead Machine Learning Engineer
Plano, TX · On-site +1
$98K - $129K/yr
Lead Machine Learning Engineer As a Capital One Machine Learning Engineer (MLE), you'll be part of ... Solve complex problems by writing and testing application code, developing and validating ML models ...
Eurofins is dedicated to delivering testing services that contribute to the health and safety of ... We are seeking a Machine Learning (ML) Expert with experience in chemical data processing to drive ...
Eurofins is dedicated to delivering testing services that contribute to the health and safety of ... We are seeking a Machine Learning (ML) Expert with experience in chemical data processing to drive ...
Senior Machine Learning Engineer
Austin, TX · On-site
$121K - $160K/yr
... and testing workflows. Bachelor's degree in Computer Science, Statistics, Mathematics with equivalent experience.5+ years of related experience building high throughput scalable applications or ...
Senior Machine Learning Engineer
Austin, TX · On-site
$121K - $160K/yr
... and testing workflows. Bachelor's degree in Computer Science, Statistics, Mathematics with equivalent experience.5+ years of related experience building high throughput scalable applications or ...
Senior Machine Learning Engineer
Austin, TX · On-site
$220K - $250K/yr
As a Senior Machine Learning Engineer, you'll own impactful problems end-to-end-from data ... Contribute to experimentation frameworks, including A/B testing and offline evaluation, to iterate ...
Senior Machine Learning Engineer
Austin, TX · On-site
$220K - $250K/yr
As a Senior Machine Learning Engineer, you'll own impactful problems end-to-end-from data ... Contribute to experimentation frameworks, including A/B testing and offline evaluation, to iterate ...
Senior Machine Learning Engineer
$220K - $250K/yr
As a Senior Machine Learning Engineer, you'll own impactful problems end-to-end-from data ... Contribute to experimentation frameworks, including A/B testing and offline evaluation, to iterate ...
Senior Machine Learning Engineer
$220K - $250K/yr
As a Senior Machine Learning Engineer, you'll own impactful problems end-to-end-from data ... Contribute to experimentation frameworks, including A/B testing and offline evaluation, to iterate ...
Computer Vision & Machine Learning Engineer
Austin, TX · On-site
$110K - $130K/yr
The Computer Vision & Machine Learning Engineer will be responsible for developing and optimizing ... extensive testing and validation of computer vision algorithms in various scenarios to ensure ...
Computer Vision & Machine Learning Engineer
Austin, TX · On-site
$110K - $130K/yr
The Computer Vision & Machine Learning Engineer will be responsible for developing and optimizing ... extensive testing and validation of computer vision algorithms in various scenarios to ensure ...
Onsite :: SDET - Software Engineer :: Austin, Texas :: Contract
Austin, TX · On-site
$49.50 - $64/hr
The ideal candidate will possess deep expertise in Java, Python, and Splunk, with a solid foundation in machine learning testing strategies. You'll work closely with developers, QA engineers, and ...
Quick apply
Onsite :: SDET - Software Engineer :: Austin, Texas :: Contract
Austin, TX · On-site
$49.50 - $64/hr
The ideal candidate will possess deep expertise in Java, Python, and Splunk, with a solid foundation in machine learning testing strategies. You'll work closely with developers, QA engineers, and ...
Computer Vision & Machine Learning Engineer
Austin, TX · On-site
$110K - $130K/yr
The Computer Vision & Machine Learning Engineer will work on developing and optimizing algorithms ... extensive testing and validation of computer vision algorithms in various scenarios to ensure ...
Computer Vision & Machine Learning Engineer
Austin, TX · On-site
$110K - $130K/yr
The Computer Vision & Machine Learning Engineer will work on developing and optimizing algorithms ... extensive testing and validation of computer vision algorithms in various scenarios to ensure ...
Senior Machine Learning Engineer
Austin, TX · On-site
$220K - $250K/yr
As a Senior Machine Learning Engineer, you'll own impactful problems end-to-end-from data ... Contribute to experimentation frameworks, including A/B testing and offline evaluation, to iterate ...
Senior Machine Learning Engineer
Austin, TX · On-site
$220K - $250K/yr
As a Senior Machine Learning Engineer, you'll own impactful problems end-to-end-from data ... Contribute to experimentation frameworks, including A/B testing and offline evaluation, to iterate ...
Senior Machine Learning Engineer
Houston, TX · On-site
$117K - $154K/yr
... testing, and code review * 5+ years of industry experience developing and deploying machine learning or statistical models, with a proven track record of delivering end-to-end solutions in production ...
Senior Machine Learning Engineer
Houston, TX · On-site
$117K - $154K/yr
... testing, and code review * 5+ years of industry experience developing and deploying machine learning or statistical models, with a proven track record of delivering end-to-end solutions in production ...
Machine Learning Engineer, ML/GenAI Evaluation
Austin, TX · On-site
$171K - $302K/yr
As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
Machine Learning Engineer, ML/GenAI Evaluation
Austin, TX · On-site
$171K - $302K/yr
As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
As a Machine Learning Engineering Manager, you will onboard and oversee junior scientists and ... Improve existing Agile, ML, and A/B testing processes and develop new ones. * Scope projects ...
As a Machine Learning Engineering Manager, you will onboard and oversee junior scientists and ... Improve existing Agile, ML, and A/B testing processes and develop new ones. * Scope projects ...
Senior Machine Learning Engineer
Houston, TX · On-site
$117K - $154K/yr
... testing, and code review * 5+ years of industry experience developing and deploying machine learning or statistical models, with a proven track record of delivering end-to-end solutions in production ...
Quick apply
Senior Machine Learning Engineer
Houston, TX · On-site
$117K - $154K/yr
... testing, and code review * 5+ years of industry experience developing and deploying machine learning or statistical models, with a proven track record of delivering end-to-end solutions in production ...
Machine Learning Engineer, ML/GenAI Evaluation
Austin, TX · On-site
$171K - $302K/yr
As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
Machine Learning Engineer, ML/GenAI Evaluation
Austin, TX · On-site
$171K - $302K/yr
As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
Senior Machine Learning Engineer
$117K - $154K/yr
... testing, and code review * 5+ years of industry experience developing and deploying machine learning or statistical models, with a proven track record of delivering end-to-end solutions in production ...
Senior Machine Learning Engineer
$117K - $154K/yr
... testing, and code review * 5+ years of industry experience developing and deploying machine learning or statistical models, with a proven track record of delivering end-to-end solutions in production ...
Machine Learning Engineer, ML/GenAI Evaluation
Austin, TX · On-site
$171K - $302K/yr
As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
Machine Learning Engineer, ML/GenAI Evaluation
Austin, TX · On-site
$171K - $302K/yr
As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation ... Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution ...
... extensive testing and validation of computer vision algorithms in various scenarios to ensure ... Required : • Deep passion for machine learning, computer vision, and robotics, and have been ...
... extensive testing and validation of computer vision algorithms in various scenarios to ensure ... Required : • Deep passion for machine learning, computer vision, and robotics, and have been ...
Build and deploy end-to-end AI solutions including data ingestion, model development, testing, and ... Experience developing and deploying machine learning and deep learning models * Knowledge of end-to ...
Build and deploy end-to-end AI solutions including data ingestion, model development, testing, and ... Experience developing and deploying machine learning and deep learning models * Knowledge of end-to ...
Machine Learning Testing information
See Texas salary details
$12.99 - $14.43
3% of jobs
$14.43 - $15.88
7% of jobs
$15.88 - $17.33
6% of jobs
$18.41 is the 25th percentile. Wages below this are outliers.
$17.33 - $18.77
11% of jobs
$18.77 - $20.22
13% of jobs
The median wage is $20.98 / hr.
$20.22 - $21.66
18% of jobs
$21.66 - $23.11
16% of jobs
$23.20 is the 75th percentile. Wages above this are outliers.
$23.11 - $24.55
9% of jobs
$24.55 - $26
6% of jobs
$26 - $27.44
7% of jobs
$27.44 - $28.89
3% of jobs
$12
$21
$28
How much do machine learning testing jobs pay per hour?
Is ML a high paying job?
What jobs pay $2000 a day?
What are the key skills and qualifications needed to thrive in the Machine Learning Testing position, and why are they important?
To excel in Machine Learning Testing, you need a solid understanding of machine learning concepts, data analysis, and programming skills in languages like Python, as well as a background in quality assurance or software testing. Familiarity with frameworks such as TensorFlow, PyTorch, automated testing tools, and relevant certifications like ISTQB are highly beneficial. Strong attention to detail, analytical thinking, and effective communication skills help testers identify issues and collaborate with data scientists and developers. These competencies are essential to ensure the reliability, fairness, and accuracy of machine learning models deployed in production environments.
What are the typical challenges faced by professionals in Machine Learning Testing roles?
Professionals in Machine Learning Testing often encounter challenges such as dealing with non-deterministic model outputs, insufficient or imbalanced datasets, and unclear or evolving testing criteria. They may need to work closely with data scientists and engineers to develop robust test cases and validation methods tailored for dynamic machine learning systems. Staying updated on advancements in testing methodologies and tools is also important, as the field evolves rapidly. Successfully overcoming these challenges leads to higher quality models and more reliable AI solutions for end users.
How much do AI testers get paid?
What is a Machine Learning Testing job?
A Machine Learning Testing job involves evaluating and validating machine learning models to ensure they function correctly, efficiently, and ethically. This includes testing for accuracy, reliability, bias, and performance under different conditions. Professionals in this role employ techniques such as unit testing, integration testing, data validation, and model performance monitoring. They also work closely with data scientists and engineers to debug issues and improve model robustness. The goal is to ensure that machine learning systems perform as expected and meet business or regulatory requirements.
What is a $900000 AI job?
- Machine Learning Architect
- Scientific Machine Learning
- Machine Learning Data Engineer
- Deep Learning Scientist
- Temporary Machine Learning Scientist
- Contract Applied Scientist Machine Learning
- Machine Learning Research Scientist
- Senior Applied Scientist Machine Learning
- Data Science For Social Good
- Part Time Machine Learning Research Scientist
- Remote Director Machine Learning
- Freelance Google Machine Learning Engineer
- Learning Disability
- Mlflow
- No Experience Nvidia Machine Learning
- Remote Machine Learning
- Machine Learning Manager
- Freelance Artificial Intelligence Machine Learning
- Intern Data Scientist Machine Learning
- Machine Learning Biomedical Engineer
Apple rating
8.1
Based on 661 frontline employees who took The Breakroom Quiz
6th of 30 rated technology retailers
Job description
Work Locations (3) Submit Resume
Would you like to contribute to Machine Learning and Generative AI technologies? Are you passionate about measuring what matters and ensuring AI systems work reliably for everyone? Do you believe that rigorous evaluation — including holding models accountable to fairness standards — is what separates great ML from good ML? We truly believe it is! We are defining what exceptional looks like for machine learning across Wallet, Payments, and Commerce. As a Machine Learning Engineer specializing in Evaluation, you will establish the evaluation criteria, metrics frameworks, and quality standards that determine when models are ready to reach hundreds of millions of users. Your judgment shapes model quality and earns the confidence to ship. You'll work at the intersection of rigorous ML science and high-impact product decisions, collaborating closely with ML Engineering, Product, Privacy, and Legal teams. This unique opportunity puts you at the center of model quality — designing adversarial test strategies, surfacing failure modes before they reach users, and owning the sign-off process that ensures Apple's financial features meet the highest bar for accuracy, robustness, and reliability.
Responsibilities- Define evaluation criteria and quality metrics for ML models powering Wallet features
- Design and maintain structured test sets covering the full diversity of real-world scenarios — varied document formats, distributions, languages, edge cases, and adversarial inputs.
- Develop evaluation methodologies for robustness testing: distribution shift, out-of-distribution generalization, temporal drift, and aggressor scenarios
- Own fairness evaluation end-to-end — define fairness metrics appropriate to each Wallet feature, build bias test suites across protected attributes and user populations, measure disparate performance across subgroups, and gate model launches on fairness criteria with the same rigor as other conventional metrics.
- Build user persona–stratified benchmarks that reflect the breadth of Wallet's global user population across spending patterns, locales, and document types
- Evaluate generative and agentic model outputs — assessing hallucination rates, faithfulness, and groundedness using LLM-as-a-judge frameworks, human evaluation protocols, and prompt regression testing
- Own model quality sign-off — establish the launch criteria, run final evaluations, and make the call on model readiness before any feature ships
- Synthesize evaluation results into clear, actionable insights that guide model development priorities and product decisions
- Partner with ML engineers and Quality engineers to identify failure modes early in the development cycle and close the loop between evaluation findings and model improvements
- Establish and evangelize evaluation best practices across the Wallet ML team, raising the quality bar for how models are tested, monitored, and maintained post-launch
- M.S. in Machine Learning, Computer Science, Statistics, Applied Mathematics, or a related technical field strongly preferred.
- Bachelor's degree with 7+ years hands-on experience in ML evaluation, model quality, or applied research will be considered
- 5+ years of hands-on ML experience, with deep expertise in model evaluation, offline metrics design, and behavioral testing
- Strong track record designing evaluation frameworks for production ML systems — not just accuracy/F1, but precision-recall tradeoffs, calibration, fairness, and task-specific quality dimensions
- Creative mindset with the ability to translate standard ML evaluation metrics (F1, AUC, etc.) into utility and user trust measures
- Experience testing for distribution shift, out-of-distribution generalization, and temporal drift in real-world deployed models
- Proven ability to construct adversarial test suites, aggressor scenarios, and edge-case corpora that surface model failure modes before they reach users
- Experience with structured and semi-structured document understanding, OCR pipelines, or financial data extraction is a strong plus
- Strong programming skills in Python; fluency with evaluation tooling, data pipelines, and experiment tracking (e.g., MLflow, W&B, or equivalent)
- Excellent communication skills — ability to translate metric results into product-quality narratives for engineering and executive audiences
- Experience owning model quality sign-off in a cross-functional launch process
- PhD in Computer Science, Data Science, Statistics, AI/ML, or a related field.
- Experience with Bayesian or causal graph-based approaches to data generation.
- Experience with causal approaches to fairness evaluation — counterfactual fairness, causal Shapley values, or structural causal model–based bias auditing.
- Experience evaluating models under privacy constraints or on-device inference settings is a plus.
- Familiarity with confidence calibration techniques and uncertainty quantification a plus
- Background in financial services, fintech, or consumer payment products
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant At Apple, we believe accessibility is a fundamental human right. You'll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong. Learn about accessibility in Apple's workplace Learn about reasonable accommodations for job applicants Apple accepts applications to this posting on an ongoing basis. Submit Resume Back to search results See all roles in Austin
About Apple
Sourced by ZipRecruiter
Imagine what you could do here! At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, intelligent people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with all Apple Hardware products. The same real passion for innovation that goes into our products also applies to our practices strengthening our dedication to leave the world better than we found it.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Cupertino, CA, US
Year founded
1976