
Remote Evaluation Jobs (NOW HIRING)

Remote contract for PhDs in Physics, Applied Physics, or related fields. Work on cutting-edge ... Shortlisted experts complete an evaluation before selection. * Assignments: Contract roles with ...

Remote Commitment: 40 hours/week Role Responsibilities * Guide research teams to close knowledge ... Develop evaluation frameworks and rubrics for assessing scientific reasoning quality across STEM ...

As a Clinical AI Evaluation Specialist on CentralReach's AI Governance team, you will be ... Experience in a remote work environment with strong self-direction and accountability. #LI-Remote ...

Follow structured guidelines to ensure consistency in evaluations Basic Qualifications * Native or ... Fully remote and flexible work environment Eligibility * Candidates must be based in the United ...

DESCRIPTION About the Job The Testing Evaluator position is a remote, on demand, part-time position. The Testing Evaluator assists with testing for language proficiency examinations, interacting with ...

Remote Evaluation information

Salary chart markers: $83.5K, $127K, $171K

How much do remote evaluation jobs pay per year?

As of Jun 9, 2026, the average yearly pay for remote evaluation jobs in the United States is $127,031, according to ZipRecruiter salary data. Most workers in this role earn between $109,000 and $143,500 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Remote Evaluation position, and why are they important?

To thrive in a Remote Evaluation role, you need strong analytical skills, attention to detail, and a relevant bachelor’s degree or specialized training in assessment or evaluation. Proficiency in digital assessment platforms, data analysis tools such as Excel or Tableau, and familiarity with industry-specific evaluation software are often required. Outstanding written communication, independent time management, and adaptability help remote evaluators excel in a virtual environment. These skills ensure accurate, efficient, and meaningful assessments while maintaining productivity and effective collaboration across remote teams.

What are the typical daily responsibilities for someone working in Remote Evaluation?

As a Remote Evaluation professional, your daily responsibilities often include reviewing and scoring applications, surveys, or assessments, analyzing data, and compiling detailed evaluation reports. You'll typically work independently but may also participate in virtual meetings with team members or stakeholders to ensure alignment on evaluation criteria and share findings. Adhering to strict deadlines and maintaining confidentiality of sensitive information are common expectations. The workload can vary based on project cycles, so strong organizational skills are essential for managing multiple assignments. Collaboration tools such as email, project management platforms, and video conferencing software are frequently used in this role.

What is a Remote Evaluation job?

A Remote Evaluation job involves assessing data, products, services, or processes from a remote location. This can include reviewing applications, analyzing research data, conducting quality assessments, or providing feedback on various projects. It is commonly found in education, market research, healthcare, and tech industries. Evaluators use digital tools to complete assessments without needing to travel or work on-site. These roles require strong analytical skills, attention to detail, and the ability to work independently.

More about Remote Evaluation jobs
Infographic: Remote Evaluation job openings in the United States as of May 2026. Employment types: 79% Full Time, 14% Part Time, 4% Contract, and 3% As Needed. Work arrangement: 91% Physical, 7% Remote, and 2% Hybrid. Average salary: $127,031 per year, or $61.1 per hour.

Python Infrastructure Engineer - Model Evaluation

Alignerr

Seattle, WA • Remote

Other

Posted 26 days ago


Job description

Python Infrastructure Engineer - Model Evaluation (AI Training)
About the Role
What if your Python expertise could directly shape how the world's most advanced AI models are built, tested, and improved? We're looking for a Senior Python Infrastructure Engineer to design and build the data pipelines, annotation tooling, and evaluation systems that leading AI labs depend on to train and validate next-generation models.
This is a fully remote contract role with flexible hours - you'll be working on real production systems at the cutting edge of AI development.
  • Organization: Alignerr
  • Type: Hourly Contract
  • Location: Remote
  • Commitment: 20-40 hours/week
What You'll Do
  • Design, build, and optimize high-performance Python systems supporting AI data pipelines and model evaluation workflows
  • Develop full-stack tooling and backend services for large-scale data annotation, validation, and quality control
  • Build and maintain evaluation harnesses for ML models, integrating with inference frameworks
  • Improve reliability, performance, and safety across existing Python codebases
  • Implement observability, metrics collection, and monitoring to track system reliability and model performance
  • Identify bottlenecks and edge cases in data and system behavior, and ship scalable fixes
  • Collaborate with data, research, and engineering teams to support model training and evaluation workflows
  • Participate in synchronous design reviews to iterate on system architecture and implementation decisions
Who You Are
  • Native or fluent English speaker with clear written and verbal communication skills
  • Full-stack developer with a strong systems programming background
  • 3-5+ years of professional experience writing production-grade Python
  • Experienced in building evaluation harnesses for ML models and integrating with inference frameworks
  • Strong background in observability, metrics collection, and system reliability monitoring
  • Able to commit 20-40 hours per week consistently
  • Self-directed and comfortable working asynchronously across distributed teams
Nice to Have
  • Prior experience with data annotation, data quality, or evaluation systems
  • Familiarity with AI/ML workflows, model training, or benchmarking pipelines
  • Experience with distributed systems or developer tooling
  • Background in MLOps, infrastructure engineering, or platform engineering
Why Join Us
  • Work on real production systems powering some of the most advanced AI research in the world
  • Fully remote and flexible - structure your work around your life
  • Freelance autonomy with the depth and meaning of high-impact engineering work
  • Contribute directly to AI infrastructure that shapes how next-generation models are built and evaluated
  • Potential for ongoing work and contract extension as new projects launch