1

Map Evaluator Jobs (NOW HIRING)

Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports. * Assist in mapping evaluation outcomes to responsible AI ...

AI Evaluation Scientist

Mclean, VA · On-site

$105K - $145K/yr

Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports. * Assist in mapping evaluation outcomes to responsible AI ...

Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports. * Assist in mapping evaluation outcomes to responsible AI ...

AI Evaluation Scientist

Mclean, VA · On-site

$105K - $145K/yr

Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports. * Assist in mapping evaluation outcomes to responsible AI ...

Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports. * Assist in mapping evaluation outcomes to responsible AI ...

Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports. * Assist in mapping evaluation outcomes to responsible AI ...

Case Manager, MAP

Barstow, CA · On-site

$26.45/hr

MAP is an innovative program dedicated to engaging the community and providing rehabilitative ... CSI is committed to systematically evaluating the integration of the trauma-informed values of ...

Implement Lean Manufacturing concepts, including Value Stream Map evaluation of current state production opportunities and the implementation of future state throughout all areas of operations

next page

Showing results 1-20

Map Evaluator information

See salary details

$29.5K

$65.5K

$106.5K

How much do map evaluator jobs pay per year?

As of May 31, 2026, the average yearly pay for map evaluator in the United States is $65,471.00, according to ZipRecruiter salary data. Most workers in this role earn between $44,500.00 and $79,500.00 per year, depending on experience, location, and employer.

What is a Map Evaluator job?

A Map Evaluator is a remote job that involves assessing online maps for accuracy, relevance, and quality. Evaluators analyze search results, map data, and geographic information to ensure they reflect real-world locations and user intent. The role often requires familiarity with local areas, strong analytical skills, and adherence to specific guidelines. This job helps improve mapping services for better user experiences. Typically, it is a flexible, part-time position offered by companies that contract with major search engine providers.

What are the key skills and qualifications needed to thrive in the Map Evaluator position, and why are they important?

To thrive as a Map Evaluator, you need analytical skills, attention to detail, and a background in geography, GIS, or related fields. Familiarity with mapping software such as ArcGIS, Google Maps, or other geospatial tools is often expected, while relevant certifications can be advantageous. Strong communication, problem-solving abilities, and the capacity to provide constructive feedback help set candidates apart. These skills ensure the accurate evaluation and improvement of digital maps, directly impacting user experience and data reliability.

What are the typical daily responsibilities of a Map Evaluator?

As a Map Evaluator, your daily responsibilities typically include reviewing digital map data for accuracy, identifying and correcting errors, and assessing user contributions or automated changes. You may work independently or as part of a remote team, providing feedback to improve mapping algorithms and inform system updates. Tasks often involve researching locations, verifying place names, and flagging outdated or incorrect information to maintain high standards of map quality. Collaboration with data scientists or engineers may occasionally be required to address specific mapping challenges or implement user feedback.
What cities are hiring for Map Evaluator jobs? Cities with the most Map Evaluator job openings:
What are the most commonly searched types of Map Evaluator jobs? The most popular types of Map Evaluator jobs are:
What states have the most Map Evaluator jobs? States with the most job openings for Map Evaluator jobs include:
Infographic showing various Map Evaluator job openings in the United States as of May 2026, with employment types broken down into 95% Full Time, 1% Part Time, and 4% Contract. Highlights an 43% Physical, and 57% Remote job distribution, with an average salary of $65,471 per year, or $31.5 per hour.

$105K - $145K/yr

Full-time

Posted 14 days ago


Job description

We are looking for an AI Evaluation Scientist to design and execute evaluation processes that ensure our predictive and generative AI systems are accurate, reliable, safe, and aligned with mission requirements. This role is essential for establishing trust in AI solutions and supporting continuous improvement across the AI lifecycle. The AI Evaluation Scientist will work closely with engineers, data scientists, governance analysts, and product teams to develop evaluation metrics, build test harnesses, analyze model behavior, and support responsible deployment. 


  • Implement evaluation frameworks for AI models, including accuracy, robustness, relevance, bias, hallucination rate, and safety metrics.
  • Build and maintain automated evaluation scripts, tests, and pipelines that assess AI model outputs and detect performance drift over time.
  • Develop benchmark datasets, challenge sets, and scenario-based test cases tailored to mission and user needs.
  • Perform structured error analysis and behavioral audits of LLMs, retrieval-augmented generation (RAG) systems, and predictive models, documenting findings and improvement recommendations.
  • Collaborate with AI Developers, LLMOps Engineers, and Data Scientists to support iterative experimentation, model hardening, and quality improvements.
  • Contribute to the design of human-in-the-loop evaluation workflows, integrating qualitative and quantitative insight into evaluation reports.
  • Assist in mapping evaluation outcomes to responsible AI principles such as fairness, transparency, reliability, and safety.
  • Partner with AI Governance Analysts to ensure evaluation outputs support compliance, documentation, and risk assessments.
  • Stay current with emerging evaluation tools, frameworks, metrics, and research related to LLM assessment and generative AI reliability.
  • Document evaluation processes, criteria, and results for both technical and non-technical audiences.
  • You will contribute to the growth of our AI & Data Exploitation Practice! 

  • Ability to hold a position of public trust with the U.S. government.
  • Bachelor’s or Master’s degree in Computer Science, Statistics, Machine Learning, Cognitive Science, Human-Computer Interaction, Data Science, or a related field.
  • 2+ years of experience evaluating machine learning models, NLP systems, or generative AI models (LLMs preferred).
  • Familiarity with evaluation metrics, statistical testing, dataset creation, and experimental design for AI systems.
  • Proficiency in Python and relevant libraries such as PyTorch, Hugging Face, scikit-learn, LangChain.
  • Proficiency in AI evaluation frameworks such as Ragas.
  • Experience analyzing structured and unstructured data, including text, documents, and embeddings.
  • Understanding of LLM behavior, prompt evaluation, retrieval pipelines, or RAG architectures.
  • Exposure to responsible AI concepts and governance-aligned evaluation criteria (e.g., fairness, transparency, reliability).
  • Strong analytical skills with the ability to interpret model weaknesses, extract insights, and recommend actionable improvements.
  • Excellent written and verbal communication skills, with the ability to present evaluation findings clearly to technical and non-technical stakeholders.
  • Experience working in agile or iterative development environments is a plus.
  • Familiarity with OWASP LLM Top 10 Risks. 
  • NIH experience. 
  • Relevant certifications (helpful but not required): 
    • NIST AI RMF (AISIC)
    • INFORMS CAP
    • AWS/Azure/Google ML Certifications. 
  • Local to Washington, DC metro area preferred. 

Steampunk relies on several factors to determine salary, including but not limited to geographic location, contractual requirements, education, knowledge, skills, competencies, and experience. The projected compensation range for this position is $105,000 to $145,000.  The estimate displayed represents a typical annual salary range for this position. Annual salary is just one aspect of Steampunk’s total compensation package for employees. Learn more about additional Steampunk benefits here. 

Identity Statement

As part of the application process, you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud.

Steampunk is a Change Agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health and DoD sectors.  Through our Human-Centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges.  As an employee owned company, we focus on investing in our employees to enable them to do the greatest work of their careers – and rewarding them for outstanding contributions to our growth. If you want to learn more about our story, visit http://www.steampunk.com.