G2i
G2i

62 G2I Senior Java Software Engineer Jobs Hiring Near You

Senior Java Software Engineer

Tampa, FL · On-site

$118K - $155K/yr

Senior Java Software Engineer, Tampa, FL The Senior Java Software Engineer will assist a group of top notch developers to build the breakthrough features customers will love, adopt, and use while ...

$123K - $163K/yr

\n \n \n \n \n Senior Java Software Engineer \n \n \n Austin, TX \n \n \n $200,000 \n \n \n \n \n \n ShortList Recruitment have partnered with a FinTech company who are looking to bring in a highly ...

$138K - $182K/yr

Senior Java Software Engineer Ft. Meade Area, MD Government/Military Clearance Required: TS/SCI with Polygraph Full-Time | Fully Funded | $200K - $220K | 40 Hours/Week Keep the Mission Running. Helm ...

Senior Java Software Engineer.

Clayton, MO · On-site

$116K - $153K/yr

Senior Java Software Engineer. Location: Weldon or Clayton, MO Duration: Long term Contract Face to Face interview. Responsibilities As a Senior Software Engineer on the Vehicle Management Team, you ...

Senior Java Software Engineer

Olathe, KS · On-site

$118K - $155K/yr

Garmin is a leading technology company seeking a full-time Senior Java Software Engineer at their U.S. headquarters in the Greater Kansas City area. In this role, you will provide technical ...

Sr.Java Software Engineer

Durham, NC · On-site

$120K - $159K/yr

They are seeking a Senior Java Software Engineer to design and implement backend services and APIs, taking ownership of features and mentoring junior engineers. Responsibilities : • Design and ...

As a Senior Java Software Engineer you will play a critical role in designing, developing, and maintaining high-quality software applications. You will work with cross-functional teams to deliver ...

next page

Showing results 1-20

Senior Software Engineer - AI Interaction Evaluator (Codex / Claude Code, up to $200/hr)

Senior Software Engineer - AI Interaction Evaluator (Codex / Claude Code, up to $200/hr)

G2i Inc.

Columbus, OH

$200/hr

Contractor

Posted 16 days ago


Job description

Senior AI Interaction Evaluator (Codex / Claude Code)

Contract | $50-200/hr | 10+ hrs/week | Project-based

Roles open on a rolling basis - apply to join the talent bench and we'll reach out when one matches. Expect 40+ hrs once a project starts; timing depends on availability, but we move people in at the earliest genuine opportunity.

These roles are currently filled but we hire on a rolling basis as new projects open up. Apply now to join our talent bench - qualified candidates will be contacted directly when roles become available.

Check out this Loom video for more details!

We're looking for highly experienced software engineer (SR+) to help evaluate the quality of interactions with modern coding agents such as OpenAI Codex and Claude Code.

This is not a traditional engineering role.

You won't be writing production code.
You'll be evaluating something harder: whether the model thinks like a great engineer.

What This Role Actually Is

You will assess how AI coding agents behave in real-world scenarios - focusing on:

  • Whether the response makes sense

  • Whether the preamble and reasoning are useful

  • Whether the output reflects strong engineering judgment

  • Whether the interaction feels right to an experienced developer

This role is about engineering taste - not syntax correctness.

What You'll Be Doing
  • Evaluate AI-generated coding interactions end-to-end

  • Judge whether outputs are:

    • Useful

    • Correct (at a high level)

    • Aligned with how a strong engineer would think

  • Assess the quality of explanations and reasoning, not just code

  • Distinguish between different levels of response quality (e.g. what makes something a 2 vs 4)

  • Provide clear, opinionated feedback on:

    • What worked

    • What didn't

    • What felt "off" or misleading

  • Help define what great looks like when interacting with tools like Cursor

What We Mean by "Taste"

We're specifically looking for engineers who can answer questions like:

  • Does this feel like something a strong engineer would actually say?

  • Is this explanation helpful, or just technically correct?

  • Is the model guiding the user well, or just dumping output?

  • Would this interaction build or erode trust?

You should be comfortable making subjective but rigorous judgments.

Who You Are
  • Staff / Principal-level engineer (or equivalent experience)

  • Strong background in one of the below:

    • TypeScript / JavaScript

    • Python

  • Hands-on experience using:

    • OpenAI Codex

    • Claude Code

    • Cursor

  • Deep familiarity with modern AI-assisted dev workflows

  • Able to evaluate code without needing to fully execute or deeply review every line

  • Comfortable giving direct, opinionated feedback

  • High bar for what "good engineering" looks like

Nice to Have
  • Experience with tools like Cursor or similar AI-first IDEs

  • Prior exposure to prompt design or evaluation workflows

  • Experience mentoring senior engineers or defining engineering standards

Engagement Details
  • US and Canada up to $200/hr

  • EU and Latam up to $150/hr

  • Other locations up to $100/hr

  • Hours: ~10-20 hours/week

  • Duration: Ongoing - project-based

  • Process:

    • Take-home evaluation exercise

    • One behavioral interview


G2i logo

About G2i

Sourced by ZipRecruiter

Industry

Software development

Company size

11 - 50 Employees

Headquarters location

Delray Beach, FL, US

Year founded

2012