Data Engineer
$110K - $132K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
62 jobs near Columbus, OH
$110K - $132K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$110K - $132K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
Mountain View, CA · On-site
$126K - $156K/yr
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site
$126K - $156K/yr
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs AI builds large-scale AI environments and production systems that directly shape how next-generation models are trained. Our work has powered foundational datasets ...
About Bespoke Labs Bespoke Labs AI builds large-scale AI environments and production systems that directly shape how next-generation models are trained. Our work has powered foundational datasets ...
Mountain View, CA · On-site
Full Stack Developer at Bespoke Labs Bespoke Labs AI builds large-scale AI environments and production systems that directly shape how next-generation models are trained. Our work has powered ...
Mountain View, CA · On-site
Full Stack Developer at Bespoke Labs Bespoke Labs AI builds large-scale AI environments and production systems that directly shape how next-generation models are trained. Our work has powered ...
Mountain View, CA · On-site
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site
About Bespoke Labs Bespoke Labs AI builds large-scale AI environments and production systems that directly shape how next-generation models are trained. Our work has powered foundational datasets ...
Mountain View, CA · On-site
About Bespoke Labs Bespoke Labs AI builds large-scale AI environments and production systems that directly shape how next-generation models are trained. Our work has powered foundational datasets ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site +1
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site +1
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Bespoke Labs is building the modern post-training stack that can be deployed by enterprises to build custom models and agents that work well on their data. Our vision is to help enterprises exceed ...
Bespoke Labs is building the modern post-training stack that can be deployed by enterprises to build custom models and agents that work well on their data. Our vision is to help enterprises exceed ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
Mountain View, CA · On-site
About Bespoke Labs Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents. Recently, we curated Open Thoughts, one of the best open ...
$113K - $135K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$113K - $135K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$123K - $148K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$123K - $148K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$105K - $127K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$105K - $127K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$103K - $123K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$103K - $123K/yr
We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart. We are embarked on a journey to build Environments that ...
$110K - $132K/yr
Full-time
Posted 4 days ago
About Us
We are AI researchers and builders who understand how to curate data and RL environments that truly improve models. We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart.
We are embarked on a journey to build Environments that are entire digital worlds that can be used to push the frontier of agents.
What You'll Be Working On
You will work directly with our research team on RL environment and task creation for agent training. This means designing observation spaces, action spaces, reward signals, and success criteria for new environments — and building the infrastructure that makes world-scale RL training possible. This is a high-ownership role; you will be building novel systems, not maintaining legacy ones.
Must-Have Skills
3+ years of data engineering experience — pipelines, ETL, data modeling in production or research settings
Strong Python proficiency (numpy, pandas, Parquet, HDF5 are daily tools)
Familiarity with at least one RL framework (Gymnasium / OpenAI Gym, dm_env, or equivalent) and working knowledge of RL environment structure — observation/action spaces, reward signals, episode logic
Experience with data versioning and experiment tracking (DVC, MLflow, W&B, or similar)
Comfortable with Docker and cloud infrastructure (AWS or GCP)
Solid grasp of ML storage formats: Parquet, HDF5, JSON Lines