... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
... deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed ... HPC (AWS, GCP, SLURM, or Ray) Solid understanding of evaluation methodology -- held-out sets ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
We use machine learning and Internet-scale data to elevate customer experience, improve efficiency ... D. students to have an internship in our fast moving team. You will have the opportunity to work on ...
Machine Learning Engineer New Grad 2024-2025 -Remote
Indianapolis, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Indianapolis, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Evansville, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Evansville, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Fort Wayne, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Fort Wayne, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Hammond, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Hammond, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Terre Haute, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
Terre Haute, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
South Bend, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Machine Learning Engineer New Grad 2024-2025 -Remote
South Bend, IN · Remote
$139K - $168K/yr
At Poe, we use Machine Learning in various parts of the product - bot routing, agent flow, code ... Previous software engineering experience via an internship, work experience, or coding competition
Internship Aws Machine Learning information
- Machine Learning Flexible Hours
- Google Cloud Machine Learning Engineer
- Urgently Hiring Machine Learning Engineer New Grad
- Junior Machine Learning Engineer
- Remote Machine Learning Engineer
- Contract Machine Learning Engineer
- Artificial Intelligence Programmer
- Contract Machine Learning Engineer Biotech
- Machine Learning Engineer
- Generative Ai Writer
- Machine Learning Petroleum Engineer
- Internship Centura Technical Lead Senior Developer
- Internship Machine Learning Quant
- Online Machine Learning
- Machine Learning Engineer Opt
- Machine Learning Engineer Software Engineer
- Machine Learning Manager
- Machine Learning Internship Microsoft
- Contract Google Machine Learning Engineer
- Internship Disney Marvel
Full-time
Posted 3 days ago
Job description
About Us
We are AI researchers and builders who understand how to curate data and RL environments that truly improve models. We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart.
We are embarked on a journey to build Environments that are entire digital worlds that can be used to push the frontier of agents.
What You'll Be Working On
You will work directly with our research team on RL environment and task creation for agent training. This means designing observation spaces, action spaces, reward signals, and success criteria for new environments — and building the infrastructure that makes world-scale RL training possible. This is a high-ownership role; you will be building novel systems, not maintaining legacy ones.
Must-Have Skills
3+ years of ML engineering experience — model training, fine-tuning, or post-training pipelines in research or production
Strong Python and deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed precision)
Hands-on experience with LLM post-training — SFT, RLHF, PPO, DPO, or reward model training — and understanding of how training data quality affects model behavior
Familiarity with RL frameworks (Gymnasium, dm_env) and the ability to design or modify reward functions for agent training objectives
Experience running experiments at scale on cloud or HPC (AWS, GCP, SLURM, or Ray)
Solid understanding of evaluation methodology — held-out sets, benchmark design, avoiding train/eval contamination