Senior ML Scientist (Pricing Reinforcement Learning) Plano, TX - Remote. Key Responsibilities: Algorithm Development: Conceptualize, design, and implement state-of-the-art ML models for dynamic pricing and ...
Member of Engineering (Reinforcement Learning)
$99K - $136K/yr
ABOUT THE ROLE You would be working on our reinforcement learning team focused on improving ... Fully remote work & flexible hours * 37 days/year of vacation & holidays * Health insurance ...
Reinforcement Learning Expertise - Develop and apply RL techniques, including Contextual Bandits, Q-learning, and SARSA, and concepts like Thompson Sampling and Bayesian Optimization, to solve pricing and ...
Palo Alto, CA or Seattle, WA (Hybrid/Remote) About the Team Centific AI Research advances foundational AI models and applications through reinforcement learning, alignment, and human-centered ...
Member of Engineering (Reinforcement Learning Infrastructure)
$110K - $144K/yr
ABOUT THE ROLE You would be working on our reinforcement learning team focused on improving ... Fully remote work & flexible hours * 37 days/year of vacation & holidays * Health insurance ...
Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning
Ann Arbor, MI · On-site +1
$102K - $140K/yr
Meet the Team As a Senior Machine Learning Engineer - Learned Planner / Reinforcement Learning, you ... We are also open to hiring Remote in the United States Perks of Being a Full-time Torc'r Torc cares ...
Staff Scientist - Post-Training and Reinforcement Learning for AI for Science
Lemont, IL · On-site +1
This position qualifies as "Hybrid Remote Work - Mostly Onsite": which applies to employees ... Experience with reinforcement learning, policy optimization, bandits, preference learning, or ...
Research Engineer - Reinforcement Learning
San Francisco, CA · On-site +1
$180K - $290K/yr
Research Engineer - Reinforcement Learning You'll bring reinforcement learning to Firecrawl's core ... N/A for Remote About Firecrawl Firecrawl is the easiest way to extract data from the web.
Develop and train reinforcement learning models for real-world applications, focusing on efficiency ... Remote work location. * Competitive salary. * Flexible work schedule. * Opportunities for ...
Machine Learning Intern/Co-op (Fall 2026)
Boston, NY · On-site +1
Where You Fit Our Intern/Co-op program is designed to give you a real-world view of what it's like ... remote candidates (US based only). What You'll Do * Contribute to the design, development, and ...
Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning ...
San Francisco, CA · On-site +1
... Reinforcement Learning Autodesk AI Lab ... London • San Francisco • Toronto • Remote (US/CA/EU) The Opportunity Foundation models are ...
Coursework or research experience in advanced NLP, advanced ML systems, or reinforcement learning ... Los Gatos, CA headquarters or remote; flexible depending on team * This program is intended for ...
Applied Research Intern, Proactive Intelligence & Customer World Models (PhD / Graduate Co-op)
Bodega Bay, CA · Remote
Remote (US / Canada) Duration: Fall/Winter 2026 co-op - 8 months, flexible start September 2026 ... You'll work at the intersection of representation learning, foundation models, reinforcement ...
... remote Who We Need We are seeking a highly motivated and talented Machine Learning PhD Intern to ... Familiarity with parameter-efficient tuning techniques, Reinforcement Learning from Human Feedback ...
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA · On-site +1
$118K - $156K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.
Senior Machine Learning Engineer, Data Mining
Boston, MA · On-site +1
$133K - $175K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.
Senior Machine Learning Engineer, Data Mining
Pittsburgh, PA · On-site +1
$118K - $156K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Senior Machine Learning Engineer, Data Mining
San Francisco, CA · On-site +1
$144K - $190K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Senior Machine Learning Engineer, Data Mining
Las Vegas, NV · On-site +1
$117K - $154K/yr
Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...
Remote Reinforcement Learning Intern information
Hourly wage distribution:
- $8.89 - $10.29: 3% of jobs
- $10.29 - $11.69: 3% of jobs
- $11.69 - $13.09: 3% of jobs
- $13.09 - $14.49: 9% of jobs
- $14.49 - $15.89: 21% of jobs
- $15.89 - $17.29: 26% of jobs
- $17.29 - $18.68: 13% of jobs
- $18.68 - $20.08: 12% of jobs
- $20.08 - $21.48: 4% of jobs
- $21.48 - $22.88: 3% of jobs
- $22.88 - $24.28: 3% of jobs
$14.94 is the 25th percentile. Wages below this are outliers.
The median wage is $16.47 / hr.
$18.39 is the 75th percentile. Wages above this are outliers.
- Remote Architecture Internships
- Remote Algorithm Engineer Intern
- Food Science Intern Remote
- Remote Fashion Writing Internship
- Remote Landscape Architecture Student
- Remote Grant Writing Internship
- Internship Remote Logistics International
- Remote Intern Tax Winter 2026
- Remote Materials Science Intern
- Part Time Remote Prompt Engineer

Full-time
Posted 16 days ago
Job description
Plano, TX - Remote
Key Responsibilities
Algorithm Development: Conceptualize, design, and implement state-of-the-art ML models for dynamic pricing and personalized recommendations.
Reinforcement Learning Expertise: Develop and apply RL techniques, including Contextual Bandits, Q-learning, and SARSA, and concepts like Thompson Sampling and Bayesian Optimization, to solve pricing and optimization challenges.
AI Agents for Pricing: Build AI-driven pricing agents that incorporate consumer behaviour, demand elasticity, and competitive insights to optimize revenue and conversion.
Rapid ML Prototyping: Experience in quickly building, testing, and iterating on ML prototypes to validate ideas and refine algorithms.
Feature Engineering: Engineer large-scale consumer behavioural feature stores to support ML models, ensuring scalability and performance.
Cross-Functional Collaboration: Work closely with Marketing, Product, and Sales teams to ensure solutions align with strategic objectives and deliver measurable impact.
Controlled Experiments: Design, analyze, and troubleshoot A/B and multivariate tests to validate the effectiveness of your models.
Qualifications
8+ years in machine learning; 5+ years in reinforcement learning, recommendation systems, pricing algorithms, pattern recognition, or artificial intelligence.
Expertise in classical ML techniques (e.g., Classification, Clustering, Regression) using algorithms like XGBoost, Random Forest, SVM, and K-Means, with hands-on experience in RL methods such as Contextual Bandits, Q-learning, SARSA, and Bayesian approaches for pricing optimization.
Proficiency in handling tabular data, including sparsity, cardinality analysis, standardization, and encoding.
Proficient in Python and SQL, including Window Functions, Group By, Joins, and Partitioning.
Experience with ML frameworks and libraries such as scikit-learn, TensorFlow, and PyTorch.
Knowledge of controlled experimentation techniques, including causal A/B testing and multivariate testing.
Required Skills:
5+ Yrs Experience in Pricing Reinforcement Learning
8+ Yrs Experience in Machine Learning
Expert in Python & Tabular Data
SQL
Knowledge of A/B Testing