Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure. * Simulation and Training: Build ...
Research Scientist - Reinforcement Learning
$150K - $450K/yr
... deep learning, driving impactful discoveries that inspire the next generation of AI pioneers ... Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Research Scientist - Reinforcement Learning
$150K - $450K/yr
... deep learning, driving impactful discoveries that inspire the next generation of AI pioneers ... Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Research Scientist - Reinforcement Learning
Sunnyvale, CA · On-site
$150K - $450K/yr
... deep learning, driving impactful discoveries that inspire the next generation of AIpioneers ... Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Research Scientist - Reinforcement Learning
Sunnyvale, CA · On-site
$150K - $450K/yr
... deep learning, driving impactful discoveries that inspire the next generation of AIpioneers ... Position Summary As a Research Scientist within our Reinforcement Learning team, you will play a ...
Your role on our team Pioneer the application of reinforcement learning (RL) and sequential ... Deep understanding of RL fundamentals, including MDPs, dynamic programming, temporal-difference ...
Your role on our team Pioneer the application of reinforcement learning (RL) and sequential ... Deep understanding of RL fundamentals, including MDPs, dynamic programming, temporal-difference ...
Your role on our team Pioneer the application of reinforcement learning (RL) and sequential ... Deep understanding of RL fundamentals, including MDPs, dynamic programming, temporal-difference ...
Your role on our team Pioneer the application of reinforcement learning (RL) and sequential ... Deep understanding of RL fundamentals, including MDPs, dynamic programming, temporal-difference ...
Senior Reinforcement Learning Engineer
Austin, TX · On-site
$103K - $142K/yr
JOB SUMMARY The Senior Reinforcement Learning Engineer is a key, hands-on role focused on achieving ... This engineer will leverage their deep expertise in RL to solve critical locomotion and ...
Senior Reinforcement Learning Engineer
Austin, TX · On-site
$103K - $142K/yr
JOB SUMMARY The Senior Reinforcement Learning Engineer is a key, hands-on role focused on achieving ... This engineer will leverage their deep expertise in RL to solve critical locomotion and ...
Machine Learning Scientist, Reinforcement Learning
Emeryville, CA · On-site +1
$200K - $330K/yr
Founded in 2022, we develop deep generative models to design and validate novel, functional ... Design and develop state-of-the-art online and offline reinforcement learning algorithms for ...
Machine Learning Scientist, Reinforcement Learning
Emeryville, CA · On-site +1
$200K - $330K/yr
Founded in 2022, we develop deep generative models to design and validate novel, functional ... Design and develop state-of-the-art online and offline reinforcement learning algorithms for ...
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Your role on our team Pioneer the application of reinforcement learning (RL) and sequential ... Deep understanding of RL fundamentals, including MDPs, dynamic programming, temporal-difference ...
Your role on our team Pioneer the application of reinforcement learning (RL) and sequential ... Deep understanding of RL fundamentals, including MDPs, dynamic programming, temporal-difference ...
Reinforcement Learning Engineer
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site | Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Reinforcement Learning Engineer
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site | Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Reinforcement Learning Engineer
New York, NY · On-site
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site | Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Quick apply
Reinforcement Learning Engineer
New York, NY · On-site
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site | Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Staff, ML Research Scientist
Waltham, MA · On-site
$154K - $192K/yr
Experience in the full modeling cycle from research to deployment of modern Deep Learning architectures such as Transformers, VLMs/VLAs, and Deep Reinforcement Learning. * Knowledge of ...
Staff, ML Research Scientist
Waltham, MA · On-site
$154K - $192K/yr
Experience in the full modeling cycle from research to deployment of modern Deep Learning architectures such as Transformers, VLMs/VLAs, and Deep Reinforcement Learning. * Knowledge of ...
Helix AI Engineer, Reinforcement Learning
San Jose, CA · On-site +1
Proficiency in Python and deep learning frameworks such as PyTorch * Experience with large-scale ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Helix AI Engineer, Reinforcement Learning
San Jose, CA · On-site +1
Proficiency in Python and deep learning frameworks such as PyTorch * Experience with large-scale ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Member of Engineering (Reinforcement Learning)
$99K - $136K/yr
What unites us is our deep care for what we build together. We're in a race that requires hard work ... ABOUT THE ROLE You would be working on our reinforcement learning team focused on improving ...
Member of Engineering (Reinforcement Learning)
$99K - $136K/yr
What unites us is our deep care for what we build together. We're in a race that requires hard work ... ABOUT THE ROLE You would be working on our reinforcement learning team focused on improving ...
Staff, ML Research Scientist
Waltham, MA · On-site
$154K - $192K/yr
Experience in the full modeling cycle from research to deployment of modern Deep Learning architectures such as Transformers, VLMs/VLAs, and Deep Reinforcement Learning. * Knowledge of ...
Staff, ML Research Scientist
Waltham, MA · On-site
$154K - $192K/yr
Experience in the full modeling cycle from research to deployment of modern Deep Learning architectures such as Transformers, VLMs/VLAs, and Deep Reinforcement Learning. * Knowledge of ...
Research Engineer, Machine Learning (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Research Engineer, Machine Learning (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Proficiency in Python and deep learning frameworks such as PyTorch * Experience with large-scale ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Proficiency in Python and deep learning frameworks such as PyTorch * Experience with large-scale ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Graduate Student Instructor - SIADS 644 Spring Summer 2026
Ann Arbor, MI · On-site
$15 - $18.75/hr
Familiar with implementation of deep reinforcement learning algorithms, including DQN, DDQN, DDPG, SAC. Familiar with the OpenAI.Gym environment. Desired Qualifications* * Experience or interest in ...
Graduate Student Instructor - SIADS 644 Spring Summer 2026
Ann Arbor, MI · On-site
$15 - $18.75/hr
Familiar with implementation of deep reinforcement learning algorithms, including DQN, DDQN, DDPG, SAC. Familiar with the OpenAI.Gym environment. Desired Qualifications* * Experience or interest in ...
Reinforcement Learning Engineer
New York, NY · On-site
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Reinforcement Learning Engineer
New York, NY · On-site
$87K - $118K/yr
Reinforcement Learning (RL) Engineer Location: New York (Office) On-site Full-time Compensation ... A deep-dive assessment into RL architecture, simulation frameworks, and live production experience.
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the ...
Deep Reinforcement Learning information
See salary details
$28.5K - $33.2K
2% of jobs
$33.2K - $37.9K
3% of jobs
$37.9K - $42.5K
6% of jobs
$42.5K - $47.2K
3% of jobs
$51.4K is the 25th percentile. Wages below this are outliers.
$47.2K - $51.9K
12% of jobs
The median wage is $56.3K / yr.
$51.9K - $56.6K
25% of jobs
$56.6K - $61.3K
12% of jobs
$61.3K - $66K
9% of jobs
$66.9K is the 75th percentile. Wages above this are outliers.
$66K - $70.6K
12% of jobs
$70.6K - $75.3K
11% of jobs
$75.3K - $80K
5% of jobs
$28.5K
$58.3K
$80K
How much do deep reinforcement learning jobs pay per year?
What does a typical day look like for someone working in Deep Reinforcement Learning?
A typical day for a Deep Reinforcement Learning professional involves designing algorithms, running experiments, analyzing results, and optimizing models to improve performance. You may collaborate regularly with data scientists, software engineers, and domain experts to integrate RL solutions into larger systems or products. Tasks often include reading the latest research, contributing to code reviews, and documenting findings while troubleshooting technical challenges. This dynamic environment encourages continuous learning and teamwork, ensuring you stay at the forefront of AI innovation.
What is a Deep Reinforcement Learning job?
A Deep Reinforcement Learning (DRL) job involves researching, developing, and applying AI models that use reinforcement learning techniques combined with deep learning. Professionals in this role design algorithms that enable agents to learn optimal decision-making policies through trial and error. Common applications include robotics, game AI, autonomous systems, and financial modeling. This job typically requires expertise in machine learning, neural networks, and programming languages like Python, along with frameworks such as TensorFlow or PyTorch.
What are the key skills and qualifications needed to thrive in the Deep Reinforcement Learning position, and why are they important?
To thrive in Deep Reinforcement Learning, you need expertise in machine learning, programming (Python, TensorFlow, or PyTorch), and applied mathematics, often supported by an advanced degree in computer science or a related field. Familiarity with version control systems, cloud computing platforms, and relevant certifications in AI or data science are valuable assets. Strong problem-solving abilities, collaboration, and effective communication are important soft skills in this position. These skills are essential for developing, implementing, and iterating cutting-edge algorithms that solve complex real-world problems in dynamic environments.
- Junior Machine Learning
- Graduate Machine Learning
- Machine Learning Intern Remote
- Artificial Intelligence Machine Learning Physics
- Learning Specialist Athletics
- Freelance Artificial Intelligence Machine Learning
- Math Learning Specialist
- Machine Learning Internship Microsoft
- Remote Machine Learning
- Physics Informed Machine Learning

Other
Medical, Dental, Vision, Retirement
Posted 2 days ago
Job description
We're unleashing AI with intelligent orchestration while addressing one of the most pressing bottlenecks for AI access to Power. Our cutting-edge platform optimizes data center power infrastructure to maximize AI token generation within existing electrical limits, without requiring new power plants or grid expansions. Our team has optimized over 8 gigawatts of mission-critical power globally, and we're addressing a $64 billion-per-year market opportunity while dramatically reducing the environmental footprint of AI infrastructure.
At Hammerhead, you will:
Work at the intersection of AI, energy, and compute creating the next generation AI infrastructure
Collaborate with colleagues that are experts in modern RL and AI, IoT and IIoT software, and infrastructure technologies
Contribute to building a more efficient and sustainable future for AI compute.
Join a company at the cutting edge of modern data center design and operation
Receive competitive compensation, equity, and benefits in a high-growth, mission-driven environment.
Learn from an experienced team that has built and sold startups before
Learn more about Hammerhead
- These AutoGrid alums want to change how data centers use power
- How Hammerhead Wants to Rewrite the Economics of AI
- News & Blogs
As a Reinforcement Learning Engineer, you will be the architect of the core intelligence for Hammerhead's ORCA platform. Reporting to the Head of AI / Reinforcement Learning Engineering, you will design, train, and deploy the Orchestrated RL Control Agents that form the brain of our system, making real-time decisions to optimize power and compute resources across physical data centers. This role is for a hands-on expert who is passionate about applying cutting-edge RL research to complex, real-world industrial systems. You will be instrumental in developing the models that control physical assets like cooling systems and power distribution units to unlock massive efficiency gains in AI workloads.
Key Responsibilities
- RL Model Development: Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure.
- Simulation and Training: Build and train RL agents that can generalize to real-world, physical systems.
- From Lab to Production: Lead the transition of RL models from research and simulation to live deployment within the ORCA platform, ensuring stability and performance on mission-critical hardware.
- System Optimization: Analyze agent performance to continuously improve control strategies for tasks like peak shaving, workload shifting, and thermal management.
- Cross-Functional Collaboration: Partner with platform engineers to define the APIs, data telemetry, and infrastructure needed to support and scale our RL agents across a global portfolio of data centers.
- RL Expertise: Proven experience developing and implementing reinforcement learning algorithms, demonstrated through publications in top conferences (e.g., NeurIPS, ICML, ICLR), open-source contributions, or shipped products.
- Industry Experience: 3+ years of experience applying RL to real-world problems, preferably in industrial automation, robotics, autonomous vehicles, energy systems, or other physical systems. Experience from a leading industrial or academic RL lab is highly desirable.
- Technical Skills: Deep proficiency in Python and modern ML frameworks such as PyTorch, Jax, or TensorFlow. Experience with simulation platforms and RL libraries (e.g., Ray RLlib, Isaac Gym) is a plus.
- Educational Background: MS or PhD in Computer Science, Robotics, Operations Research, or a related field with a focus on machine learning or control theory.
- Problem Solver: You possess a strong theoretical background but are driven by practical application, with an ability to bridge the gap between RL theory and the constraints of physical, real-world systems.
- Competitive salary, bonus, 401(k) plan and equity in a rapidly growing startup
- Comprehensive health, dental, and vision coverage
- Opportunity to apply the latest AI technologies working with an experienced team
Join our team to shape the foundation of tomorrow's AI infrastructure
Visit our Careers page at (hammerheadco dot ai / careers) to apply