Remote Reinforcement Learning Jobs (NOW HIRING)

Reinforcement Learning Engineer (Cybersecurity)

$176K - $242K/yr

As a Reinforcement Learning Engineer you will advance the frontier of AI Reinforcement Learning ... Environment - remote, work-from-home 100% of the time. Pay Range Disclosure At Bugcrowd, we strive ...

Bugcrowd

Reinforcement Learning Engineer (Cybersecurity)

$176K - $242K/yr

TriOptus LLC

ML Scientist (Pricing Reinforcement Learning) | REMOTE |

Bellevue, WA · On-site +1

Reinforcement Learning Expertise - Develop and apply RL techniques including Contextual Bandits Qlearning SARSA and concepts like Thompson Sampling and Bayesian Optimization to solve pricing and ...

TriOptus LLC

ML Scientist (Pricing Reinforcement Learning) | REMOTE |

Bellevue, WA · On-site +1

Centific

Research Intern - Applied Reinforcement Learning

$35 - $45/hr

About Job PhD Research Intern - Applied Reinforcement Learning Centific AI Research Role Summary ... Palo Alto, CA (Preferred), Redmond, WA (Preferred) or Remote Duration: 3-6 months What We Offer ...

Centific

Research Intern - Applied Reinforcement Learning

$35 - $45/hr

Centific

Applied Reinforcement Learning Engineer

Palo Alto, CA or Seattle, WA (Hybrid/Remote) About the Team Centific AI Research advances foundational AI models and applications through reinforcement learning, alignment, and human-centered ...

Centific

Applied Reinforcement Learning Engineer

Path Robotics

Senior Machine Learning Engineer - Reinforcement Learning

Columbus, OH · On-site +1

$118K - $156K/yr

As a Sr. ML Engineer focused on Reinforcement Learning , you will design, implement, and optimize ... This role can be located in our Columbus, Ohio Headquarters or Remote. What You'll Do * Design ...

Path Robotics

Senior Machine Learning Engineer - Reinforcement Learning

Columbus, OH · On-site +1

$118K - $156K/yr

Torc Robotics

Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning

Ann Arbor, MI · On-site +1

$102K - $140K/yr

Meet the Team As a Senior Machine Learning Engineer - Learned Planner / Reinforcement Learning, you ... We are also open to hiring Remote in the United States Perks of Being a Full-time Torc'r Torc cares ...

Torc Robotics

Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning

Ann Arbor, MI · On-site +1

$102K - $140K/yr

Argonne National Laboratory

Staff Scientist - Post-Training and Reinforcement Learning for AI for Science

Lemont, IL · On-site +1

This position qualifies as "Hybrid Remote Work - Mostly Onsite": which applies to employees ... Experience with reinforcement learning, policy optimization, bandits, preference learning, or ...

Argonne National Laboratory

Staff Scientist - Post-Training and Reinforcement Learning for AI for Science

Lemont, IL · On-site +1

Statheros

Develop and train reinforcement learning models for real-world applications, focusing on efficiency ... Remote work location. * Competitive salary. * Flexible work schedule. * Opportunities for ...

Statheros

Artificial Intelligence (AI) Engineer / Developer (Remote)

Cookeville, TN · On-site +1

Statheros

Artificial Intelligence (AI) Engineer / Developer (Remote)

Cookeville, TN · On-site +1

Autodesk, Inc.

Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning Aut

San Francisco, CA · On-site +1

... Reinforcement Learning Autodesk AI Lab ... London • San Francisco • Toronto • Remote (US/CA/EU) The Opportunity Foundation models are ...

Autodesk, Inc.

Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning Aut

San Francisco, CA · On-site +1

... Reinforcement Learning Autodesk AI Lab ... London • San Francisco • Toronto • Remote (US/CA/EU) The Opportunity Foundation models are ...

talentpluto

RL Environment Software Engineer

$220K - $400K/yr

United States (remote) Work Model: Fully remote Industry: Applied AI / AI research data ... reinforcement-learning environments and agents sold to the world's leading AI labs. In under two ...

talentpluto

RL Environment Software Engineer

$220K - $400K/yr

Omada Health

Principal Applied Machine Learning Scientist

... reinforcement learning, causal inference or health AI. Benefits: * Competitive salary with generous annual cash bonus * Equity grants * Remote first work from home culture * Flexible Time Off to help ...

Omada Health

Principal Applied Machine Learning Scientist

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... be fully remote. The salary range for this role is an estimate based on a wide range of ...

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Motional

Senior Machine Learning Engineer, Data Mining

San Francisco, CA · On-site +1

$144K - $190K/yr

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

San Francisco, CA · On-site +1

$144K - $190K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

Pittsburgh, PA · On-site +1

$118K - $156K/yr

Reinforcement Learning for Data Discover : Build RL-based policy learning and reasoning systems for ... remote.

Motional

Senior Machine Learning Engineer, Data Mining

$125K - $165K/yr

Motional

Senior Machine Learning Engineer, Data Mining

$125K - $165K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

Boston, MA · On-site +1

$133K - $175K/yr

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Quick apply

Motional

Senior Machine Learning Engineer, Data Mining

Las Vegas, NV · On-site +1

$117K - $154K/yr

Showing results 1-20

Remote Reinforcement Learning Jobs

Remote Reinforcement Learning information

See salary details

$11K

$83.9K

$140K

How much do remote reinforcement learning jobs pay per year?

As of Jul 23, 2026, the average yearly pay for remote reinforcement learning in the United States is $83,885.00, according to ZipRecruiter salary data. Most workers in this role earn between $72,000.00 and $139,000.00 per year, depending on experience, location, and employer.

What is a Remote Reinforcement Learning job?

A Remote Reinforcement Learning job involves developing and applying reinforcement learning algorithms while working from a location outside of a traditional office environment. Professionals in this field focus on creating systems where agents learn optimal behaviors through trial and error, often using feedback from their environment. These jobs typically require expertise in machine learning, programming, and mathematics, and are commonly found in industries like robotics, gaming, and autonomous systems. Working remotely allows researchers and engineers to collaborate with global teams using digital tools and platforms.

What are the key skills and qualifications needed to thrive as a Remote Reinforcement Learning Engineer, and why are they important?

To thrive as a Remote Reinforcement Learning Engineer, you need a strong background in machine learning, statistics, and programming (especially Python), often supported by an advanced degree in computer science or a related field. Familiarity with frameworks such as TensorFlow, PyTorch, and RL-specific libraries like OpenAI Gym, along with experience using cloud computing platforms, is typically required. Excellent problem-solving skills, self-motivation, and effective remote communication help individuals excel in distributed teams. These skills ensure the successful design, implementation, and deployment of reinforcement learning solutions while collaborating efficiently in a remote work environment.

What is the difference between Remote Reinforcement Learning vs Remote Machine Learning Engineer?

Aspect	Remote Reinforcement Learning
Required Credentials	Master's or PhD in Computer Science, AI, or related fields; knowledge of RL algorithms
Work Environment	Research-focused, experimental, often involves simulation and algorithm development
Employer & Industry Usage	Tech companies, research labs, AI startups focusing on autonomous systems
Common Search & Comparison Intent	Understanding specialized AI roles, research focus, and technical skills

Remote Reinforcement Learning specialists focus on developing algorithms that enable machines to learn through trial and error in simulated or real environments. In contrast, Remote Machine Learning Engineers typically work on deploying and optimizing various machine learning models across applications. While both roles require strong programming skills and knowledge of AI, reinforcement learning emphasizes decision-making processes, whereas machine learning engineering covers a broader range of models and deployment strategies.

What are common challenges faced when working remotely in a Reinforcement Learning role and how can they be addressed?

Working remotely in a Reinforcement Learning role often involves overcoming communication barriers with cross-functional teams, managing large-scale experiments without on-site resources, and staying updated with rapidly evolving research. To address these challenges, it's important to establish regular check-ins with colleagues, utilize cloud-based platforms for experiment management, and participate in virtual seminars or journal clubs. Developing strong self-motivation and time management skills is also crucial to maintain productivity in a remote environment.

More about Remote Reinforcement Learning jobs

The 10 Top Types Of Remote Reinforcement Learning Jobs

What cities are hiring for Remote Reinforcement Learning jobs? Cities with the most Remote Reinforcement Learning job openings:

What are the most commonly searched types of Reinforcement Learning jobs? The most popular types of Reinforcement Learning jobs are:

What states have the most Remote Reinforcement Learning jobs? States with the most job openings for Remote Reinforcement Learning jobs include:

What job categories do people searching Remote Reinforcement Learning jobs look for? The top searched job categories for Remote Reinforcement Learning jobs are:

Remote Reinforcement Learning jobs near you

Infographic showing various Remote Reinforcement Learning job openings in the United States as of July 2026, with employment types broken down into 70% Full Time, 18% Part Time, and 12% Contract. Highlights an 100% Remote job distribution, with an average salary of $83,885 per year, or $40.3 per hour.

Reinforcement Learning Engineer (Cybersecurity)

Bugcrowd

Remote

Apply

$176K - $242K/yr

Full-time

Posted 15 days ago

Job description

We are Bugcrowd. Since 2012, we've been empowering organizations to take back control and stay ahead of threat actors by uniting the collective ingenuity and expertise of our customers and trusted alliance of elite hackers, with our patented data and AI-powered Security Knowledge Platform™. Our network of hackers brings diverse expertise to uncover hidden weaknesses, adapting swiftly to evolving threats, even against zero-day exploits. With unmatched scalability and adaptability, our data and AI-driven CrowdMatch™ technology in our platform finds the perfect talent for your unique fight. We aim to create a new era of modern crowdsourced security that outpaces threat actors. Unleash the ingenuity of the hacker community with Bugcrowd, visit www.bugcrowd.com. Based in San Francisco and New Hampshire, Bugcrowd is supported by General Catalyst, Rally Ventures, Costanoa Ventures, and others.
Job Summary
The Bugcrowd RL and Reasoning Team focuses on pushing the boundaries of autonomous cybersecurity by building authentic reinforcement learning environments for foundational model companies. As a Reinforcement Learning Engineer you will advance the frontier of AI Reinforcement Learning development and delivery. You will build the infrastructure and tooling that transforms real-world vulnerability research into large-scale reinforcement learning environments used to train next-generation AI systems.
This role is unique. You will help create the training environments that teach AI systems how to hack and defend software. Your work will directly influence the capabilities of the next generation of AI models. Instead of building a single application, you will build the infrastructure that generates thousands of environments used to train frontier AI systems.
Our team works at the intersection of AI, security research, and systems engineering, building environments that allow models to learn skills such as vulnerability discovery, exploitation, and remediation.
Essential Duties and Responsibilities
If you enjoy building high-performance systems that power cutting-edge AI research, this role is for you.
This role focuses on building the systems that generate RL environments, not just the environments themselves. You will design pipelines that ingest software projects, analyze them with Bugcrowd's Mayhem platform, and automatically construct training environments used by frontier AI labs including Anthropic, OpenAI, and Cohere.
The ideal candidate is a strong systems engineer who understands:

Reinforcement learning workflows
Building clean, reproducible Linux ML environments (containers, MCP, etc)
System security background in binary exploitation, such as buffer overflows, fuzzing, exploitation, and x86/64.
Experience developing applications in Python and C, with Rust a plus.

Education, Experience, Knowledge, Skills, and Abilities
Understanding of RL training workflows used by modern LLM systems

Experience with DevOps pipelines (e.g., github actions), reproducible builds (docker, buildkit, nix).
Proficiency in Python and C. Other languages (especially Rust) are a plus.
Understanding of software vulnerabilities, fuzzing, or program analysis
Experience with build systems and large open-source codebases
Comfort working with Linux systems and low-level debugging
Experience working with benchmark environments (CTFs, SWE-bench, security challenges, etc.) is a plus

Working Conditions and Physical Requirements
The ideal candidate must be able to complete all physical requirements of the job with or without reasonable accommodation.
Sitting and / or standing - Must be able to remain in a stationary position 50% of the time
Carrying and / or lifting - Must be able to carry / move laptop as needed throughout the work day.
Environment - remote, work-from-home 100% of the time.
Pay Range Disclosure
At Bugcrowd, we strive for fairness, equality and to create an environment that allows our people to perform at their very best. Our compensation philosophy is to foster a collaborative community that rewards, attracts and retains the best possible talent. The provided salary details are based on US national averages and we retain the flexibility to tailor to the needs of the business.
The national estimate for the current base range for the position of $176,400 - $242,550.
This position may also be eligible to participate in a discretionary bonus program or commission plan, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.
Culture

At Bugcrowd, we understand that diversity in the workplace is vital to a company's success and growth. We strive to make sure that people are included and have a sense of being part of making Bugcrowd not only a great product but a great place to work.
We regularly hear from both customers and researchers that Bugcrowd feels like a family, and we strive to maintain that internally as well.
Our team consists of a broad range of people: musicians, adventure sports junkies, nature lovers, parents, cereal enthusiasts, night owls, cyclists, artists-you get the point.

At Bugcrowd, we are solving security threats and vulnerabilities that are relevant to everyone, therefore we believe solving these problems takes all kinds of backgrounds. We value the perspectives and experiences people from underrepresented backgrounds bring.
Disclaimer
This position has access to highly confidential, sensitive information relating to the technologies of Bugcrowd. It is essential that the applicant possess the requisite integrity to maintain the information in the strictest confidence.
The company is authorized to obtain background checks for employment purposes under state and federal law. Background checks will be conducted for positions that involve access to confidential or proprietary information (including trade secrets).
Background checks may include Social Security verification, prior employment verification, personal and professional references, educational verification, and criminal history. Applicants with conviction histories will not be excluded from consideration to the extent required bylaw.
Any personal data you submit in connection with your application will be processed in compliance with Bugcrowd's Privacy Policy, which you may review here: https://www.bugcrowd.com/privacy.
Equal Employment Opportunity:
Bugcrowd is EOE, Disability/Age Employer.
Individuals seeking employment at Bugcrowd are considered without regards to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation.
Bugcrowd is committed to the full inclusion of all qualified individuals. In keeping with our commitment, Bugcrowd will take the steps to assure that people with disabilities are provided reasonable accommodations. Accordingly, if reasonable accommodation is required to fully participate in the job application or interview process, to perform the essential functions of the position, and/or to receive all other benefits and privileges of employment, please contact HR at ADA at bugcrowd.com.
Apply at: https://www.bugcrowd.com/about/careers/

About Bugcrowd

Sourced by ZipRecruiter

Industry

Network security

Company size

501 - 1,000 Employees

Headquarters location

San Francisco, CA, US

Year founded

2012

Website

bugcrowd.com

Social media

View All Bugcrowd Jobs

Apply

Remote Reinforcement Learning Jobs (NOW HIRING)

Reinforcement Learning Engineer (Cybersecurity)

Reinforcement Learning Engineer (Cybersecurity)

ML Scientist (Pricing Reinforcement Learning) | REMOTE |

ML Scientist (Pricing Reinforcement Learning) | REMOTE |

Research Intern - Applied Reinforcement Learning

Research Intern - Applied Reinforcement Learning

Applied Reinforcement Learning Engineer

Applied Reinforcement Learning Engineer

Senior Machine Learning Engineer - Reinforcement Learning

Senior Machine Learning Engineer - Reinforcement Learning

Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning

Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning

Staff Scientist - Post-Training and Reinforcement Learning for AI for Science

Staff Scientist - Post-Training and Reinforcement Learning for AI for Science

Artificial Intelligence (AI) Engineer / Developer (Remote)

Artificial Intelligence (AI) Engineer / Developer (Remote)

Artificial Intelligence (AI) Engineer / Developer (Remote)

Artificial Intelligence (AI) Engineer / Developer (Remote)

Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning Aut

Research Lead / Principal Scientist & Manager Post-Training - Alignment - Reinforcement Learning Aut

RL Environment Software Engineer

RL Environment Software Engineer

Principal Applied Machine Learning Scientist

Principal Applied Machine Learning Scientist

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Senior Machine Learning Engineer, Data Mining

Remote Reinforcement Learning information

See salary details

How much do remote reinforcement learning jobs pay per year?

What is a Remote Reinforcement Learning job?

What are the key skills and qualifications needed to thrive as a Remote Reinforcement Learning Engineer, and why are they important?

What is the difference between Remote Reinforcement Learning vs Remote Machine Learning Engineer?

What are common challenges faced when working remotely in a Reinforcement Learning role and how can they be addressed?

Reinforcement Learning Engineer (Cybersecurity)

Share this job

Job description

About Bugcrowd

Industry

Company size

Headquarters location

Year founded

Website

Social media

Share this job