R0237547 Reinforcement Learning AI Engineer The Opportunity : Booz Allen is seeking an innovative ... Apply RL techniques such as policy optimization, value-based learning, model-based RL, and ...
R0237547 Reinforcement Learning AI Engineer The Opportunity : Booz Allen is seeking an innovative ... Apply RL techniques such as policy optimization, value-based learning, model-based RL, and ...
Drive performance improvements across our stack through profiling, optimization, and benchmarking ... Experience with reinforcement learning techniques and environments * Experience with virtualization ...
Drive performance improvements across our stack through profiling, optimization, and benchmarking ... Experience with reinforcement learning techniques and environments * Experience with virtualization ...
Research Intern - Reinforcement Learning (RL) - Onsite
Bodega Bay, CA · On-site
$17.75 - $23.75/hr
Design and build reinforcement learning environments that model real-world customer interaction ... Strong foundation in probability, math, and optimization * Passion for building real-world AI ...
Research Intern - Reinforcement Learning (RL) - Onsite
Bodega Bay, CA · On-site
$17.75 - $23.75/hr
Design and build reinforcement learning environments that model real-world customer interaction ... Strong foundation in probability, math, and optimization * Passion for building real-world AI ...
... optimization challenges AI Agents for Pricing Build AIdriven pricing agents that incorporate ... in reinforcement learning recommendation systems pricing algorithms pattern recognition or ...
... optimization challenges AI Agents for Pricing Build AIdriven pricing agents that incorporate ... in reinforcement learning recommendation systems pricing algorithms pattern recognition or ...
Research Engineer, Machine Learning (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Drive performance improvements across our stack through profiling, optimization, and benchmarking ... Experience with reinforcement learning techniques and environments * Experience with virtualization ...
Research Engineer, Machine Learning (Reinforcement Learning)
San Francisco, CA · On-site
$500K - $850K/yr
Drive performance improvements across our stack through profiling, optimization, and benchmarking ... Experience with reinforcement learning techniques and environments * Experience with virtualization ...
Strong understanding of RL fundamentals (e.g., policy optimization, value methods, model-based RL ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Strong understanding of RL fundamentals (e.g., policy optimization, value methods, model-based RL ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Strong understanding of RL fundamentals (e.g., policy optimization, value methods, model-based RL ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Strong understanding of RL fundamentals (e.g., policy optimization, value methods, model-based RL ... Publication record in reinforcement learning, machine learning, or robotics The pay offered for ...
Reinforcement Learning Expertise - Develop and apply RL techniques including Contextual Bandits Qlearning SARSA and concepts like Thompson Sampling and Bayesian Optimization to solve pricing and ...
Reinforcement Learning Expertise - Develop and apply RL techniques including Contextual Bandits Qlearning SARSA and concepts like Thompson Sampling and Bayesian Optimization to solve pricing and ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
$142K - $188K/yr
Reinforcement learning is at the heart of how the most capable AI systems are built today ... From aligning foundation models to optimizing complex multi-step reasoning and decision-making ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
$142K - $188K/yr
Reinforcement learning is at the heart of how the most capable AI systems are built today ... From aligning foundation models to optimizing complex multi-step reasoning and decision-making ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
Bellevue, WA · On-site
$142K - $188K/yr
Reinforcement learning is at the heart of how the most capable AI systems are built today ... From aligning foundation models to optimizing complex multi-step reasoning and decision-making ...
Senior PMT ES - Reinforcement Learning, SageMaker AI
Bellevue, WA · On-site
$142K - $188K/yr
Reinforcement learning is at the heart of how the most capable AI systems are built today ... From aligning foundation models to optimizing complex multi-step reasoning and decision-making ...
... optimization challenges AI Agents for Pricing Build AIdriven pricing agents that incorporate ... in reinforcement learning recommendation systems pricing algorithms pattern recognition or ...
... optimization challenges AI Agents for Pricing Build AIdriven pricing agents that incorporate ... in reinforcement learning recommendation systems pricing algorithms pattern recognition or ...
Senior Reinforcement Learning Engineer
Austin, TX · On-site
$103K - $142K/yr
Experience building or utilizing large-scale, distributed training pipelines and a strong intuition for their optimization. * A strong theoretical understanding of modern reinforcement learning ...
Senior Reinforcement Learning Engineer
Austin, TX · On-site
$103K - $142K/yr
Experience building or utilizing large-scale, distributed training pipelines and a strong intuition for their optimization. * A strong theoretical understanding of modern reinforcement learning ...
About Job PhD Research Intern - Applied Reinforcement Learning Centific AI Research Role Summary ... FastAPI, gRPC); optimization (ONNX, TensorRT) Logistics Location: Palo Alto, CA (Preferred ...
About Job PhD Research Intern - Applied Reinforcement Learning Centific AI Research Role Summary ... FastAPI, gRPC); optimization (ONNX, TensorRT) Logistics Location: Palo Alto, CA (Preferred ...
Research Scientist, Reinforcement Learning
New York, NY · On-site
$120K - $180K/yr
Strong background in reinforcement learning, planning, MDPs, optimal control, and sequential decision making. * Experience in developing AI systems that combine neural and symbolic methods is highly ...
Research Scientist, Reinforcement Learning
New York, NY · On-site
$120K - $180K/yr
Strong background in reinforcement learning, planning, MDPs, optimal control, and sequential decision making. * Experience in developing AI systems that combine neural and symbolic methods is highly ...
Responsibilities : • Design, implement, and train reinforcement learning algorithms for ... optimization, and robotics fundamentals (kinematics, dynamics). • Excellent Python and C ...
Responsibilities : • Design, implement, and train reinforcement learning algorithms for ... optimization, and robotics fundamentals (kinematics, dynamics). • Excellent Python and C ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
$17.50 - $23.50/hr
Experience with RLHF, RLAIF, policy optimization, reward modeling, or agentic LLM systems * Strong ... reinforcement learning to make language models more capable, reliable, and useful, this team could ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
$17.50 - $23.50/hr
Experience with RLHF, RLAIF, policy optimization, reward modeling, or agentic LLM systems * Strong ... reinforcement learning to make language models more capable, reliable, and useful, this team could ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
Santa Clara, CA · On-site
$17.50 - $23.50/hr
Experience with RLHF, RLAIF, policy optimization, reward modeling, or agentic LLM systems * Strong ... reinforcement learning to make language models more capable, reliable, and useful, this team could ...
Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026
Santa Clara, CA · On-site
$17.50 - $23.50/hr
Experience with RLHF, RLAIF, policy optimization, reward modeling, or agentic LLM systems * Strong ... reinforcement learning to make language models more capable, reliable, and useful, this team could ...
Applied Reinforcement Learning Engineer Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote ... DPO, IPO, KTO, offline preference optimization • Group-based methods: GRPO, RLOO, sample ...
Applied Reinforcement Learning Engineer Location: Palo Alto, CA or Seattle, WA (Hybrid/Remote ... DPO, IPO, KTO, offline preference optimization • Group-based methods: GRPO, RLOO, sample ...
From experiment tracking and model optimization to high-performance training clusters, agent ... reinforcement learning or PhD + 2 years experience * Strong programming skills in Python and ...
From experiment tracking and model optimization to high-performance training clusters, agent ... reinforcement learning or PhD + 2 years experience * Strong programming skills in Python and ...
Research Intern - Applied Reinforcement Learning
$35 - $45/hr
About Job PhD Research Intern - Applied Reinforcement Learning Centific AI Research Role Summary ... FastAPI, gRPC); optimization (ONNX, TensorRT) Logistics Location: Palo Alto, CA (Preferred ...
Research Intern - Applied Reinforcement Learning
$35 - $45/hr
About Job PhD Research Intern - Applied Reinforcement Learning Centific AI Research Role Summary ... FastAPI, gRPC); optimization (ONNX, TensorRT) Logistics Location: Palo Alto, CA (Preferred ...
Reinforcement Learning Optimization information
See salary details
$21.8K is the 25th percentile. Wages below this are outliers.
$11K - $22.7K
27% of jobs
$22.7K - $34.5K
0% of jobs
$34.5K - $46.2K
0% of jobs
$46.2K - $57.9K
0% of jobs
$57.9K - $69.6K
0% of jobs
The median wage is $80.4K / yr.
$69.6K - $81.4K
25% of jobs
$81.4K - $93.1K
18% of jobs
$101.5K is the 75th percentile. Wages above this are outliers.
$93.1K - $104.8K
7% of jobs
$104.8K - $116.5K
2% of jobs
$116.5K - $128.3K
0% of jobs
$128.3K - $140K
21% of jobs
$11K
$83.9K
$140K
How much do reinforcement learning optimization jobs pay per year?
What are some common challenges faced by professionals in Reinforcement Learning Optimization roles, and how can they be addressed?
What are the key skills and qualifications needed to thrive as a Reinforcement Learning Optimization Specialist, and why are they important?
What is Reinforcement Learning Optimization?
$99K - $225K/yr
Other
Medical, Life, Retirement, PTO
Posted 27 days ago
Booz Allen Hamilton rating
8.8
Based on 47 frontline employees who took The Breakroom Quiz
9th of 57 rated business consultants
Job description
* Develop scalable training pipelines using Python and modern ML frameworks.
* Build and evaluate agents in simulated environments using Gym or PettingZoo, high-fidelity simulators, or custom environments.
* Apply RL techniques such as policy optimization, value-based learning, model-based RL, and imitation learning.
* Collaborate with domain experts to define re war d structures, constraints, and evaluation met rics aligned with mission objectives.
* Implement distributed training workflows leveraging cloud compute, containerization, and orchestration technologies.
* Transition trained models into production systems, following strong sof t war e engineering best practices.
* Contribute to system architecture and performance optimization in Python with opportunities to extend into C++ or Rus t for high-performance components. Join us. The world can't wait. You Have: * Experience developing and training reinforcement learning agents
* Experience with Gym or PettingZoo interfaces
* Experience with ML frameworks such as PyTorch, TensorFlow, or JAX
* Experience with artifi cia l intelligence, data science, machine learning engineering, or sof t war e engineering
* Experience developing technical solutions using Python, C++, or Rus t
* Knowledge of reinforcement learning and artifi cia l neural networks
* Secret clearance
* Bachelor's degree in a Computer Science, Artifi cia l Intelligence, or Engineering field Nice If You Have: * Experience applying RL to autonomy, control systems, or mission-scale
* Experience with Multi-Agent Reinforcement Learning ( MARL )
* Experience with AFSIM or other high-fidelity simulation environments
* Experience with embedded systems programming in C, C++, or Rus t
* Experience in GPU programming, including CUDA or RAPID
* Experience developing in-space solutions
* Knowledge of modern sof t war e design patterns, including microservice design and orchestration in Kubernetes deployment
* Master's degree in Computer Science, Artifi cia l Intelligence, Engineering, or a related field Clearance: Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information; Secret clearance is required. Compensation At Booz Allen, we celebrate your contributions, provide you with opportunities and choices, and support your total well-being. Our offerings include health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care. Our recognition awards program acknowledges employees for exceptional performance and superior demonstration of our values. Full-time and part-time employees working at least 20 hours a week on a regular basis are eligible to participate in Booz Allen's benefit programs. Individuals that do not meet the threshold are only eligible for select offerings, not inclusive of health benefits. We encourage you to learn more about our total benefits by visiting the Resource page on our Careers site and reviewing Our Employee Benefits page. Salary at Booz Allen is determined by various factors, including but not limited to location, the individual's particular combination of education, knowledge, skills, competencies, and experience, as well as contract-specific affordability and organizational requirements. The projected compensation range for this position is $99,000.00 to $225,000.00 (annualized USD). The estimate displayed represents the typical salary range for this position and is just one component of Booz Allen's total compensation package for employees. This posting will close within 90 days from the Posting Date. Identity Statement As part of the hiring process, we will ask you to complete an identity verification process that leverages advanced biometrics and artificial intelligence to ensure authenticity and protect against identity fraud. You are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud. Candidate AI Usage Policy AI is a part of our daily work at Booz Allen, and we are committed to the responsible and ethical use of AI tools. However, we want to ensure a fair candidate process based on your own skills and knowledge. As part of this commitment, the use of artificial intelligence (AI) or other tools to assist with responses during interviews (whether in-person or virtual) is prohibited unless permission is explicitly provided. Work Model Our people-first culture prioritizes the benefits of collaboration, whether it occurs in person or virtually. To support engagement and effective communication, employees working virtually are generally expected to have their cameras on during meetings. * Remote: If this position is listed as remote, there may still be occasions when you are required to work in person at a Booz Allen or customer facility.
* Hybrid: If this position is listed as hybrid, you will be expected to work from a Booz Allen facility frequently, in alignment with leadership expectations and the needs of the role. You may also be required to work from or visit a customer facility.
* Onsite: If this position is listed as onsite, work will primarily be performed at a Booz Allen office or customer facility, where employees will collaborate directly with colleagues and customers as required by the role. Commitment to Non-Discrimination All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.
What Booz Allen Hamilton employees say
Pay
Benefits
Hours and flexibility
Workplace
Get the full story on Breakroom
About Booz Allen Hamilton
Sourced by ZipRecruiter
Booz Allen Hamilton is a leading provider of management and technology consulting services to the US government in defense, intelligence, and civil markets. Headquartered in McLean, Virginia, the firm also serves major corporations, institutions, and not-for-profit organizations. Founded in 1914 by Edwin G. Booz, the company has a long-standing tradition of helping clients achieve success by delivering a wide range of consulting services that include strategic planning, human capital and learning, communication, systems development, and others. The company's mission is to empower people to change the world, and it has a reputation for maintaining the highest standards of integrity and-excellence.
Industry
It services
Company size
10,000+ Employees
Headquarters location
McLean, VA, US
Year founded
1914