Reinforcement Learning Toolbox

 

Reinforcement Learning Toolbox

Design and train policies using reinforcement learning

Get Started:

Reinforcement Learning Agents

Create and configure reinforcement learning agents to train policies in MATLAB and Simulink. Use built-in or develop custom reinforcement learning algorithms.

Reinforcement Learning Algorithms

Create agents using deep Q-network (DQN), deep deterministic policy gradient (DDPG), proximal policy optimization (PPO), and other built-in algorithms. Use templates to develop custom agents for training policies.

Various training algorithms, including SARSA, SAC, DDPG, and others.

Training algorithms available in Reinforcement Learning Toolbox.

Reinforcement Learning Designer App

Interactively design, train, and simulate reinforcement learning agents. Export trained agents to MATLAB for further use and deployment.

Policy and Value Function Representation Using Deep Neural Networks

For complex systems with large state-action spaces, define deep neural network policies programmatically, using layers from Deep Learning Toolbox, or interactively, with Deep Network Designer. Alternatively, use the default network architecture suggested by the toolbox. Initialize the policy using imitation learning to accelerate training. Import and export ONNX models for interoperability with other deep learning frameworks.

Single- and Multi-Agent Reinforcement Learning in Simulink

Create and train reinforcement learning agents in Simulink with the RL Agent block. Train multiple agents simultaneously (multi-agent reinforcement learning) in Simulink using multiple instances of the RL Agent block.

Simulink model with an RL Agent block.

The reinforcement learning agent block for Simulink.

Environment Modeling

Create MATLAB and Simulink environment models. Describe system dynamics and provide observation and reward signals for training agents.

Simulink and Simscape Environments

Use Simulink and Simscape™ to create a model of an environment. Specify the observation, action, and reward signals within the model.

Simulink environment model for a biped robot.

Simulink environment model for a biped robot.

MATLAB Environments

Use MATLAB functions and classes to model an environment. Specify observation, action, and reward variables within the MATLAB file.

MATLAB environment for a 3-dof rocket.

MATLAB environment for a three-degrees-of-freedom rocket.

Accelerating Training

Speed up training using GPU, cloud, and distributed computing resources.

Block diagram of multiple workers used to speed up training by running multiple simulations in parallel.

Speeding up training using parallel computing.

GPU Acceleration

Speed up deep neural network training and inference with high-performance NVIDIA® GPUs. Use MATLAB with Parallel Computing Toolbox and most CUDA-enabled NVIDIA GPUs that have compute capability 3.0 or higher.

GPU hardware.

Accelerate training using GPUs.

Code Generation and Deployment

Deploy trained policies to embedded devices or integrate them with a wide range of production systems.

Code Generation

Use GPU Coder™ to generate optimized CUDA code from MATLAB code representing trained policies. Use MATLAB Coder™ to generate C/C++ code to deploy policies.

GPU Coder configuration screen.

Generating CUDA code using GPU Coder.

MATLAB Compiler Support

Use MATLAB Compiler™ and MATLAB Compiler SDK™ to deploy trained policies as standalone applications, C/C++ shared libraries, Microsoft® .NET assemblies, Java® classes, and Python® packages.

MATLAB Compiler screen for creating a standalone application.

Packaging and sharing policies as standalone programs.

Reference Examples

Design controllers and decision-making algorithms for robotics, automated driving, calibration, scheduling, and other applications.

Tuning, Calibration, and Scheduling

Design reinforcement learning policies for tuning, calibration, and scheduling applications.

A water distribution system with three pumps, reservoir, and tank.

Resource allocation problem for water distribution.

Reinforcement Learning Video Series

Watch the videos in this series to learn more about reinforcement learning.