From d5e8b33ad1efb3df7ddece5314efba12af7325f4 Mon Sep 17 00:00:00 2001 From: Gaiejj <524339208@qq.com> Date: Sun, 20 Aug 2023 01:47:46 +0800 Subject: [PATCH] docs: fix typo --- docs/source/usage/implement.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/usage/implement.rst b/docs/source/usage/implement.rst index af79ef8..ba1ebb1 100644 --- a/docs/source/usage/implement.rst +++ b/docs/source/usage/implement.rst @@ -36,7 +36,7 @@ You can customize new alforithms based on it. Next we will provide a detailed example to show how to customize the ``PPO`` algorithm to ``PPO-Lag`` algorithm. Example: PPO-Lag ---------------- +---------------- The Lagrangian multiplier is a useful tool to control the constraint violation in the Safe RL algorithms. Classic RL algorithms combined with the Lagrangian multiplier are exellent baselines for Safe RL algorithms.