From 51818709f1bef340f1abc67ce3f6d096c75338c3 Mon Sep 17 00:00:00 2001 From: Jiayi Zhou <108712610+Gaiejj@users.noreply.github.com> Date: Tue, 3 Sep 2024 23:47:50 +0800 Subject: [PATCH] docs: update appendix (#350) --- .github/workflows/lint.yml | 4 - docs/source/benchmark/case-study.md | 54 + docs/source/benchmark/modelbased.md | 223 ++ docs/source/benchmark/off-policy.md | 938 ++++++ docs/source/benchmark/offline.md | 275 ++ docs/source/benchmark/on-policy.md | 2841 +++++++++++++++++ docs/source/index.rst | 15 + docs/source/spelling_wordlist.txt | 11 + docs/source/start/algo.md | 111 + docs/source/start/efficiency.rst | 60 + docs/source/start/exp-grid.md | 31 + docs/source/start/features.md | 257 ++ omnisafe/adapter/modelbased_adapter.py | 2 +- omnisafe/common/logger.py | 4 +- omnisafe/common/offline/data_collector.py | 2 +- .../envs/classic_control/envs_from_crabs.py | 2 +- omnisafe/envs/safety_gymnasium_modelbased.py | 9 +- omnisafe/evaluator.py | 2 +- omnisafe/utils/plotter.py | 3 +- pyproject.toml | 20 +- 20 files changed, 4842 insertions(+), 22 deletions(-) create mode 100644 docs/source/benchmark/case-study.md create mode 100644 docs/source/benchmark/modelbased.md create mode 100644 docs/source/benchmark/off-policy.md create mode 100644 docs/source/benchmark/offline.md create mode 100644 docs/source/benchmark/on-policy.md create mode 100644 docs/source/start/algo.md create mode 100644 docs/source/start/efficiency.rst create mode 100644 docs/source/start/exp-grid.md create mode 100644 docs/source/start/features.md diff --git a/.github/workflows/lint.yml b/.github/workflows/lint.yml index 80db45035..e6f178cb0 100644 --- a/.github/workflows/lint.yml +++ b/.github/workflows/lint.yml @@ -45,10 +45,6 @@ jobs: run: | make pre-commit - - name: ruff - run: | - make ruff - - name: flake8 run: | make flake8 diff --git a/docs/source/benchmark/case-study.md b/docs/source/benchmark/case-study.md new file mode 100644 index 000000000..28f93ba72 --- /dev/null +++ b/docs/source/benchmark/case-study.md @@ -0,0 +1,54 @@ +# Case Study + +One important motivation for SafeRL is to enable agents to explore and +learn safely. Therefore, evaluating algorithm performance concerning +*procedural constraint violations* is also important. We have selected +representative experimental results and report as shown in Figure 1 and Figure 2: + +#### Radical vs. Conservative + +*Radical* policies often explore higher rewards but violate more safety +constraints, whereas *Conservative* policies are the opposite. +Figure 1 illustrates this: during training, CPO and +PPOLag consistently pursue the highest rewards among all algorithms, as +depicted in the first row. However, as shown in the second row, they +experience significant fluctuations in constraint violations, especially +for PPOLag. So, they are relatively radical, *i.e.,* higher rewards but +higher costs. In comparison, while P3O achieves slightly lower rewards +than PPOLag, it maintains fewer oscillations in constraint violations, +making it safer in adhering to safety constraints, evident from the +smaller proportion of its distribution crossing the black dashed line. A +similar pattern is also observed when comparing PCPO with CPO. +Therefore, P3O and PCPO are relatively conservative, *i.e.,* lower costs +but lower rewards. + + +
+ +**Figure 1:** PPOLag, P3O, CPO, and PCPO training on four tasks in for 1e7 steps, showing the distribution of all episodic rewards and costs. All data covers over 5 random seeds and filters out data points over 3 standard deviations. The black dashed line in the graph represents the preset `cost_limit`. + + +#### Oscillation vs. Stability + +The oscillations in the degree of constraint violations during the +training process can indicate the performance of SafeRL algorithms. +These oscillations are quantified by *Extremes*, *i.e.,* the maximum +constraint violation, and *Distributions*, *i.e.,* the frequency of +violations remaining below a predefined `cost_limit`. As shown in +Figure 2, PPOLag, a popular baseline in SafeRL, +utilizes the Lagrangian multiplier for constraint handling. Despite its +simplicity and ease of implementation, PPOLag often suffers from +significant oscillations due to challenges in setting appropriate +initial values and learning rates. It consistently seeks higher rewards +but always leads to larger extremes and unsafe distributions. +Conversely, CPPOPID, which employs a PID controller for updating the +Lagrangian multiplier, markedly reduces these extremes. CUP implements a +two-stage projection method that constrains violations' distribution +below the `cost_limit`. Lastly, PPOSaute integrates state observations +with constraints, resulting in smaller extremes and safer distributions +of violations. + + +
+ +**Figure 2:** PPOLag, CPPOPID, CUP, and PPOSaute trained on four tasks in for all 1e7 steps, showing the distribution of all episodic rewards and costs. All data covers over 5 random seeds and filters out data points over 3 standard deviations. The black dashed line in the graph represents the preset `cost_limit`. diff --git a/docs/source/benchmark/modelbased.md b/docs/source/benchmark/modelbased.md new file mode 100644 index 000000000..abbc1fce2 --- /dev/null +++ b/docs/source/benchmark/modelbased.md @@ -0,0 +1,223 @@ +# Model-based Algorithms + +The OmniSafe Navigation Benchmark for model-based algorithms evaluates the effectiveness of OmniSafe's model-based algorithms across two different environments from the [Safety-Gymnasium](https://github.com/PKU-Alignment/safety-gymnasium) task suite. For each supported algorithm and environment, we offer the following: + +- Default hyperparameters used for the benchmark and scripts that enable result replication. +- Graphs and raw data that can be utilized for research purposes. +- Detailed logs obtained during training. + +Supported algorithms are listed below: + +- **[NeurIPS 2001]** [Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models (PETS))](https://arxiv.org/abs/1805.12114) +- **[CoRL 2021]** [Learning Off-Policy with Online Planning (LOOP and SafeLOOP)](https://arxiv.org/abs/2008.10066) +- **[AAAI 2022]** [Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning (CAP)](https://arxiv.org/abs/2112.07701) +- **[ICML 2022 Workshop]** [Constrained Model-based Reinforcement Learning with Robust Cross-Entropy Method (RCE)](https://arxiv.org/abs/2010.07968) +- **[NeurIPS 2018]** [Constrained Cross-Entropy Method for Safe Reinforcement Learning (CCE)](https://proceedings.neurips.cc/paper/2018/hash/34ffeb359a192eb8174b6854643cc046-Abstract.html) + +## Safety-Gymnasium + +We highly recommend using **Safety-Gymnasium** to run the following experiments. To install, in a linux machine, type: + +```bash +pip install safety_gymnasium +``` + +## Run the Benchmark + +You can set the main function of ``examples/benchmarks/experiment_grid.py`` as: + +```python +if __name__ == '__main__': + eg = ExperimentGrid(exp_name='Model-Based-Benchmarks') + + # set up the algorithms. + model_based_base_policy = ['LOOP', 'PETS'] + model_based_safe_policy = ['SafeLOOP', 'CCEPETS', 'CAPPETS', 'RCEPETS'] + eg.add('algo', model_based_base_policy + model_based_safe_policy) + + # you can use wandb to monitor the experiment. + eg.add('logger_cfgs:use_wandb', [False]) + # you can use tensorboard to monitor the experiment. + eg.add('logger_cfgs:use_tensorboard', [True]) + eg.add('train_cfgs:total_steps', [1000000]) + + # set up the environment. 
+ eg.add('env_id', [ + 'SafetyPointGoal1-v0-modelbased', + 'SafetyCarGoal1-v0-modelbased', + ]) + eg.add('seed', [0, 5, 10, 15, 20]) + + # total experiment num must can be divided by num_pool + # meanwhile, users should decide this value according to their machine + eg.run(train, num_pool=5) +``` + +After that, you can run the following command to run the benchmark: + +```bash +cd examples/benchmarks +python run_experiment_grid.py +``` + +You can set the path of ``examples/benchmarks/experiment_grid.py`` : +example: + +```python +path ='omnisafe/examples/benchmarks/exp-x/Model-Based-Benchmarks' +``` + +You can also plot the results by running the following command: + +```bash +cd examples +python analyze_experiment_results.py +``` + +**For a detailed usage of OmniSafe statistics tool, please refer to [this tutorial](https://omnisafe.readthedocs.io/en/latest/common/stastics_tool.html).** + +## OmniSafe Benchmark + +To demonstrate the high reliability of the algorithms implemented, OmniSafe offers performance insights within the Safety-Gymnasium environment. It should be noted that all data is procured under the constraint of `cost_limit=1.00`. The results are presented in Table 1 and Figure 1. + +### Performance Table + + + + + + + + + +
| Environment | PETS Reward | PETS Cost | LOOP Reward | LOOP Cost | SafeLOOP Reward | SafeLOOP Cost |
|---|---|---|---|---|---|---|
| SafetyCarGoal1-v0 | 33.07 ± 1.33 | 61.20 ± 7.23 | 25.41 ± 1.23 | 62.64 ± 8.34 | 22.09 ± 0.30 | 0.16 ± 0.15 |
| SafetyPointGoal1-v0 | 27.66 ± 0.07 | 49.16 ± 2.69 | 25.08 ± 1.47 | 55.23 ± 2.64 | 22.94 ± 0.72 | 0.04 ± 0.07 |

| Environment | CCEPETS Reward | CCEPETS Cost | RCEPETS Reward | RCEPETS Cost | CAPPETS Reward | CAPPETS Cost |
|---|---|---|---|---|---|---|
| SafetyCarGoal1-v0 | 27.60 ± 1.21 | 1.03 ± 0.29 | 29.08 ± 1.63 | 1.02 ± 0.88 | 23.33 ± 6.34 | 0.48 ± 0.17 |
| SafetyPointGoal1-v0 | 24.98 ± 0.05 | 1.87 ± 1.27 | 25.39 ± 0.28 | 2.46 ± 0.58 | 9.45 ± 8.62 | 0.64 ± 0.77 |

**Table 1:** Reward and cost of OmniSafe's model-based algorithms in the Safety-Gymnasium environments. All model-based algorithms were evaluated after 1e6 training steps.
### Performance Curves

Training-curve plots are provided for SafetyCarGoal1-v0 and SafetyPointGoal1-v0.

**Figure 1:** Training curves in Safety-Gymnasium environments, covering the base and safe model-based algorithms mentioned in Table 1.
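The safe model-based algorithms above (SafeLOOP, CCEPETS, RCEPETS, CAPPETS) all plan through a learned dynamics model while filtering or penalizing action sequences whose predicted cost exceeds the budget. The snippet below is a minimal, self-contained sketch of constrained cross-entropy planning; the `rollout` stand-in and all hyperparameters are illustrative assumptions, not OmniSafe's implementation.

```python
import numpy as np


def rollout(model, obs, action_seq):
    """Stand-in for a learned dynamics/cost model: returns (total_reward, total_cost).

    A real planner would unroll a probabilistic ensemble here; this placeholder
    keeps the sketch self-contained and runnable.
    """
    rng = np.random.default_rng(abs(hash(action_seq.tobytes())) % (2**32))
    reward = rng.normal(action_seq.sum(), 1.0)
    cost = abs(rng.normal(0.5 * np.abs(action_seq).sum(), 1.0))
    return reward, cost


def constrained_cem_plan(model, obs, horizon=5, act_dim=2, pop=64, elites=8,
                         iters=4, cost_limit=1.0):
    """Constrained cross-entropy planning: prefer feasible candidates, refit the
    sampling distribution to the elites, and return the first planned action."""
    mean = np.zeros((horizon, act_dim))
    std = np.ones((horizon, act_dim))
    for _ in range(iters):
        samples = mean + std * np.random.randn(pop, horizon, act_dim)
        scores = np.array([rollout(model, obs, a) for a in samples])  # shape (pop, 2)
        rewards, costs = scores[:, 0], scores[:, 1]
        feasible = costs <= cost_limit
        if feasible.sum() >= elites:
            # rank feasible candidates by predicted reward
            idx = np.flatnonzero(feasible)[np.argsort(-rewards[feasible])[:elites]]
        else:
            # not enough feasible candidates: fall back to the lowest-cost ones
            idx = np.argsort(costs)[:elites]
        elite = samples[idx]
        mean, std = elite.mean(axis=0), elite.std(axis=0) + 1e-6
    return mean[0]  # first action of the planned sequence


if __name__ == '__main__':
    print(constrained_cem_plan(model=None, obs=None))
```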

diff --git a/docs/source/benchmark/off-policy.md b/docs/source/benchmark/off-policy.md new file mode 100644 index 000000000..077f1a3d5 --- /dev/null +++ b/docs/source/benchmark/off-policy.md @@ -0,0 +1,938 @@ +# Off-Policy Algorithms + +The OmniSafe Safety-Gymnasium Benchmark for off-policy algorithms evaluates the effectiveness of OmniSafe's off-policy algorithms across multiple environments from the [Safety-Gymnasium](https://github.com/PKU-Alignment/safety-gymnasium) task suite. For each supported algorithm and environment, we offer the following: + +- Default hyperparameters used for the benchmark and scripts that enable result replication. +- Performance comparison with other open-source implementations. +- Graphs and raw data that can be utilized for research purposes. +- Detailed logs obtained during training. + +Supported algorithms are listed below: + +- **[ICLR 2016]** [Deep Deterministic Policy Gradient (DDPG)](https://arxiv.org/pdf/1509.02971.pdf) +- **[ICML 2018]** [Twin Delayed DDPG (TD3)](https://arxiv.org/pdf/1802.09477.pdf) +- **[ICML 2018]** [Soft Actor-Critic (SAC)](https://arxiv.org/pdf/1812.05905.pdf) +- **[Preprint 2019][[1]](#footnote1)** [The Lagrangian version of DDPG (DDPGLag)](https://cdn.openai.com/safexp-short.pdf) +- **[Preprint 2019][[1]](#footnote1)** [The Lagrangian version of TD3 (TD3Lag)](https://cdn.openai.com/safexp-short.pdf) +- **[Preprint 2019][[1]](#footnote1)** [The Lagrangian version of SAC (SACLag)](https://cdn.openai.com/safexp-short.pdf) +- **[ICML 2020]** [Responsive Safety in Reinforcement Learning by PID Lagrangian Methods (DDPGPID)](https://arxiv.org/abs/2007.03964) +- **[ICML 2020]** [Responsive Safety in Reinforcement Learning by PID Lagrangian Methods (TD3PID)](https://arxiv.org/abs/2007.03964) +- **[ICML 2020]** [Responsive Safety in Reinforcement Learning by PID Lagrangian Methods (SACPID)](https://arxiv.org/abs/2007.03964) + +## Safety-Gymnasium + +We highly recommend using **Safety-Gymnasium** to run the following experiments. To install, in a linux machine, type: + +```bash +pip install safety_gymnasium +``` + +## Run the Benchmark +You can set the main function of `examples/benchmarks/experiment_grid.py` as: + +```python +if __name__ == '__main__': + eg = ExperimentGrid(exp_name='Off-Policy-Benchmarks') + + # set up the algorithms. + off_policy = ['DDPG', 'SAC', 'TD3', 'DDPGLag', 'TD3Lag', 'SACLag', 'DDPGPID', 'TD3PID', 'SACPID'] + eg.add('algo', off_policy) + + # you can use wandb to monitor the experiment. + eg.add('logger_cfgs:use_wandb', [False]) + # you can use tensorboard to monitor the experiment. + eg.add('logger_cfgs:use_tensorboard', [True]) + + # the default configs here are as follows: + # eg.add('algo_cfgs:steps_per_epoch', [2000]) + # eg.add('train_cfgs:total_steps', [2000 * 500]) + # which can reproduce results of 1e6 steps. + + # if you want to reproduce results of 3e6 steps, using + # eg.add('algo_cfgs:steps_per_epoch', [2000]) + # eg.add('train_cfgs:total_steps', [2000 * 1500]) + + # set the device. + avaliable_gpus = list(range(torch.cuda.device_count())) + gpu_id = [0, 1, 2, 3] + # if you want to use CPU, please set gpu_id = None + # gpu_id = None + + if gpu_id and not set(gpu_id).issubset(avaliable_gpus): + warnings.warn('The GPU ID is not available, use CPU instead.', stacklevel=1) + gpu_id = None + + # set up the environments. 
+ eg.add('env_id', [ + 'SafetyHopper', + 'SafetyWalker2d', + 'SafetySwimmer', + 'SafetyAnt', + 'SafetyHalfCheetah', + 'SafetyHumanoid' + ]) + eg.add('seed', [0, 5, 10, 15, 20]) + eg.run(train, num_pool=5, gpu_id=gpu_id) +``` + +After that, you can run the following command to run the benchmark: + +```bash +cd examples/benchmarks +python run_experiment_grid.py +``` + +You can also plot the results by running the following command: + +```bash +cd examples +python analyze_experiment_results.py +``` + +**For a detailed usage of OmniSafe statistics tool, please refer to [this tutorial](https://omnisafe.readthedocs.io/en/latest/common/stastics_tool.html).** + +Logs are saved in `examples/benchmarks/exp-x` and can be monitored with tensorboard or wandb. + +```bash +tensorboard --logdir examples/benchmarks/exp-x +``` + +After the experiment is finished, you can use the following command to generate the video of the trained agent: + +```bash +cd examples +python evaluate_saved_policy.py +``` +Please note that before you evaluate, please set the `LOG_DIR` in `evaluate_saved_policy.py`. + +For example, if I train `DDPG` in `SafetyHumanoid` + +```python +LOG_DIR = '~/omnisafe/examples/runs/DDPG-/seed-000' +play = True +save_replay = True +if __name__ == '__main__': + evaluator = omnisafe.Evaluator(play=play, save_replay=save_replay) + for item in os.scandir(os.path.join(LOG_DIR, 'torch_save')): + if item.is_file() and item.name.split('.')[-1] == 'pt': + evaluator.load_saved( + save_dir=LOG_DIR, model_name=item.name, camera_name='track', width=256, height=256 + ) + evaluator.render(num_episodes=1) + evaluator.evaluate(num_episodes=1) +``` + +## OmniSafe Benchmark + +### Classic Reinforcement Learning Algorithms + +In an effort to ascertain the credibility of OmniSafe’s algorithmic implementation, a comparative assessment was conducted, juxtaposing the performance of classical reinforcement +learning algorithms, such as DDPG, TD3 and SAC. The performance table is provided in Table 1, with +well-established open-source implementations, specifically [Tianshou](https://github.com/thu-ml/tianshou) and +[Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3). + + + + + + + + + + +
| Environment | DDPG (OmniSafe) | DDPG (Tianshou) | DDPG (SB3) | TD3 (OmniSafe) | TD3 (Tianshou) | TD3 (SB3) | SAC (OmniSafe) | SAC (Tianshou) | SAC (SB3) |
|---|---|---|---|---|---|---|---|---|---|
| SafetyAntVelocity-v1 | 860.86 ± 198.03 | 308.60 ± 318.60 | 2654.58 ± 1738.21 | 5246.86 ± 580.50 | 5379.55 ± 224.69 | 3079.45 ± 1456.81 | 5456.31 ± 156.04 | 6012.30 ± 102.64 | 2404.50 ± 1152.65 |
| SafetyHalfCheetahVelocity-v1 | 11377.10 ± 75.29 | 12493.55 ± 437.54 | 7796.63 ± 3541.64 | 11246.12 ± 488.62 | 10246.77 ± 908.39 | 8631.27 ± 2869.15 | 11488.86 ± 513.09 | 12083.89 ± 564.51 | 7767.74 ± 3159.07 |
| SafetyHopperVelocity-v1 | 1462.56 ± 591.14 | 2018.97 ± 1045.20 | 2214.06 ± 1219.57 | 3404.41 ± 82.57 | 2682.53 ± 1004.84 | 2542.67 ± 1253.33 | 3597.70 ± 32.23 | 3546.59 ± 76.00 | 2158.54 ± 1343.24 |
| SafetyHumanoidVelocity-v1 | 1537.39 ± 335.62 | 124.96 ± 61.68 | 2276.92 ± 2299.68 | 5798.01 ± 160.72 | 3838.06 ± 1832.90 | 3511.06 ± 2214.12 | 6039.77 ± 167.82 | 5424.55 ± 118.52 | 2713.60 ± 2256.89 |
| SafetySwimmerVelocity-v1 | 139.39 ± 11.74 | 138.98 ± 8.60 | 210.40 ± 148.01 | 98.39 ± 32.28 | 94.43 ± 9.63 | 247.09 ± 131.69 | 46.44 ± 1.23 | 44.34 ± 2.01 | 247.33 ± 122.02 |
| SafetyWalker2dVelocity-v1 | 1911.70 ± 395.97 | 543.23 ± 316.10 | 3917.46 ± 1077.38 | 3034.83 ± 1374.72 | 4267.05 ± 678.65 | 4087.94 ± 755.10 | 4419.29 ± 232.06 | 4619.34 ± 274.43 | 3906.78 ± 795.48 |

**Table 1:** Performance of OmniSafe evaluated against published baselines, Tianshou and Stable-Baselines3 (SB3), in the Safety-Gymnasium environments. Each entry reports the mean and standard deviation over 10 evaluation episodes across multiple random seeds. Note that Stable-Baselines3 uses hyperparameters tuned per environment, whereas OmniSafe keeps a single parameter set across all environments.
+ +### Safe Reinforcement Learning Algorithms + +To demonstrate the high reliability of the algorithms implemented, OmniSafe offers performance insights within the Safety-Gymnasium environment. It should be noted that all data is procured under the constraint of `cost_limit=25.00`. The results are presented in Table 2, Figure 1, Figure 2, Figure 3. + +#### Performance Table + +
**DDPG, TD3, and SAC**

| Environment | DDPG Reward | DDPG Cost | TD3 Reward | TD3 Cost | SAC Reward | SAC Cost |
|---|---|---|---|---|---|---|
| SafetyAntVelocity-v1 | 860.86 ± 198.03 | 234.80 ± 40.63 | 5246.86 ± 580.50 | 912.90 ± 93.73 | 5456.31 ± 156.04 | 943.10 ± 47.51 |
| SafetyHalfCheetahVelocity-v1 | 11377.10 ± 75.29 | 980.93 ± 1.05 | 11246.12 ± 488.62 | 981.27 ± 0.31 | 11488.86 ± 513.09 | 981.93 ± 0.33 |
| SafetyHopperVelocity-v1 | 1462.56 ± 591.14 | 429.17 ± 220.05 | 3404.41 ± 82.57 | 973.80 ± 4.92 | 3537.70 ± 32.23 | 975.23 ± 2.39 |
| SafetyHumanoidVelocity-v1 | 1537.39 ± 335.62 | 48.79 ± 13.06 | 5798.01 ± 160.72 | 255.43 ± 437.13 | 6039.77 ± 167.82 | 41.42 ± 49.78 |
| SafetySwimmerVelocity-v1 | 139.39 ± 11.74 | 200.53 ± 43.28 | 98.39 ± 32.28 | 115.27 ± 44.90 | 46.44 ± 1.23 | 40.97 ± 0.47 |
| SafetyWalker2dVelocity-v1 | 1911.70 ± 395.97 | 318.10 ± 71.03 | 3034.83 ± 1374.72 | 606.47 ± 337.33 | 4419.29 ± 232.06 | 877.70 ± 8.95 |
| SafetyCarCircle1-v0 | 44.64 ± 2.15 | 371.93 ± 38.75 | 44.57 ± 2.71 | 383.37 ± 62.03 | 43.46 ± 4.39 | 406.87 ± 78.78 |
| SafetyCarGoal1-v0 | 36.99 ± 1.66 | 57.13 ± 38.40 | 36.26 ± 2.35 | 69.70 ± 52.18 | 35.71 ± 2.24 | 54.73 ± 46.74 |
| SafetyPointCircle1-v0 | 113.67 ± 1.33 | 421.53 ± 142.66 | 115.15 ± 2.24 | 391.07 ± 38.34 | 115.06 ± 2.04 | 403.43 ± 44.78 |
| SafetyPointGoal1-v0 | 25.55 ± 2.62 | 41.60 ± 37.17 | 27.28 ± 1.21 | 51.43 ± 33.05 | 27.04 ± 1.49 | 67.57 ± 32.13 |

**DDPGLag, TD3Lag, and SACLag**

| Environment | DDPGLag Reward | DDPGLag Cost | TD3Lag Reward | TD3Lag Cost | SACLag Reward | SACLag Cost |
|---|---|---|---|---|---|---|
| SafetyAntVelocity-v1 | 1271.48 ± 581.71 | 33.27 ± 13.34 | 1944.38 ± 759.20 | 63.27 ± 46.89 | 1897.32 ± 1213.74 | 5.73 ± 7.83 |
| SafetyHalfCheetahVelocity-v1 | 2743.06 ± 21.77 | 0.33 ± 0.12 | 2741.08 ± 49.13 | 10.47 ± 14.45 | 2833.72 ± 3.62 | 0.00 ± 0.00 |
| SafetyHopperVelocity-v1 | 1093.25 ± 81.55 | 15.00 ± 21.21 | 928.79 ± 389.48 | 40.67 ± 30.99 | 963.49 ± 291.64 | 20.23 ± 28.47 |
| SafetyHumanoidVelocity-v1 | 2059.96 ± 485.68 | 19.71 ± 4.05 | 5751.99 ± 157.28 | 10.71 ± 23.60 | 5940.04 ± 121.93 | 17.59 ± 6.24 |
| SafetySwimmerVelocity-v1 | 13.18 ± 20.31 | 28.27 ± 32.27 | 15.58 ± 16.97 | 13.27 ± 17.64 | 11.03 ± 11.17 | 22.70 ± 32.10 |
| SafetyWalker2dVelocity-v1 | 2238.92 ± 400.67 | 33.43 ± 20.08 | 2996.21 ± 74.40 | 22.50 ± 16.97 | 2676.47 ± 300.43 | 30.67 ± 32.30 |
| SafetyCarCircle1-v0 | 33.29 ± 6.55 | 20.67 ± 28.48 | 34.38 ± 1.55 | 2.25 ± 3.90 | 31.42 ± 11.67 | 22.33 ± 26.16 |
| SafetyCarGoal1-v0 | 22.80 ± 8.75 | 17.33 ± 21.40 | 7.31 ± 5.34 | 33.83 ± 31.03 | 10.83 ± 11.29 | 22.67 ± 28.91 |
| SafetyPointCircle1-v0 | 70.71 ± 13.61 | 22.00 ± 32.80 | 83.07 ± 3.49 | 7.83 ± 15.79 | 83.68 ± 3.32 | 12.83 ± 19.53 |
| SafetyPointGoal1-v0 | 17.17 ± 10.03 | 20.33 ± 31.59 | 25.27 ± 2.74 | 28.00 ± 15.75 | 21.45 ± 6.97 | 19.17 ± 9.72 |

**DDPGPID, TD3PID, and SACPID**

| Environment | DDPGPID Reward | DDPGPID Cost | TD3PID Reward | TD3PID Cost | SACPID Reward | SACPID Cost |
|---|---|---|---|---|---|---|
| SafetyAntVelocity-v1 | 2078.27 ± 704.77 | 18.20 ± 7.21 | 2410.46 ± 217.00 | 44.50 ± 38.39 | 1940.55 ± 482.41 | 13.73 ± 7.24 |
| SafetyHalfCheetahVelocity-v1 | 2737.61 ± 45.93 | 36.10 ± 11.03 | 2695.64 ± 29.42 | 35.93 ± 14.03 | 2689.01 ± 15.46 | 21.43 ± 5.49 |
| SafetyHopperVelocity-v1 | 1034.42 ± 350.59 | 29.53 ± 34.54 | 1225.97 ± 224.71 | 46.87 ± 65.28 | 812.80 ± 381.86 | 92.23 ± 77.64 |
| SafetyHumanoidVelocity-v1 | 1082.36 ± 486.48 | 15.00 ± 19.51 | 6179.38 ± 105.70 | 5.60 ± 6.23 | 6107.36 ± 113.24 | 6.20 ± 10.14 |
| SafetySwimmerVelocity-v1 | 23.99 ± 7.76 | 30.70 ± 21.81 | 28.62 ± 8.48 | 22.47 ± 7.69 | 7.50 ± 10.42 | 7.77 ± 8.48 |
| SafetyWalker2dVelocity-v1 | 1378.75 ± 896.73 | 14.77 ± 13.02 | 2769.64 ± 67.23 | 6.53 ± 8.86 | 1251.87 ± 721.54 | 41.23 ± 73.33 |
| SafetyCarCircle1-v0 | 26.89 ± 11.18 | 31.83 ± 33.59 | 34.77 ± 3.24 | 47.00 ± 39.53 | 34.41 ± 7.19 | 5.00 ± 11.18 |
| SafetyCarGoal1-v0 | 19.35 ± 14.63 | 17.50 ± 21.31 | 27.28 ± 4.50 | 9.50 ± 12.15 | 16.21 ± 12.65 | 6.67 ± 14.91 |
| SafetyPointCircle1-v0 | 71.63 ± 8.39 | 0.00 ± 0.00 | 70.95 ± 6.00 | 0.00 ± 0.00 | 75.15 ± 6.65 | 4.50 ± 4.65 |
| SafetyPointGoal1-v0 | 19.85 ± 5.32 | 22.67 ± 13.73 | 18.76 ± 7.87 | 12.17 ± 9.39 | 15.87 ± 6.73 | 27.50 ± 15.25 |

**Table 2:** Performance of the OmniSafe off-policy algorithms under the experimental setting of `cost_limit=25.00`. During experimentation, we observed that the off-policy algorithms did not violate safety constraints in `SafetyHumanoidVelocity-v1` within 1e6 steps, which suggests the agent had not yet fully learned to run; consequently, the 3e6-step results are reported for `SafetyHumanoidVelocity-v1`. Likewise, in environments with strong stochasticity, such as `SafetyCarCircle1-v0`, `SafetyCarGoal1-v0`, `SafetyPointCircle1-v0`, and `SafetyPointGoal1-v0`, off-policy methods require more training steps to estimate an accurate Q-function, so these four environments were also evaluated after 3e6 training steps. For all other environments, the results after 1e6 training steps are reported.

#### Performance Curves
**DDPG, TD3, and SAC**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetySwimmerVelocity-v1, SafetyWalker2dVelocity-v1, SafetyCarCircle1-v0, SafetyCarGoal1-v0, SafetyPointCircle1-v0, SafetyPointGoal1-v0.

**Figure 1:** Training curves in Safety-Gymnasium environments for the classic reinforcement learning algorithms (DDPG, TD3, and SAC) reported in Table 1.
**DDPGLag, TD3Lag, and SACLag**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetySwimmerVelocity-v1, SafetyWalker2dVelocity-v1, SafetyCarCircle1-v0, SafetyCarGoal1-v0, SafetyPointCircle1-v0, SafetyPointGoal1-v0.

**Figure 2:** Training curves in Safety-Gymnasium environments for the Lagrangian methods (DDPGLag, TD3Lag, and SACLag) reported in Table 2.
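The Lagrangian variants above differ from their base algorithms mainly in one extra quantity: a multiplier that scales the cost critic's contribution to the actor loss and grows whenever the observed episodic cost exceeds the budget. A minimal sketch of that dual update is shown below; the learning rate and initialization are illustrative assumptions, not OmniSafe's defaults.

```python
# Minimal sketch of the Lagrangian dual update used by DDPGLag/TD3Lag/SACLag-style methods.
# `lambda_lr` and the starting multiplier are illustrative assumptions.
def update_lagrange_multiplier(lagrange: float, mean_ep_cost: float,
                               cost_limit: float = 25.0, lambda_lr: float = 0.01) -> float:
    """Gradient ascent on the dual variable: grow lambda when the constraint is
    violated, shrink it (never below zero) when there is slack."""
    lagrange += lambda_lr * (mean_ep_cost - cost_limit)
    return max(0.0, lagrange)

# The actor then minimizes  -Q_reward(s, pi(s)) + lagrange * Q_cost(s, pi(s)),
# optionally normalized by (1 + lagrange) to keep the update scale stable.
```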

+
+ +
**DDPGPID, TD3PID, and SACPID**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetySwimmerVelocity-v1, SafetyWalker2dVelocity-v1, SafetyCarCircle1-v0, SafetyCarGoal1-v0, SafetyPointCircle1-v0, SafetyPointGoal1-v0.

**Figure 3:** Training curves in Safety-Gymnasium environments for the PID-Lagrangian methods (DDPGPID, TD3PID, and SACPID) reported in Table 2.
diff --git a/docs/source/benchmark/offline.md b/docs/source/benchmark/offline.md new file mode 100644 index 000000000..58bb5cebf --- /dev/null +++ b/docs/source/benchmark/offline.md @@ -0,0 +1,275 @@ +# Offline Algorithms + +OmniSafe's Mujoco Velocity Benchmark evaluated the performance of OmniSafe's offline algorithm implementations in SafetyPointCirlce, SafetyPointCirlce from the Safety-Gymnasium task suite. For each algorithm and environment supported, we provide: + +- Default hyperparameters used for the benchmark and scripts to reproduce the results. +- A comparison of performance or code-level details with other open-source implementations or classic papers. +- Graphs and raw data that can be used for research purposes. +- Log details obtained during training. + +Supported algorithms are listed below: + +- **[ICML 2019]** [Batch-Constrained deep Q-learning(BCQ)](https://arxiv.org/pdf/1812.02900.pdf) +- [The Lagrange version of BCQ (BCQ-Lag)](https://arxiv.org/pdf/1812.02900.pdf) +- **[NeurIPS 2020]** [Critic Regularized Regression](https://proceedings.neurips.cc//paper/2020/file/588cb956d6bbe67078f29f8de420a13d-Paper.pdf) +- [The Constrained version of CRR (C-CRR)](https://proceedings.neurips.cc/paper/2020/hash/588cb956d6bbe67078f29f8de420a13d-Abstract.html) +- **[ICLR 2022 (Spotlight)]** [COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation](https://arxiv.org/abs/2204.08957?context=cs.AI) + +## Safety-Gymnasium + +We highly recommend using ``safety-gymnasium`` to run the following experiments. To install, in a linux machine, type: + +```bash +pip install safety_gymnasium +``` + +## Training agents used to generate data + +```bash +omnisafe train --env-id SafetyAntVelocity-v1 --algo PPO +omnisafe train --env-id SafetyAntVelocity-v1 --algo PPOLag +``` + +## Collect offline data + +```python +from omnisafe.common.offline.data_collector import OfflineDataCollector + + +# please change agent path +env_name = 'SafetyAntVelocity-v1' +size = 1_000_000 +agents = [ + ('./runs/PPO', 'epoch-500', 500_000), + ('./runs/PPOLag', 'epoch-500', 500_000), +] +save_dir = './data' + +if __name__ == '__main__': + col = OfflineDataCollector(size, env_name) + for agent, model_name, num in agents: + col.register_agent(agent, model_name, num) + col.collect(save_dir) +``` + +## Run the Benchmark + +You can set the main function of ``examples/benchmarks/experimrnt_grid.py`` as: + +```python +if __name__ == '__main__': + eg = ExperimentGrid(exp_name='offline-Benchmarks') + + # set up the algorithms. + offline_policy = ['VAEBC', 'BCQ', 'BCQLag', 'CCR', 'CCRR', 'COptiDICE'] + + eg.add('algo', offline_policy) + + # you can use wandb to monitor the experiment. + eg.add('logger_cfgs:use_wandb', [False]) + # you can use tensorboard to monitor the experiment. + eg.add('logger_cfgs:use_tensorboard', [True]) + # add dataset path + eg.add('train_cfgs:dataset', [dataset_path]) + + # set up the environment. 
+ eg.add('env_id', [ + 'SafetyAntVelocity-v1', + ]) + eg.add('seed', [0, 5, 10, 15, 20]) + + # total experiment num must can be divided by num_pool + # meanwhile, users should decide this value according to their machine + eg.run(train, num_pool=5) +``` + +After that, you can run the following command to run the benchmark: + +```bash +cd examples/benchmarks +python run_experiment_grid.py +``` + +You can also plot the results by running the following command: + +```bash +cd examples +python plot.py --log-dir ALGODIR +``` + +## OmniSafe Benchmark + +### Performance Table + + + + + + + + + +
| Environment | VAE-BC Reward | VAE-BC Cost | C-CRR Reward | C-CRR Cost | BCQLag Reward | BCQLag Cost | COptiDICE Reward | COptiDICE Cost |
|---|---|---|---|---|---|---|---|---|
| SafetyPointCircle1-v0 (beta=0.25) | 43.66 ± 0.90 | 109.86 ± 13.24 | 45.48 ± 0.87 | 127.30 ± 12.60 | 43.31 ± 0.76 | 113.39 ± 12.81 | 40.68 ± 0.93 | 67.11 ± 13.15 |
| SafetyPointCircle1-v0 (beta=0.50) | 42.84 ± 1.36 | 62.34 ± 14.84 | 45.99 ± 1.36 | 97.20 ± 13.57 | 44.68 ± 1.97 | 95.06 ± 33.07 | 39.55 ± 1.39 | 53.87 ± 13.27 |
| SafetyPointCircle1-v0 (beta=0.75) | 40.23 ± 0.75 | 41.25 ± 10.12 | 40.66 ± 0.88 | 49.90 ± 10.81 | 42.94 ± 1.04 | 85.37 ± 23.41 | 40.98 ± 0.89 | 70.40 ± 12.14 |
| SafetyCarCircle1-v0 (beta=0.25) | 19.62 ± 0.28 | 150.54 ± 7.63 | 18.53 ± 0.45 | 122.63 ± 13.14 | 18.88 ± 0.61 | 125.44 ± 15.68 | 17.25 ± 0.37 | 90.86 ± 10.75 |
| SafetyCarCircle1-v0 (beta=0.50) | 18.69 ± 0.33 | 125.97 ± 10.36 | 17.24 ± 0.43 | 89.47 ± 11.55 | 18.14 ± 0.96 | 108.07 ± 20.70 | 16.38 ± 0.43 | 70.54 ± 12.36 |
| SafetyCarCircle1-v0 (beta=0.75) | 17.31 ± 0.33 | 85.53 ± 11.33 | 15.74 ± 0.42 | 48.38 ± 10.31 | 17.10 ± 0.84 | 77.54 ± 14.07 | 15.58 ± 0.37 | 49.42 ± 8.70 |

**Table 1:** Performance of the OmniSafe offline algorithms, evaluated after 1e6 training steps under the experimental setting of `cost_limit=25.00`. We introduce a quantization parameter `beta` that controls the proportion of safe trajectories in the mixed dataset; to some extent, `beta` indicates the difficulty of the dataset. A smaller `beta` means fewer safe trajectories in the dataset, and therefore less safety-related information for the algorithm to learn from.
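To make the role of `beta` concrete, the sketch below shows one way such a mixed dataset could be assembled from trajectories collected by a constraint-satisfying policy (e.g., PPOLag) and an unconstrained one (e.g., PPO). The helper name and trajectory format are hypothetical; the point is only that `beta` is the fraction of safe trajectories kept in the mixture.

```python
import random


def mix_dataset(safe_trajs: list, unsafe_trajs: list, beta: float, size: int, seed: int = 0):
    """Build a mixed offline dataset in which roughly `beta` of the trajectories come
    from the safe (constraint-satisfying) policy and 1 - beta from the unsafe one."""
    rng = random.Random(seed)
    n_safe = int(beta * size)
    n_unsafe = size - n_safe
    mixed = rng.sample(safe_trajs, n_safe) + rng.sample(unsafe_trajs, n_unsafe)
    rng.shuffle(mixed)
    return mixed

# Example: beta = 0.25 keeps only a quarter of the trajectories from the safe policy,
# which corresponds to the hardest setting reported in Table 1.
```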

### Performance Curves

Training-curve plots are provided for SafetyPointCircle1-v0 and SafetyCarCircle1-v0 at beta = 0.25, 0.50, and 0.75.
diff --git a/docs/source/benchmark/on-policy.md b/docs/source/benchmark/on-policy.md new file mode 100644 index 000000000..798551e8e --- /dev/null +++ b/docs/source/benchmark/on-policy.md @@ -0,0 +1,2841 @@ +# On-Policy Algorithms + +The OmniSafe Safety-Gymnasium Benchmark for on-policy algorithms evaluates the effectiveness of OmniSafe's on-policy algorithms across multiple environments from the [Safety-Gymnasium](https://github.com/PKU-Alignment/safety-gymnasium) task suite. For each supported algorithm and environment, we offer the following: + +- Default hyperparameters used for the benchmark and scripts that enable result replication. +- Performance comparison with other open-source implementations. +- Graphs and raw data that can be utilized for research purposes. +- Detailed logs obtained during training. + +Supported algorithms are listed below: + +**First-Order** + +- **[NIPS 1999]** [Policy Gradient (PG)](https://papers.nips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf) +- **[Preprint 2017]** [Proximal Policy Optimization (PPO)](https://arxiv.org/pdf/1707.06347.pdf) +- [The Lagrange version of PPO (PPOLag)](https://cdn.openai.com/safexp-short.pdf) +- **[IJCAI 2022]** [Penalized Proximal Policy Optimization for Safe Reinforcement Learning (P3O)]( https://arxiv.org/pdf/2205.11814.pdf) +- **[NeurIPS 2020]** [First Order Constrained Optimization in Policy Space (FOCOPS)](https://arxiv.org/abs/2002.06506) +- **[NeurIPS 2022]** [Constrained Update Projection Approach to Safe Policy Optimization (CUP)](https://arxiv.org/abs/2209.07089) + +**Second-Order** + +- **[NeurIPS 2001]** [A Natural Policy Gradient (NaturalPG))](https://proceedings.neurips.cc/paper/2001/file/4b86abe48d358ecf194c56c69108433e-Paper.pdf) +- **[PMLR 2015]** [Trust Region Policy Optimization (TRPO)](https://arxiv.org/abs/1502.05477) +- [The Lagrange version of TRPO (TRPOLag)](https://cdn.openai.com/safexp-short.pdf) +- **[ICML 2017]** [Constrained Policy Optimization (CPO)](https://proceedings.mlr.press/v70/achiam17a) +- **[ICML 2017]** [Proximal Constrained Policy Optimization (PCPO)](https://proceedings.mlr.press/v70/achiam17a) +- **[ICLR 2019]** [Reward Constrained Policy Optimization (RCPO)](https://openreview.net/forum?id=SkfrvsA9FX) + +**Saute RL** + +- **[ICML 2022]** [Sauté RL: Almost Surely Safe Reinforcement Learning Using State Augmentation (PPOSaute, TRPOSaute)](https://arxiv.org/abs/2202.06558) + +**Simmer** + +- **[NeurIPS 2022]** [Effects of Safety State Augmentation on Safe Exploration (PPOSimmerPID, TRPOSimmerPID)](https://arxiv.org/abs/2206.02675) + +**PID-Lagrangian** + +- **[ICML 2020]** [Responsive Safety in Reinforcement Learning by PID Lagrangian Methods (CPPOPID, TRPOPID)](https://arxiv.org/abs/2007.03964) + +**Early Terminated MDP** + +- **[Preprint 2021]** [Safe Exploration by Solving Early Terminated MDP (PPOEarlyTerminated, TRPOEarlyTerminated)](https://arxiv.org/pdf/2107.04200.pdf) + + + + +## Safety-Gymnasium + +We highly recommend using **Safety-Gymnasium** to run the following experiments. To install, in a linux machine, type: + +```bash +pip install safety_gymnasium +``` + +## Run the Benchmark + +You can set the main function of `examples/benchmarks/experiment_grid.py` as: + +```python +if __name__ == '__main__': + eg = ExperimentGrid(exp_name='On-Policy-Benchmarks') + + # set up the algorithms. 
+ base_policy = ['PolicyGradient', 'NaturalPG', 'TRPO', 'PPO'] + naive_lagrange_policy = ['PPOLag', 'TRPOLag', 'RCPO'] + first_order_policy = ['CUP', 'FOCOPS', 'P3O'] + second_order_policy = ['CPO', 'PCPO'] + saute_policy = ['PPOSaute', 'TRPOSaute'] + simmer_policy = ['PPOSimmerPID', 'TRPOSimmerPID'] + pid_policy = ['CPPOPID', 'TRPOPID'] + early_mdp_policy = ['PPOEarlyTerminated', 'TRPOEarlyTerminated'] + + eg.add( + 'algo', + base_policy + + naive_lagrange_policy + + first_order_policy + + second_order_policy + + saute_policy + + simmer_policy + + pid_policy + + early_mdp_policy + ) + + # you can use wandb to monitor the experiment. + eg.add('logger_cfgs:use_wandb', [False]) + # you can use tensorboard to monitor the experiment. + eg.add('logger_cfgs:use_tensorboard', [True]) + + # the default configs here are as follows: + # eg.add('algo_cfgs:steps_per_epoch', [20000]) + # eg.add('train_cfgs:total_steps', [20000 * 500]) + # which can reproduce results of 1e7 steps. + + # if you want to reproduce results of 1e6 steps, using + # eg.add('algo_cfgs:steps_per_epoch', [2048]) + # eg.add('train_cfgs:total_steps', [2048 * 500]) + + # set the device. + avaliable_gpus = list(range(torch.cuda.device_count())) + # if you want to use GPU, please set gpu_id like follows: + # gpu_id = [0, 1, 2, 3] + # if you want to use CPU, please set gpu_id = None + # we recommends using CPU to obtain results as consistent + # as possible with our publicly available results, + # since the performance of all on-policy algorithms + # in OmniSafe is tested on CPU. + gpu_id = None + + if gpu_id and not set(gpu_id).issubset(avaliable_gpus): + warnings.warn('The GPU ID is not available, use CPU instead.', stacklevel=1) + gpu_id = None + + # set up the environment. + eg.add('env_id', [ + 'SafetyHopper', + 'SafetyWalker2d', + 'SafetySwimmer', + 'SafetyAnt', + 'SafetyHalfCheetah', + 'SafetyHumanoid' + ]) + eg.add('seed', [0, 5, 10, 15, 20]) + + # total experiment num must can be divided by num_pool. + # meanwhile, users should decide this value according to their machine. + eg.run(train, num_pool=5, gpu_id=gpu_id) +``` + +After that, you can run the following command to run the benchmark: + +```bash +cd examples/benchmarks +python run_experiment_grid.py +``` + +You can also plot the results by running the following command: + +```bash +cd examples +python analyze_experiment_results.py +``` + +**For a detailed usage of OmniSafe statistics tool, please refer to [this tutorial](https://omnisafe.readthedocs.io/en/latest/common/stastics_tool.html).** + +Logs is saved in `examples/benchmarks/exp-x` and can be monitored with tensorboard or wandb. + +```bash +tensorboard --logdir examples/benchmarks/exp-x +``` + +After the experiment is finished, you can use the following command to generate the video of the trained agent: + +```bash +cd examples +python evaluate_saved_policy.py +``` + +Please note that before you evaluate, set the `LOG_DIR` in `evaluate_saved_policy.py`. 
+ +For example, if I train `PPOLag` in `SafetyHumanoid` + +```python +LOG_DIR = '~/omnisafe/examples/runs/PPOLag-/seed-000' +play = True +save_replay = True +if __name__ == '__main__': + evaluator = omnisafe.Evaluator(play=play, save_replay=save_replay) + for item in os.scandir(os.path.join(LOG_DIR, 'torch_save')): + if item.is_file() and item.name.split('.')[-1] == 'pt': + evaluator.load_saved( + save_dir=LOG_DIR, model_name=item.name, camera_name='track', width=256, height=256 + ) + evaluator.render(num_episodes=1) + evaluator.evaluate(num_episodes=1) +``` + +## OmniSafe Benchmark + +### Classic Reinforcement Learning Algorithms +To ascertain the credibility of OmniSafe ’s algorithmic implementation, a comparative assessment was conducted, juxtaposing the performance of classical reinforcement learning algorithms. Such as Policy Gradient, Natural Policy Gradient, TRPO and PPO. The performance table is provided in Table 1. with well-established open-source implementations, specifically [Tianshou](https://github.com/thu-ml/tianshou) and [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3). + + + + + + + + + +
| Environment | Policy Gradient (OmniSafe) | Policy Gradient (Tianshou) | Policy Gradient (SB3) | PPO (OmniSafe) | PPO (Tianshou) | PPO (SB3) |
|---|---|---|---|---|---|---|
| SafetyAntVelocity-v1 | 2769.45 ± 550.71 | 145.33 ± 127.55 | - | 4295.96 ± 658.2 | 2607.48 ± 1415.78 | 1780.61 ± 780.65 |
| SafetyHalfCheetahVelocity-v1 | 2625.44 ± 1079.04 | 707.56 ± 158.59 | - | 3507.47 ± 1563.69 | 6299.27 ± 1692.38 | 5074.85 ± 2225.47 |
| SafetyHopperVelocity-v1 | 1884.38 ± 825.13 | 343.88 ± 51.85 | - | 2679.98 ± 921.96 | 1834.7 ± 862.06 | 838.96 ± 351.10 |
| SafetyHumanoidVelocity-v1 | 647.52 ± 154.82 | 438.97 ± 123.68 | - | 1106.09 ± 607.6 | 677.43 ± 189.96 | 762.73 ± 170.22 |
| SafetySwimmerVelocity-v1 | 47.31 ± 16.19 | 27.12 ± 7.47 | - | 113.28 ± 20.22 | 37.93 ± 8.68 | 273.86 ± 87.76 |
| SafetyWalker2dVelocity-v1 | 1665.00 ± 930.18 | 373.63 ± 129.2 | - | 3806.39 ± 1547.48 | 3748.26 ± 1832.83 | 3304.35 ± 706.13 |

| Environment | NaturalPG (OmniSafe) | NaturalPG (Tianshou) | NaturalPG (SB3) | TRPO (OmniSafe) | TRPO (Tianshou) | TRPO (SB3) |
|---|---|---|---|---|---|---|
| SafetyAntVelocity-v1 | 3793.70 ± 583.66 | 2062.45 ± 876.43 | - | 4362.43 ± 640.54 | 2521.36 ± 1442.10 | 3233.58 ± 1437.16 |
| SafetyHalfCheetahVelocity-v1 | 4096.77 ± 1223.70 | 3430.9 ± 239.38 | - | 3313.31 ± 1048.78 | 4255.73 ± 1053.82 | 7185.06 ± 3650.82 |
| SafetyHopperVelocity-v1 | 2590.54 ± 631.05 | 993.63 ± 489.42 | - | 2698.19 ± 568.80 | 1346.94 ± 984.09 | 2467.10 ± 1160.25 |
| SafetyHumanoidVelocity-v1 | 3838.67 ± 1654.79 | 810.76 ± 270.69 | - | 1461.51 ± 602.23 | 749.42 ± 149.81 | 2828.18 ± 2256.38 |
| SafetySwimmerVelocity-v1 | 116.33 ± 5.97 | 29.75 ± 12.00 | - | 105.08 ± 31.00 | 37.21 ± 4.04 | 258.62 ± 124.91 |
| SafetyWalker2dVelocity-v1 | 4054.62 ± 1266.76 | 3372.59 ± 1049.14 | - | 4099.97 ± 409.05 | 3372.59 ± 961.74 | 4227.91 ± 760.93 |

**Table 1:** Performance of OmniSafe evaluated against published baselines, Tianshou and Stable-Baselines3 (SB3), in the Safety-Gymnasium MuJoCo Velocity environments. Each entry reports the mean and standard deviation over 10 evaluation episodes across multiple random seeds. Entries marked "-" are not available.
+ +### Safe Reinforcement Learning Algorithms + +To demonstrate the high reliability of the algorithms implemented, OmniSafe offers performance insights within the Safety-Gymnasium environment. It should be noted that all data is procured under the constraint of `cost_limit=25.00`. The results are presented in Table 2 and the training curves are in the following sections (Please click the triangle button to see the training curves). + +#### Performance Table + + + + + + + + + + + + +
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Policy GradientNatural PGTRPOPPO
EnvironmentRewardCostRewardCostRewardCostRewardCost
SafetyAntVelocity-v15292.29 ± 913.44919.42 ± 158.615547.20 ± 807.89895.56 ± 77.136026.79 ± 314.98933.46 ± 41.285977.73 ± 885.65958.13 ± 134.5
SafetyHalfCheetahVelocity-v15188.46 ± 1202.76896.55 ± 184.75878.28 ± 2012.24847.74 ± 249.026490.76 ± 2507.18734.26 ± 321.886921.83 ± 1721.79919.2 ±173.08
SafetyHopperVelocity-v13218.17 ± 672.88881.76 ± 198.462613.95 ± 866.13587.78 ± 220.972047.35 ± 447.33448.12 ± 103.872337.11 ± 942.06550.02 ± 237.70
SafetyHumanoidVelocity-v17001.78 ± 419.67834.11 ± 212.438055.20 ± 641.67946.40 ± 9.118681.24 ± 3934.08718.42 ± 323.309115.93 ± 596.88960.44 ± 7.06
SafetySwimmerVelocity-v177.05 ±33.44107.1 ±60.58120.19 ± 7.74161.78 ± 17.51124.91 ± 6.13176.56 ± 15.95119.77 ± 13.8165.27 ± 20.15
SafetyWalker2dVelocity-v14832.34 ± 685.76866.59 ± 93.475347.35 ± 436.86914.74 ± 32.616096.67 ± 723.06914.46 ± 27.856239.52 ± 879.99902.68 ± 100.93
SafetyCarGoal1-v035.86 ±1.9757.46 ±48.3436.07 ±1.2558.06 ±10.0336.60 ±0.2255.58 ±12.6833.41 ±2.8958.06 ±42.06
SafetyCarButton1-v019.76 ±10.15353.26 ± 177.0822.16 ±4.48333.98 ± 67.4921.98 ±2.06343.22 ± 24.6017.51 ±9.46373.98 ± 156.64
SafetyCarGoal2-v029.43 ±4.62179.2 ±84.8630.26 ±0.38209.62 ± 29.9732.17 ±1.24190.74 ± 21.0529.88 ±4.55194.16 ± 106.2
SafetyCarButton2-v018.06 ±10.53349.82 ± 187.0720.85 ±3.14313.88 ± 58.2020.51 ±3.34316.42 ± 35.2821.35 ±8.22312.64 ± 138.4
SafetyPointGoal1-v026.19 ±3.44201.22 ± 80.426.92 ±0.5857.92 ±9.9727.20 ±0.4445.88 ±11.2725.44 ±5.4355.72 ±35.55
SafetyPointButton1-v029.98 ±5.24141.74 ± 75.1331.95 ±1.53123.98 ± 32.0530.61 ±0.40134.38 ± 22.0627.03 ±6.14152.48 ± 80.39
SafetyPointGoal2-v025.18 ±3.62204.96 ± 104.9726.19 ±0.84193.60 ± 18.5425.61 ±0.89202.26 ± 15.1525.49 ±2.46159.28 ± 87.13
SafetyPointButton2-v026.88 ±4.38153.88 ± 65.5428.45 ±1.49160.40 ± 20.0828.78 ±2.05170.30 ± 30.5925.91 ±6.15166.6 ±111.21
RCPOTRPOLagPPOLagP3O
EnvironmentRewardCostRewardCostRewardCostRewardCost
SafetyAntVelocity-v13139.52 ± 110.3412.34 ±3.113041.89 ± 180.7719.52 ±20.213261.87 ± 80.0012.05 ±6.572636.62 ± 181.0920.69 ±10.23
SafetyHalfCheetahVelocity-v12440.97 ± 451.889.02 ±9.342884.68 ± 77.479.04 ±11.832946.15 ± 306.353.44 ±4.772117.84 ± 313.5527.6 ±8.36
SafetyHopperVelocity-v11428.58 ± 199.8711.12 ±12.661391.79 ± 269.0711.22 ±9.97961.92 ± 752.8713.96 ±19.331231.52 ± 465.3516.33 ±11.38
SafetyHumanoidVelocity-v16286.51 ± 151.0319.47 ±7.746551.30 ± 58.4259.56 ±117.376624.46 ± 25.95.87 ±9.466342.47 ± 82.45126.4 ±193.76
SafetySwimmerVelocity-v161.29 ±18.1222.60 ±1.1681.18 ±16.3322.24 ±3.9164.74 ±17.6728.02 ±4.0938.02 ±34.1818.4 ±12.13
SafetyWalker2dVelocity-v13064.43 ± 218.833.02 ±1.483207.10 ± 7.8814.98 ±9.272982.27 ± 681.5513.49 ±14.552713.57 ± 313.220.51 ±14.09
SafetyCarGoal1-v018.71 ±2.7223.10 ±12.5727.04 ±1.8226.80 ±5.6413.27 ±9.2621.72 ±32.06-1.10 ±6.85150.58 ±99.24
SafetyCarButton1-v0-2.04 ±2.9843.48 ±31.52-0.38 ±0.8537.54 ±31.720.33 ±1.9655.5 ±89.64-2.06 ±7.243.78 ±98.01
SafetyCarGoal2-v02.30 ±1.7622.90 ±16.223.65 ±1.0939.98 ±20.291.58 ±2.4913.82 ±24.62-0.07 ±1.6243.86 ±99.58
SafetyCarButton2-v0-1.35 ±2.4142.02 ±31.77-1.68 ±2.5520.36 ±13.670.76 ±2.5247.86 ±103.270.11 ±0.7285.94 ±122.01
SafetyPointGoal1-v015.27 ±4.0530.56 ±19.1518.51 ±3.8322.98 ±8.4512.96 ±6.9525.80 ±34.991.6 ±3.0131.1 ±80.03
SafetyPointButton1-v03.65 ±4.4726.30 ±9.226.93 ±1.8431.16 ±20.584.60 ±4.7320.8 ±35.78-0.34 ±1.5352.86 ±85.62
SafetyPointGoal2-v02.17 ±1.4633.82 ±21.934.64 ±1.4326.00 ±4.701.98 ±3.8641.20 ±61.030.34 ±2.265.84 ±195.76
SafetyPointButton2-v07.18 ±1.9345.02 ±25.285.43 ±3.4425.10 ±8.980.93 ±3.6933.72 ±58.750.33 ±2.4428.5 ±49.79
CUPPCPOFOCOPSCPO
EnvironmentRewardCostRewardCostRewardCostRewardCost
SafetyAntVelocity-v13215.79 ± 346.6818.25 ±17.122257.07 ± 47.9710.44 ±5.223184.48 ± 305.5914.75 ±6.363098.54 ± 78.9014.12 ±3.41
SafetyHalfCheetahVelocity-v12850.6 ± 244.654.27 ±4.461677.93 ± 217.3119.06 ±15.262965.2 ± 290.432.37 ±3.52786.48 ± 173.454.70 ±6.72
SafetyHopperVelocity-v11716.08 ± 5.937.48 ±5.5351551.22 ± 85.1615.46 ±9.831437.75 ± 446.8710.13 ±8.871713.71 ± 18.2613.40 ±5.82
SafetyHumanoidVelocity-v16109.94 ± 497.5624.69 ±20.545852.25 ± 78.010.24 ±0.486489.39 ± 35.113.86 ±39.336465.34 ± 79.870.18 ±0.36
SafetySwimmerVelocity-v163.83 ±46.4521.95 ±11.0454.42 ±38.6517.34 ±1.5753.87 ±17.929.75 ±7.3365.30 ±43.2518.22 ±8.01
SafetyWalker2dVelocity-v12466.95 ± 1114.136.63 ±8.251802.86 ± 714.0418.82 ±5.573117.05 ± 53.608.78 ±12.382074.76 ± 962.4521.90 ±9.41
SafetyCarGoal1-v06.14 ±6.9736.12 ±89.5621.56 ±2.8738.42 ±8.3615.23 ±10.7631.66 ±93.5125.52 ±2.6543.32 ±14.35
SafetyCarButton1-v01.49 ±2.84103.24 ± 123.120.36 ±0.8540.52 ±21.250.21 ±2.2731.78 ±47.030.82 ±1.6037.86 ±27.41
SafetyCarGoal2-v01.78 ±4.0395.4 ±129.641.62 ±0.5648.12 ±31.192.09 ±4.3331.56 ±58.933.56 ±0.9232.66 ±3.31
SafetyCarButton2-v01.49 ±2.64173.68 ± 163.770.66 ±0.4249.72 ±36.501.14 ±3.1846.78 ±57.470.17 ±1.1948.56 ±29.34
SafetyPointGoal1-v014.42 ±6.7419.02 ±20.0818.57 ±1.7122.98 ±6.5614.97 ±9.0133.72 ±42.2420.46 ±1.3828.84 ±7.76
SafetyPointButton1-v03.5 ±7.0739.56 ±54.262.66 ±1.8349.40 ±36.765.89 ±7.6638.24 ±42.964.04 ±4.5440.00 ±4.52
SafetyPointGoal2-v01.06 ±2.67107.3 ±204.261.06 ±0.6951.92 ±47.402.21 ±4.1537.92 ±111.812.50 ±1.2540.84 ±23.31
SafetyPointButton2-v02.88 ±3.6554.24 ±71.071.05 ±1.2741.14 ±12.352.43 ±3.3317.92 ±26.15.09 ±1.8348.92 ±17.79
PPOSauteTRPOSautePPOSimmerPIDTRPOSimmerPID
EnvironmentRewardCostRewardCostRewardCostRewardCost
SafetyAntVelocity-v12978.74 ± 93.6516.77 ±0.922507.65 ± 63.978.036 ±0.392944.84 ± 60.5316.20 ±0.663018.95 ± 66.4416.52 ±0.23
SafetyHalfCheetahVelocity-v12901.40 ± 25.4916.20 ± 0.602521.80 ± 477.297.61 ±0.392922.17 ± 24.8416.14 ±0.142737.79 ± 37.5316.44 ±0.21
SafetyHopperVelocity-v11650.91 ± 152.6517.87 ±1.331368.28 ± 576.0810.38 ±4.381699.94 ± 24.2517.04 ±0.411608.41 ± 88.2316.30 ±0.30
SafetyHumanoidVelocity-v16401.00 ± 32.2317.10 ±2.415759.44 ± 75.7315.84 ±1.426401.85 ± 57.6211.06 ±5.356411.32 ± 44.2613.04 ±2.68
SafetySwimmerVelocity-v135.61 ±4.373.44 ±1.3534.72 ±1.3710.19 ±2.3277.52 ±40.200.98 ±1.9151.39 ±40.090.00 ±0.00
SafetyWalker2dVelocity-v12410.89 ± 241.2218.88 ±2.382548.82 ± 891.6513.21 ±6.093187.56 ± 32.6617.10 ±0.493156.99 ± 30.9317.14 ±0.54
SafetyCarGoal1-v07.12 ±5.4121.68 ±29.1116.67 ±10.5723.58 ±26.398.45 ±7.1618.98 ±25.6315.08 ±13.4123.22 ±19.80
SafetyCarButton1-v0-1.72 ±0.8951.88 ±28.18-2.03 ±0.406.24 ±6.14-0.57 ±0.6349.14 ±37.77-1.24 ±0.4717.26 ±16.13
SafetyCarGoal2-v00.90 ±1.2019.98 ±10.121.76 ±5.2031.50 ±45.501.02 ±1.4127.32 ±60.120.93 ±2.2126.66 ±60.07
SafetyCarButton2-v0-1.89 ±1.8647.33 ±28.90-2.60 ±0.4074.57 ±84.95-1.31 ±0.9352.33 ±19.96-0.99 ±0.6320.40 ±12.77
SafetyPointGoal1-v07.06 ±5.8520.04 ±21.9116.18 ±9.5529.94 ±26.688.30 ±6.0325.32 ±31.9111.64 ±8.4630.00 ±27.67
SafetyPointButton1-v0-1.47 ±0.9822.60 ±13.91-3.13 ±3.519.04 ±3.94-1.97 ±1.4112.80 ±7.84-1.36 ±0.372.14 ±1.73
SafetyPointGoal2-v00.84 ±2.9314.06 ±30.211.64 ±4.0219.00 ±34.690.56 ±2.5212.36 ±43.391.55 ±4.6814.90 ±27.82
SafetyPointButton2-v0-1.38 ±0.1112.00 ±8.60-2.56 ±0.6717.27 ±10.01-1.70 ±0.297.90 ±3.30-1.66 ±0.996.70 ±4.74
CPPOPIDTRPOPIDPPOEarlyTerminatedTRPOEarlyTerminated
EnvironmentRewardCostRewardCostRewardCostRewardCost
SafetyAntVelocity-v13213.36 ± 146.7814.30 ±7.393052.94 ± 139.6715.22 ±3.682801.53 ± 19.660.23 ±0.093052.63 ± 58.410.40 ±0.23
SafetyHalfCheetahVelocity-v12837.89 ± 398.528.06 ±9.622796.75 ± 190.8411.16 ±9.802447.25 ± 346.843.47 ±4.902555.70 ± 368.170.06 ±0.08
SafetyHopperVelocity-v11713.29 ± 10.218.96 ±4.281178.59 ± 646.7118.76 ±8.931643.39 ± 2.580.77 ±0.261646.47 ± 49.950.42 ±0.84
SafetyHumanoidVelocity-v16579.26 ± 55.703.76 ±3.616407.95 ± 254.067.38 ±11.346321.45 ± 35.730.00 ±0.006332.14 ± 89.860.00 ±0.00
SafetySwimmerVelocity-v191.05 ±62.6819.12 ±8.3369.75 ±46.5220.48 ±9.1333.02 ±7.2624.23 ±0.5439.24 ±5.0123.20 ±0.48
SafetyWalker2dVelocity-v12183.43 ± 1300.6914.12 ±10.282707.75 ± 980.569.60 ±8.942195.57 ± 1046.297.63 ±10.442079.64 ± 1028.7313.74 ±15.94
SafetyCarGoal1-v010.60 ±2.5130.66 ±7.5325.49 ±1.3128.92 ±7.6617.92 ±1.5421.60 ±0.8322.09 ±3.0717.97 ±1.35
SafetyCarButton1-v0-1.36 ±0.6814.62 ±9.40-0.31 ±0.4915.24 ±17.014.47 ±1.1225.00 ±0.004.34 ±0.7225.00 ±0.00
SafetyCarGoal2-v00.13 ±1.1123.50 ±1.221.77 ±1.2017.43 ±12.136.59 ±0.5825.00 ±0.007.12 ±4.0623.37 ±1.35
SafetyCarButton2-v0-1.59 ±0.7039.97 ±26.91-2.95 ±4.0327.90 ±6.374.86 ±1.5725.00 ±0.005.07 ±1.2425.00 ±0.00
SafetyPointGoal1-v08.43 ±3.4325.74 ±7.8319.24 ±3.9421.38 ±6.9616.03 ±8.6019.17 ±9.4216.31 ±6.9922.10 ±6.13
SafetyPointButton1-v01.18 ±1.0229.42 ±12.106.40 ±1.4327.90 ±13.277.48 ±8.4724.27 ±3.959.52 ±7.8625.00 ±0.00
SafetyPointGoal2-v0-0.56 ±0.0648.43 ±40.551.67 ±1.4323.50 ±11.176.09 ±5.0325.00 ±0.008.62 ±7.1325.00 ±0.00
SafetyPointButton2-v00.42 ±0.6328.87 ±11.271.00 ±1.0030.00 ±9.506.94 ±4.4725.00 ±0.008.35 ±10.4425.00 ±0.00
+
+ +

Table 2: The performance of OmniSafe on-policy algorithms, encompassing both reward and cost, was assessed within the Safety-Gymnasium environments. It is crucial to highlight that all on-policy algorithms underwent evaluation following 1e7 training steps.
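For reference, the `cost_limit` quoted in these captions is the budget d in the constrained-MDP objective that every safe algorithm in Table 2 approximates in some way. It is written here with undiscounted episodic sums to match how the tables report reward and cost; individual algorithms may work with discounted or surrogate versions of both terms.

```math
\max_{\pi}\ \mathbb{E}_{\tau \sim \pi}\left[\sum_{t=0}^{T} r(s_t, a_t)\right]
\quad \text{s.t.} \quad
\mathbb{E}_{\tau \sim \pi}\left[\sum_{t=0}^{T} c(s_t, a_t)\right] \le d,
\qquad d = 25.00 \ \text{in these experiments.}
```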

+ +#### First Order Algorithms + +
**1e6 Steps Velocity Results**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

**Figure 1.1:** Training curves in Safety-Gymnasium MuJoCo Velocity environments within 1e6 steps.

**1e7 Steps Velocity Results**

Panels: the same six Velocity environments.

**Figure 1.2:** Training curves in Safety-Gymnasium MuJoCo Velocity environments within 1e7 steps.

**1e7 Steps Navigation Results**

Panels: SafetyCarButton1-v0, SafetyCarButton2-v0, SafetyCarGoal1-v0, SafetyCarGoal2-v0, SafetyPointButton1-v0, SafetyPointButton2-v0, SafetyPointGoal1-v0, SafetyPointGoal2-v0.

**Figure 1.3:** Training curves in Safety-Gymnasium MuJoCo Navigation environments within 1e7 steps.
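Among the first-order methods, PPOLag is the simplest: it keeps the clipped PPO surrogate and only changes which advantage it optimizes. The sketch below shows one common way to fold the cost advantage in; the normalization by 1 + lambda is a typical choice in Lagrangian PPO variants, not necessarily OmniSafe's exact update.

```python
import torch


def ppolag_surrogate(ratio: torch.Tensor, adv_r: torch.Tensor, adv_c: torch.Tensor,
                     lagrange: float, clip: float = 0.2) -> torch.Tensor:
    """Clipped PPO loss on a penalized advantage: reward advantage minus the
    lambda-weighted cost advantage, rescaled so the step size stays comparable."""
    adv = (adv_r - lagrange * adv_c) / (1.0 + lagrange)
    surr = torch.min(ratio * adv, torch.clamp(ratio, 1 - clip, 1 + clip) * adv)
    return -surr.mean()

# `ratio` is pi_theta(a|s) / pi_old(a|s); `lagrange` is updated from the episodic
# cost in the same spirit as the Lagrangian sketch in the off-policy section.
```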

+ +#### Second Order Algorithms + +
**1e6 Steps Velocity Results**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

**Figure 2.1:** Training curves of second-order algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e6 steps.

**1e7 Steps Velocity Results**

Panels: the same six Velocity environments.

**Figure 2.2:** Training curves of second-order algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e7 steps.

**1e7 Steps Navigation Results**

Panels: SafetyCarButton1-v0, SafetyCarButton2-v0, SafetyCarGoal1-v0, SafetyCarGoal2-v0, SafetyPointButton1-v0, SafetyPointButton2-v0, SafetyPointGoal1-v0, SafetyPointGoal2-v0.

**Figure 2.3:** Training curves of second-order algorithms in Safety-Gymnasium MuJoCo Navigation environments within 1e7 steps.
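The second-order methods (TRPO-style updates such as NaturalPG, TRPO, TRPOLag, CPO, and PCPO) need products with the inverse Fisher matrix but never form the matrix itself: they solve Hx = g with conjugate gradient using only matrix-vector products. A minimal, self-contained sketch follows; here H is an explicit matrix purely for illustration, whereas in practice `hvp` would be a Fisher-vector-product routine.

```python
import numpy as np


def conjugate_gradient(hvp, g, iters=10, tol=1e-10):
    """Solve H x = g given only a matrix-vector product `hvp`, as TRPO/CPO-style
    updates do to avoid building the Fisher information matrix explicitly."""
    x = np.zeros_like(g)
    r = g.copy()  # residual g - H x, with x = 0 initially
    p = g.copy()
    rs_old = r @ r
    for _ in range(iters):
        Hp = hvp(p)
        alpha = rs_old / (p @ Hp + 1e-12)
        x += alpha * p
        r -= alpha * Hp
        rs_new = r @ r
        if rs_new < tol:
            break
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x


if __name__ == '__main__':
    # toy example with an explicit SPD matrix standing in for the Fisher matrix
    H = np.array([[4.0, 1.0], [1.0, 3.0]])
    g = np.array([1.0, 2.0])
    x = conjugate_gradient(lambda v: H @ v, g)
    print(x, H @ x)  # H @ x should be close to g
```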

+ +#### Saute Algorithms + +
**1e6 Steps Velocity Results**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

**Figure 3.1:** Training curves of Saute MDP algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e6 steps.

**1e7 Steps Velocity Results**

Panels: the same six Velocity environments.

**Figure 3.2:** Training curves of Saute MDP algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e7 steps.

**1e7 Steps Navigation Results**

Panels: SafetyCarButton1-v0, SafetyCarButton2-v0, SafetyCarCircle1-v0, SafetyCarCircle2-v0, SafetyCarGoal1-v0, SafetyCarGoal2-v0, SafetyPointButton1-v0, SafetyPointButton2-v0, SafetyPointCircle1-v0, SafetyPointCircle2-v0, SafetyPointGoal1-v0, SafetyPointGoal2-v0.

**Figure 3.3:** Training curves of Saute MDP algorithms in Safety-Gymnasium MuJoCo Navigation environments within 1e7 steps.
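PPOSaute and TRPOSaute do not change the policy optimizer; they change the environment, tracking the remaining safety budget, appending it to the observation, and replacing the reward with a penalty once the budget is spent. The wrapper below is a rough illustration of that idea, not OmniSafe's implementation: it assumes the standard Gymnasium step API, a Box observation space, and that the step cost arrives in `info['cost']`; the penalty value and the normalization by `cost_limit` are also assumptions.

```python
import gymnasium as gym
import numpy as np


class SauteWrapper(gym.Wrapper):
    """Rough sketch of Saute-style state augmentation: keep a normalized remaining
    cost budget z, append it to the observation, and replace the reward with a
    penalty once the budget is exhausted."""

    def __init__(self, env, cost_limit=25.0, unsafe_reward=-1.0):
        super().__init__(env)
        self.cost_limit = cost_limit
        self.unsafe_reward = unsafe_reward
        low = np.append(env.observation_space.low, -np.inf)
        high = np.append(env.observation_space.high, np.inf)
        self.observation_space = gym.spaces.Box(low=low, high=high, dtype=np.float64)
        self.z = 1.0

    def reset(self, **kwargs):
        obs, info = self.env.reset(**kwargs)
        self.z = 1.0  # full budget, normalized by cost_limit
        return np.append(obs, self.z), info

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        cost = info.get('cost', 0.0)
        self.z -= cost / self.cost_limit
        if self.z <= 0.0:
            reward = self.unsafe_reward  # budget spent: the agent only sees a penalty
        return np.append(obs, self.z), reward, terminated, truncated, info
```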

+ +#### Simmer Algorithms + +
**1e6 Steps Velocity Results**

Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

**Figure 4.1:** Training curves of Simmer MDP algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e6 steps.

**1e7 Steps Velocity Results**

Panels: the same six Velocity environments.

**Figure 4.2:** Training curves of Simmer MDP algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e7 steps.

**1e7 Steps Navigation Results**

Panels: SafetyCarButton1-v0, SafetyCarButton2-v0, SafetyCarGoal1-v0, SafetyCarGoal2-v0, SafetyPointButton1-v0, SafetyPointButton2-v0, SafetyPointGoal1-v0, SafetyPointGoal2-v0.

**Figure 4.3:** Training curves of Simmer MDP algorithms in Safety-Gymnasium MuJoCo Navigation environments within 1e7 steps.
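PPOSimmerPID and TRPOSimmerPID keep the Saute-style safety state but additionally schedule the budget itself during training, nudging an intermediate cost limit toward the target with a controller. The class below is only a rough illustration of that scheduling idea; all gains and the update rule are made-up values for exposition and are not OmniSafe's implementation.

```python
class SimmerBudgetScheduler:
    """Illustrative PI-style schedule for an intermediate safety budget that is
    nudged toward the target cost limit based on observed episodic cost."""

    def __init__(self, target_limit=25.0, init_limit=5.0, kp=0.05, ki=0.01):
        self.target_limit = target_limit
        self.limit = init_limit
        self.kp, self.ki = kp, ki
        self.integral = 0.0

    def update(self, mean_ep_cost: float) -> float:
        # positive error -> the agent still has slack, so the budget may grow
        error = self.limit - mean_ep_cost
        self.integral += error
        self.limit = min(self.target_limit,
                         max(0.0, self.limit + self.kp * error + self.ki * self.integral))
        return self.limit
```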

+ +#### PID-Lagrangian Algorithms + +
+1e6 Steps Velocity Results + + + + +
+ +
+
+ SafetyAntVelocity-v1 +
+
+ + + + +
+ +
+
+ SafetyHalfCheetahVelocity-v1 +
+
+ + + +
+ +
+
+ SafetyHopperVelocity-v1 +
+
+ + + + +
+ +
+
+ SafetyHumanoidVelocity-v1 +
+
+ + + + +
+ +
+
+ SafetyWalker2dVelocity-v1 +
+
+ + + + +
+ +
+
+ SafetySwimmerVelocity-v1 +
+
+

Figure 5.1: Training curves of PID-Lagrangian algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e6 steps + +

**1e7 Steps Velocity Results.** Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

Figure 5.2: Training curves of PID-Lagrangian algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e7 steps.

**1e7 Steps Navigation Results.** Panels: SafetyCarButton1-v0, SafetyCarButton2-v0, SafetyCarGoal1-v0, SafetyCarGoal2-v0, SafetyPointButton1-v0, SafetyPointButton2-v0, SafetyPointGoal1-v0, SafetyPointGoal2-v0.

Figure 5.3: Training curves of PID-Lagrangian algorithms in Safety-Gymnasium MuJoCo Navigation environments within 1e7 steps.

#### Early Terminated MDP Algorithms

**1e6 Steps Velocity Results.** Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

Figure 6.1: Training curves of early terminated MDP algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e6 steps.

**1e7 Steps Velocity Results.** Panels: SafetyAntVelocity-v1, SafetyHalfCheetahVelocity-v1, SafetyHopperVelocity-v1, SafetyHumanoidVelocity-v1, SafetyWalker2dVelocity-v1, SafetySwimmerVelocity-v1.

Figure 6.2: Training curves of early terminated MDP algorithms in Safety-Gymnasium MuJoCo Velocity environments within 1e7 steps.

**1e7 Steps Navigation Results.** Panels: SafetyCarButton1-v0, SafetyCarButton2-v0, SafetyCarGoal1-v0, SafetyCarGoal2-v0, SafetyPointButton1-v0, SafetyPointButton2-v0, SafetyPointGoal1-v0, SafetyPointGoal2-v0.

Figure 6.3: Training curves of early terminated MDP algorithms in Safety-Gymnasium MuJoCo Navigation environments within 1e7 steps.

diff --git a/docs/source/index.rst b/docs/source/index.rst
index 792f62052..595844afc 100644
--- a/docs/source/index.rst
+++ b/docs/source/index.rst
@@ -365,8 +365,23 @@ this project, don't hesitate to ask your question on `the GitHub issue page
| Domains | Types | Algorithms Registry |
|---|---|---|
| On Policy | Primal Dual | TRPOLag; PPOLag; PDO; RCPO |
| | | TRPOPID; CPPOPID |
| | Convex Optimization | CPO; PCPO; FOCOPS; CUP |
| | Penalty Function | IPO; P3O |
| | Primal | OnCRPO |
| Off Policy | Primal-Dual | DDPGLag; TD3Lag; SACLag |
| | | DDPGPID; TD3PID; SACPID |
| | Control Barrier Function | DDPGCBF, SACRCBF, CRABS |
| Model-based | Online Plan | SafeLOOP; CCEPETS; RCEPETS |
| | Pessimistic Estimate | CAPPETS |
| Offline | Q-Learning Based | BCQLag; C-CRR |
| | DICE Based | COptDICE |
| Other Formulation MDP | ET-MDP | PPOEarlyTerminated; TRPOEarlyTerminated |
| | SauteRL | PPOSaute; TRPOSaute |
| | SimmerRL | PPOSimmerPID; TRPOSimmerPID |

Table 1: OmniSafe supports a wide variety of SafeRL algorithms. From the perspective of classic RL, OmniSafe includes on-policy, off-policy, offline, and model-based algorithms; from the perspective of the SafeRL learning paradigm, OmniSafe supports primal-dual, projection, penalty-function, primal, and other approaches.
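For orientation, the entries in the registry above are the algorithm names accepted by OmniSafe's Python entry point. The snippet below is a minimal sketch rather than part of this patch: `omnisafe.Agent` and `agent.learn()` follow the library's public quickstart interface, while the nested `custom_cfgs` keys mirror configuration fields referenced elsewhere in these docs (`algo_cfgs`, `update_iters`, `steps_per_epoch`) and the parallelism settings discussed in the efficiency comparison later in this patch; treat the exact key names and values as assumptions to verify against the installed version.

```python
import omnisafe

# Minimal sketch: run one algorithm from the registry above on a Safety-Gymnasium task.
# The parallelism values (10 vectorized environments, 2 asynchronous agents) mirror the
# settings described in the efficiency comparison and are assumptions, not requirements.
custom_cfgs = {
    'train_cfgs': {
        'total_steps': 1_000_000,  # much shorter than the 1e7-step benchmark runs
        'vector_env_nums': 10,     # vectorized environment parallelism
        'parallel': 2,             # asynchronous agent parallelism via torch.distributed
    },
    'algo_cfgs': {
        'steps_per_epoch': 20_000,
        'update_iters': 1,
    },
}

agent = omnisafe.Agent('PPOLag', 'SafetyPointGoal1-v0', custom_cfgs=custom_cfgs)
agent.learn()
```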

diff --git a/docs/source/start/efficiency.rst b/docs/source/start/efficiency.rst new file mode 100644 index 000000000..c0b9bb428 --- /dev/null +++ b/docs/source/start/efficiency.rst @@ -0,0 +1,60 @@ +Efficiency +========== + +To demonstrate the effectiveness and resource utilization of OmniSafe as a SafeRL infrastructure, we have added a comparison of the runtime efficiency between OmniSafe and other SafeRL libraries, *i.e.*, `SafePO `_, `RL-Safety-Algorithms `_, and `Safety-starter-agents `_. The test results are shown in Table 1: + + + +.. table:: **Table 1**: Comparison of computational time consumption between OmniSafe and other libraries in one thread (unit: seconds). We selected classic algorithms PPOLag and CPO for analysis and tested the average single epoch time consumption over 10 epochs with different sizes of neural networks on SG's SafetyPointGoal1-v0. + :name: appendix_f + :width: 100 % + + +------------------------------------+----------------------------+------------------+------------------+------------------+ + | | **PPOLag** | | **CPO** | | + +------------------------------------+----------------------------+------------------+------------------+------------------+ + |**Hidden Layers Size** | 64 x 64 | 1024 x 1024 | 64 x 64 | 1024 x 1024 | + +------------------------------------+----------------------------+------------------+------------------+------------------+ + |**Safety-starter-agents** | 51.64 ± 1.56 | 63.99 ± 1.75 | 50.70 ± 1.17 | 83.09 ± 0.92 | + +------------------------------------+----------------------------+------------------+------------------+------------------+ + | **RL-Safety-Algorithms** | 46.25 ± 0.43 | 107.50 ± 2.18 | 47.24 ± 0.43 | 134.12 ± 0.71 | + +------------------------------------+----------------------------+------------------+------------------+------------------+ + | **SafePO** | 15.91 ± 0.46 | 20.84 ± 0.26 | 16.50 ± 0.50 | 19.72 ± 0.16 | + +------------------------------------+----------------------------+------------------+------------------+------------------+ + | **OmniSafe** | **10.59 ± 0.15** | **14.02 ± 0.16** | **10.06 ± 0.09** | **12.28 ± 0.81** | + +------------------------------------+----------------------------+------------------+------------------+------------------+ + + +In our comparative experiments, we rigorously ensure uniformity across all experimental settings. More specifically, PPOLag and CPO implement early stopping techniques, which vary the number of +updates based on the KL divergence between the current and reference policies. This introduces +randomness into the time measurements. To control for consistent variables, we fixed the number of +``update_iters`` at 1, ``steps_per_epoch`` at 20,000, and ``batch_size`` at 64, conducting the tests on the same machine with no other processes running. The specific device parameters are: + +- **CPU**: AMD Ryzen Threadripper PRO 3975WX 32-Cores +- **GPU**: NVIDIA GeForce RTX 3090, Driver Version: 535.154.05 + +Under these consistent conditions, **OmniSafe achieved the lowest computational time consumption on +the same baseline algorithms**, which we attribute to 3 factors: *vectorized environment +parallelism* for accelerated data collections, `asynchronous agent parallelism `_ for parallelized learning, and *GPU resource utilization* for immense network +support. We will elaborate on how these features contribute to OmniSafe's computational efficiency. + +**Vectorized Environment Parallelism**: OmniSafe and SafePO support vectorized environment +interfaces and buffers. 
In this experiment, we set the parallelism number of vectorized environments to 10, meaning that a single agent can simultaneously generate 10 actions based on 10 vectorized observations and perform batch updates through a vectorized buffer. This feature enhances the efficiency of agents' data sampling from environments.

**Asynchronous Agent Parallelism**: OmniSafe supports *Asynchronous Advantage Actor-Critic (A3C)* parallelism based on the distributed framework ``torch.distributed``. In this experiment, we set the parallelism number of asynchronous agents to 2, meaning two agents were instantiated to sample and learn simultaneously, synchronizing their neural network parameters at the end of each epoch. This feature further enhances the efficiency of agent sampling and updating.

**GPU Resource Utilization**: Since only OmniSafe and SafePO utilize GPU computing resources, we used the NVIDIA GeForce RTX 3090 as the computing device in this experiment. As shown in :ref:`Table 1 <appendix_f>`, when the hidden layer parameters increased from 64 x 64 to 1024 x 1024, the runtime of RL-Safety-Algorithms and Safety-starter-agents increased significantly, whereas the runtime increase for OmniSafe and SafePO was comparatively small. This trend is particularly notable with the CPO algorithm, which requires computing a Hessian matrix during updates. If computed on a CPU, this overhead grows with the size of the neural network; however, OmniSafe and SafePO, which support GPU acceleration, are almost unaffected.

diff --git a/docs/source/start/exp-grid.md b/docs/source/start/exp-grid.md
new file mode 100644
index 000000000..ddd0b818f
--- /dev/null
+++ b/docs/source/start/exp-grid.md
@@ -0,0 +1,31 @@

# Experiment Grid

In the context of RL experiments, it is imperative to assess the performance of various algorithms across multiple environments. However, the inherent influence of randomness necessitates repeated evaluations employing distinct random seeds. To tackle this challenge, OmniSafe introduces an `Experiment Grid`, which facilitates the simultaneous launch of multiple sets of experiments. Researchers merely need to pre-configure the experimental parameters and can then execute multiple experiment sets in parallel via a single file. An example of this process is shown in Figure 1.

**Figure 1:** OmniSafe's `Experiment Grid`. The left side of the figure displays the main function of the `run_experiment_grid.py` file, while the right side shows the status of the `Experiment Grid` execution. In this example, three distinct random seeds are selected for the `SafetyAntVelocity-v1` and `SafetyWalker2dVelocity-v1` environments, and the PPOLag and TRPO-Lag algorithms are executed.
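As a rough textual counterpart to the grid shown in Figure 1, the example can be configured along three axes (algorithm, environment, seed). This is a sketch modeled on the `run_experiment_grid.py` script mentioned below; the module paths `omnisafe.common.experiment_grid.ExperimentGrid` and `omnisafe.utils.exp_grid_tools.train`, as well as the `num_pool` argument, are assumptions to confirm against the installed OmniSafe version.

```python
from omnisafe.common.experiment_grid import ExperimentGrid
from omnisafe.utils.exp_grid_tools import train  # assumed helper that trains one (algo, env, seed) cell

if __name__ == '__main__':
    eg = ExperimentGrid(exp_name='Example_Grid')
    # Three axes, matching the example in Figure 1.
    eg.add('algo', ['PPOLag', 'TRPOLag'])
    eg.add('env_id', ['SafetyAntVelocity-v1', 'SafetyWalker2dVelocity-v1'])
    eg.add('seed', [0, 5, 10])
    # Launch all 2 x 2 x 3 = 12 runs across a pool of parallel worker processes.
    eg.run(train, num_pool=12)
```

Every axis added with `eg.add` is crossed with the others, which is why two algorithms, two environments, and three seeds yield twelve runs.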
Figure 2 panels: `SafetyAntVelocity-v1` and `SafetyWalker2dVelocity-v1`.
**Figure 2:** Analysis of the example experiment results. The blue lines are the results from PPOLag, while the orange ones are from TRPO-Lag. The solid line represents the mean over multiple random seeds, while the shaded region represents the standard deviation across random seeds 0, 5, and 10.

The `run_experiment_grid.py` script executes experiments in parallel based on user-configured parameters and generates corresponding graphs of the experimental results. In the example presented in Figure 1, we specified that the script should draw curves grouped by environment, obtaining the training curves of PPOLag and TRPO-Lag in `SafetyAntVelocity-v1` and `SafetyWalker2dVelocity-v1` with seeds aggregated.

Moreover, combined with `Statistics Tools`, the `Experiment Grid` is a powerful tool for parameter tuning. As illustrated in Figure 3, we utilized the `Experiment Grid` to explore the impact of `batch_size` on the performance of PPOLag and TRPO-Lag in `SafetyWalker2dVelocity-v1` and `SafetyAntVelocity-v1`, and then used `Statistics Tools` to analyze the results. It is evident that the `batch_size` has a significant influence on the performance of PPOLag in `SafetyWalker2dVelocity-v1`, and that the optimal `batch_size` is 128. Reaching this conclusion requires repeating the experiment multiple times, and the `Experiment Grid` significantly expedites the process.
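The `batch_size` sweep of Figure 3 then amounts to one extra axis on the same grid. Again a hedged sketch: the colon-separated nested key `algo_cfgs:batch_size` follows the `algo_cfgs` naming used in the caption below, and both the key syntax and the helper imports are assumptions to check against the installed version.

```python
from omnisafe.common.experiment_grid import ExperimentGrid
from omnisafe.utils.exp_grid_tools import train  # assumed helper, as in the previous sketch

if __name__ == '__main__':
    eg = ExperimentGrid(exp_name='BatchSize_Tuning')
    eg.add('algo', ['PPOLag', 'TRPOLag'])
    eg.add('env_id', ['SafetyAntVelocity-v1', 'SafetyWalker2dVelocity-v1'])
    # Assumed colon syntax for reaching a field nested under algo_cfgs.
    eg.add('algo_cfgs:batch_size', [64, 128, 256])
    eg.add('seed', [0, 5, 10])
    eg.run(train, num_pool=6)
```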
**Figure 3:** An example of how the `Experiment Grid` can be utilized for parameter tuning. In this particular example, we set the `batch_size` in the `algo_cfgs` to 64, 128, and 256, ran multiple experiments using the `Experiment Grid`, and finally used `Statistics Tools` to analyze the impact of the `batch_size` on the performance of the algorithm. Note that different colors denote different `batch_size` values. The results show that the `batch_size` has a significant effect on the performance of the algorithm, and the optimal `batch_size` was found to be 128. The `Experiment Grid` enabled us to efficiently explore the effect of different parameter values on the algorithm's performance.

diff --git a/docs/source/start/features.md b/docs/source/start/features.md
new file mode 100644
index 000000000..e84e0e98f
--- /dev/null
+++ b/docs/source/start/features.md
@@ -0,0 +1,257 @@

# Features

OmniSafe transcends its role as a mere SafeRL library, functioning concurrently as a standardized and user-friendly SafeRL infrastructure. We compared the features of OmniSafe with those of popular open-source RL libraries. [See comparison results](#compare_with_repo).

> **Note:** All results in [compare_with_repo](#compare_with_repo) are accurate as of 2024. Please refer to the latest versions of these libraries if you find any discrepancies in these data.

**Table 1:** Comparison of OmniSafe to a representative subset of RL or SafeRL libraries.
The comparison covers OmniSafe, TianShou, Stable-Baselines3, SafePO, RL-Safety-Algorithms, and Safety-starter-agents across the following features: Algorithm Tutorial, API Documentation, Command Line Interface, Custom Environment, Docker Support, GPU Support, Ipython / Notebook, PEP8 Code Style, Statistics Tools, Test Coverage (97%, 91%, 96%, 91%, -, and -, respectively), Type Hints, Vectorized Environments, and Video Examples.
Compared to the classic RL open-source libraries [TianShou](https://www.jmlr.org/papers/v23/21-1127.html) and [Stable-Baselines3](https://jmlr.org/papers/v22/20-1364.html), OmniSafe adheres to the same engineering standards and supports equally user-friendly features. Compared to the SafeRL libraries [SafePO](https://proceedings.neurips.cc/paper_files/paper/2023/file/3c557a3d6a48cc99444f85e924c66753-Paper-Datasets_and_Benchmarks.pdf), [RL-Safety-Algorithms](https://github.com/SvenGronauer/RL-Safety-Algorithms), and [Safety-starter-agents](https://github.com/openai/safety-starter-agents), OmniSafe offers greater ease of use and robustness, making it a foundational infrastructure for accelerating SafeRL research. The complete codebase of OmniSafe adheres to the PEP8 style, with each commit undergoing stringent checks such as `isort`, `pylint`, `black`, and `ruff`. Before merging into the main branch, code modifications require approval from at least two reviewers. These practices enhance the reliability of OmniSafe and provide assurance for effective ongoing development.

OmniSafe includes a tutorial on `Colab` that provides a step-by-step guide to the training process, as illustrated in [Figure 2](#figure_2). For those who are new to SafeRL, the tutorial allows for interactive learning of the training procedure. By clicking on `Colab Tutorial`, users can access it and follow the instructions to better understand how to use OmniSafe. Seasoned researchers can capitalize on OmniSafe's informative command-line interface, as demonstrated in [Figure 1](#figure_1) and [Figure 3](#figure_3), to quickly grasp how to use the platform and expedite their scientific investigations.

Regarding the experiment execution process, OmniSafe presents an array of tools for analyzing experimental outcomes, encompassing `WandB`, `TensorBoard`, and `Statistics Tools`. Furthermore, OmniSafe has submitted its experimental benchmark to the `WandB` report [1], as depicted in [Figure 4](#figure_4). This report furnishes more detailed training curves and evaluation demonstrations of classic algorithms, serving as a valuable reference for researchers.

[1]: [https://api.wandb.ai/links/pku_rl/mv1eeetb](https://api.wandb.ai/links/pku_rl/mv1eeetb) | [https://api.wandb.ai/links/pku_rl/scvni0oj](https://api.wandb.ai/links/pku_rl/scvni0oj)

[cli]: #cli
[tutorial]: #tutorial
[cli_details]: #cli_details
[wandb_video]: #wandb_video
+ + **Figure 1:** An illustration of the OmniSafe command line interface. Users can view the commands supported by OmniSafe and a brief usage guide by simply typing `omnisafe --help` in the command line. If a user wants to further understand how to use a specific command, they can obtain additional prompts by using the command `omnisafe COMMAND --help`, as shown in [Figure 3](#figure_3). + + +
**Figure 2:** An example demonstrating the Colab tutorial provided by OmniSafe for using the `Experiment Grid`. The tutorial includes detailed usage descriptions and allows users to run it themselves and then inspect the results.
+ +(a) Example of `omnisafe analyze-grid --help` in command line. + + +
+ +(b) Example of `omnisafe benchmark --help` in command line. + + +
+ +(c) Example of `omnisafe eval --help` in command line. + + +
+ +(d) Example of `omnisafe train-config --help` in command line. + + +
**Figure 3:** Further details on using the `omnisafe --help` command. Users can input `omnisafe COMMAND --help` to get help, where `COMMAND` includes all the items listed under `Commands` in [Figure 1](#figure_1). This feature enables users to quickly become proficient in the common operations provided by OmniSafe via the command line and to customize them further to meet their specific requirements.
Figure 4 panels: (a) SafetyPointGoal1-v0, (b) SafetyPointButton1-v0, (c) SafetyCarGoal1-v0, (d) SafetyCarButton1-v0.

Figure 4: An exemplification of OmniSafe's WandB report videos. This example supplies videos of PPO and PPOLag in the SafetyPointGoal1-v0, SafetyPointButton1-v0, SafetyCarGoal1-v0, and SafetyCarButton1-v0 environments. The left of each sub-figure is PPO, while the right is PPOLag. Through these videos, we can intuitively see the difference between safe and unsafe behavior. This is exactly what OmniSafe pursues: not just the safety of the training curve, but true safety in a real sense.

+ +**Figure 5:** An exemplification of OmniSafe's `WandB` reports training curve in `SafetyPointGoal1-v0`: The left panel represents the episode reward, and the right panel denotes the episode cost, with both encompassing the performance over 1e7 steps. diff --git a/omnisafe/adapter/modelbased_adapter.py b/omnisafe/adapter/modelbased_adapter.py index 8abbd90d7..5d4321bbf 100644 --- a/omnisafe/adapter/modelbased_adapter.py +++ b/omnisafe/adapter/modelbased_adapter.py @@ -330,7 +330,7 @@ def rollout( # pylint: disable=too-many-arguments,too-many-locals eval_start = time.time() eval_func(current_step, True) self._last_eval = current_step - eval_time += time.time() - eval_start + eval_time += time.time() - eval_start # pylint: disable=undefined-variable if not self._first_log or current_step >= self._cfgs.train_cfgs.total_steps: self._log_metrics(logger) diff --git a/omnisafe/common/logger.py b/omnisafe/common/logger.py index 43d447800..9fc753e46 100644 --- a/omnisafe/common/logger.py +++ b/omnisafe/common/logger.py @@ -144,10 +144,10 @@ def __init__( # pylint: disable=too-many-arguments,too-many-locals config=config, ) if config is not None: - wandb.config.update(config) + wandb.config.update(config) # type: ignore if models is not None: for model in models: - wandb.watch(model) + wandb.watch(model) # type: ignore def log(self, msg: str, color: str = 'green', bold: bool = False) -> None: """Log the message to the console and the file. diff --git a/omnisafe/common/offline/data_collector.py b/omnisafe/common/offline/data_collector.py index 35d1e1b75..fc95fda00 100644 --- a/omnisafe/common/offline/data_collector.py +++ b/omnisafe/common/offline/data_collector.py @@ -110,7 +110,7 @@ def register_agent(self, save_dir: str, model_name: str, size: int) -> None: model_path = os.path.join(save_dir, 'torch_save', model_name) try: - model_params = torch.load(model_path) + model_params = torch.load(model_path, weights_only=False) except FileNotFoundError as error: raise FileNotFoundError(f'Model {model_name} not found in {save_dir}') from error diff --git a/omnisafe/envs/classic_control/envs_from_crabs.py b/omnisafe/envs/classic_control/envs_from_crabs.py index a3fd3b404..4f6933db0 100644 --- a/omnisafe/envs/classic_control/envs_from_crabs.py +++ b/omnisafe/envs/classic_control/envs_from_crabs.py @@ -238,7 +238,7 @@ def _get_obs(self): else: return np.array([np.cos(th), np.sin(th), thdot], dtype=np.float32) - def reset(self): + def reset(self): # type: ignore """Reset the environment.""" self.state = self.init_state self.last_u = None diff --git a/omnisafe/envs/safety_gymnasium_modelbased.py b/omnisafe/envs/safety_gymnasium_modelbased.py index fe5ae5071..372ccc4e8 100644 --- a/omnisafe/envs/safety_gymnasium_modelbased.py +++ b/omnisafe/envs/safety_gymnasium_modelbased.py @@ -174,6 +174,7 @@ def get_cost_from_obs_tensor(self, obs: torch.Tensor, is_binary: bool = True) -> cost: Batch cost. 
""" assert torch.is_tensor(obs), 'obs must be tensor' + assert len(obs.shape) == 2 or len(obs.shape) == 3 hazards_key = self.key_to_slice_tensor['hazards'] if len(obs.shape) == 2: batch_size = obs.shape[0] @@ -181,7 +182,11 @@ def get_cost_from_obs_tensor(self, obs: torch.Tensor, is_binary: bool = True) -> elif len(obs.shape) == 3: batch_size = obs.shape[0] * obs.shape[1] hazard_obs = obs[:, :, hazards_key].reshape(batch_size, -1, 2) - hazards_dist = torch.sqrt(torch.sum(torch.square(hazard_obs), dim=2)).reshape( + else: + raise RuntimeError('observation size mismatch') + hazards_dist = torch.sqrt( + torch.sum(torch.square(hazard_obs), dim=2), + ).reshape( batch_size, -1, ) @@ -499,7 +504,7 @@ def reset( info['goal_met'] = False obs = torch.as_tensor(flat_coordinate_obs, dtype=torch.float32, device=self._device) - return obs, info + return obs, info # pylint: disable=possibly-used-before-assignment def set_seed(self, seed: int) -> None: """Set the seed for the environment. diff --git a/omnisafe/evaluator.py b/omnisafe/evaluator.py index 8732d6e34..13eac7263 100644 --- a/omnisafe/evaluator.py +++ b/omnisafe/evaluator.py @@ -150,7 +150,7 @@ def __load_model_and_env( # load the saved model model_path = os.path.join(save_dir, 'torch_save', model_name) try: - model_params = torch.load(model_path) + model_params = torch.load(model_path, weights_only=False) except FileNotFoundError as error: raise FileNotFoundError('The model is not found in the save directory.') from error diff --git a/omnisafe/utils/plotter.py b/omnisafe/utils/plotter.py index 5bdbb7ec2..f24a97bb4 100644 --- a/omnisafe/utils/plotter.py +++ b/omnisafe/utils/plotter.py @@ -118,8 +118,7 @@ def plot_data( smoothed_x = np.convolve(x, y, 'same') / np.convolve(z, y, 'same') datum['Costs'] = smoothed_x - if isinstance(data, list): - data_to_plot = pd.concat(data, ignore_index=True) + data_to_plot = pd.concat(data, ignore_index=True) sns.lineplot( data=data_to_plot, x=xaxis, diff --git a/pyproject.toml b/pyproject.toml index a74b46723..d7351aeb5 100644 --- a/pyproject.toml +++ b/pyproject.toml @@ -39,6 +39,11 @@ dependencies = [ "matplotlib >= 3.7.1", "gdown >= 4.6.0", "pytorch_lightning >= 2.2.2", + "cvxopt== 1.3.2", + "gpytorch== 1.11", + "joblib == 1.3.2", + "qpth == 0.0.16", + "scikit_learn == 1.3.2" ] dynamic = ["version", "entry-points"] @@ -125,9 +130,8 @@ ignore-words = "docs/source/spelling_wordlist.txt" # Sync with requires-python target-version = "py38" line-length = 100 -show-source = true src = ["omnisafe", "tests", "examples"] -select = [ +lint.select = [ "E", "W", # pycodestyle "F", # pyflakes "UP", # pyupgrade @@ -148,7 +152,7 @@ select = [ "TID", # flake8-tidy-imports "RUF", # ruff ] -ignore = [ +lint.ignore = [ # E501: line too long # W505: doc line too long # too long docstring due to long example blocks @@ -167,9 +171,9 @@ ignore = [ # use alias for import convention (e.g., `import torch.nn as nn`) "PLR0402", ] -typing-modules = ["omnisafe.typing"] +lint.typing-modules = ["omnisafe.typing"] -[tool.ruff.per-file-ignores] +[tool.ruff.lint.per-file-ignores] "__init__.py" = [ "F401", # unused-import ] @@ -231,15 +235,15 @@ typing-modules = ["omnisafe.typing"] "ANN003", # Missing type annotation ] -[tool.ruff.flake8-annotations] +[tool.ruff.lint.flake8-annotations] allow-star-arg-any = true -[tool.ruff.flake8-quotes] +[tool.ruff.lint.flake8-quotes] docstring-quotes = "double" multiline-quotes = "double" inline-quotes = "single" -[tool.ruff.flake8-tidy-imports] +[tool.ruff.lint.flake8-tidy-imports] ban-relative-imports = 
"all" [tool.pytest.ini_options]