Scripts for MBR/C-GRPO experiments.
The experiments were conducted on NVIDIA A100 GPUs with 80 GB of VRAM.
- src/: main training/evaluation code
- scripts/: runnable shell scripts
- data/: data prep and utilities
- dataset/: dataset directory (after download/creation)
- Python 3.12
- cuda:12.6.1-devel-ubuntu22.04
- Install dependencies
    bash scripts/setup.sh
- Run
    bash scripts/run_c_grpo.sh

To run MBR decoding instead, use scripts/run_mbr.sh. You can edit the arguments at the top of each script.
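
For readers unfamiliar with MBR (minimum Bayes risk) decoding, the sketch below illustrates the general idea: among several sampled outputs, pick the one with the highest average utility against the others. This is not the code in src/ or scripts/run_mbr.sh. The names `token_f1` and `mbr_decode` and the token-overlap utility are assumptions chosen for illustration, and the actual experiments may use a different utility function and sampling setup.

```python
from collections import Counter


def token_f1(hypothesis: str, reference: str) -> float:
    """Toy utility (an assumption for this sketch): token-level F1 overlap."""
    hyp, ref = hypothesis.split(), reference.split()
    if not hyp or not ref:
        return 0.0
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(hyp) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(hyp), overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def mbr_decode(candidates: list[str], utility=token_f1) -> str:
    """Return the candidate with the highest mean utility against the others."""
    if not candidates:
        raise ValueError("candidates must be non-empty")
    if len(candidates) == 1:
        return candidates[0]

    def expected_utility(i: int) -> float:
        # Treat every other sample as a pseudo-reference for candidate i.
        others = [c for j, c in enumerate(candidates) if j != i]
        return sum(utility(candidates[i], o) for o in others) / len(others)

    # Ties are broken in favor of the earliest candidate.
    best = max(range(len(candidates)), key=expected_utility)
    return candidates[best]


if __name__ == "__main__":
    # Hypothetical sampled outputs for a single input.
    samples = ["the cat sat", "the cat sat down", "the cat", "a dog ran"]
    print(mbr_decode(samples))
```

Scoring every candidate against every other sample costs a number of utility calls quadratic in the sample count, which is worth keeping in mind when choosing how many samples to draw.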
