Action abstractions for amortized sampling

Description

This is the official repository for the paper "Action abstractions for amortized sampling" by Oussama Boussif, Léna Néhale Ezzine, Joseph D Viviano, Michał Koziarski, Moksh Jain, Nikolay Malkin, Emmanuel Bengio, Rim Assouel and Yoshua Bengio.

We introduce ActionPiece, a method for discovering action abstractions in reinforcement learning (RL) and generative flow networks (GFlowNets) to improve exploration and credit assignment in long-horizon tasks. By iteratively identifying and chunking frequently used action subsequences, our approach enhances sample efficiency and mode discovery, particularly in entropy-seeking RL. Empirical results show improved performance in discovering diverse high-reward states, with learned abstractions capturing the latent structure of the action space.
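
As a rough illustration of the chunking idea described above, the sketch below applies byte-pair-encoding-style merges to a toy set of action sequences: it counts adjacent action pairs, then replaces the most frequent pair with a single new "chunk" action, and repeats. This is a sketch under stated assumptions, not the code in this repository. The function names (`most_frequent_pair`, `merge_pair`, `learn_chunks`) are hypothetical, and pair merging is only one possible chunking rule.

```python
from collections import Counter


def most_frequent_pair(trajectories):
    """Return the most common adjacent action pair, or None if no pairs exist."""
    counts = Counter()
    for traj in trajectories:
        counts.update(zip(traj, traj[1:]))
    if not counts:
        return None
    return counts.most_common(1)[0][0]


def merge_pair(traj, pair):
    """Replace each non-overlapping occurrence of `pair` with one chunk action."""
    merged, i = [], 0
    while i < len(traj):
        if i + 1 < len(traj) and (traj[i], traj[i + 1]) == pair:
            merged.append(pair[0] + pair[1])  # chunk = concatenated action strings
            i += 2
        else:
            merged.append(traj[i])
            i += 1
    return merged


def learn_chunks(trajectories, num_chunks):
    """Iteratively add up to `num_chunks` chunk actions (assumed BPE-style rule)."""
    chunks = []
    for _ in range(num_chunks):
        pair = most_frequent_pair(trajectories)
        if pair is None:
            break
        chunks.append(pair[0] + pair[1])
        trajectories = [merge_pair(t, pair) for t in trajectories]
    return chunks, trajectories


# Toy action sequences over the primitive actions "0" and "1".
toy = [list("010101"), list("0101"), list("1101")]
chunks, rewritten = learn_chunks(toy, num_chunks=2)
print(chunks)     # learned chunk actions, in the order they were added
print(rewritten)  # the same sequences expressed with the enlarged action set
```

After each merge the sequences become shorter, which is the intuition behind easier credit assignment over long horizons.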

Citation

If you use this codebase, or otherwise find our work valuable, please cite ActionPiece:

@inproceedings{Boussif2024action,
  title  = {Action abstractions for amortized sampling},
  author = {Oussama Boussif and Lena Nehale Ezzine and Joseph D Viviano and Michał Koziarski and Moksh Jain and Nikolay Malkin and Emmanuel Bengio and Rim Assouel and Yoshua Bengio},
  year   = {2024},
  url    = {https://openreview.net/forum?id=ispjankYab&referrer=%5Bthe%20profile%20of%20Oussama%20Boussif%5D(%2Fprofile%3Fid%3D~Oussama_Boussif1)}
}

Installation

This project requires Python >= 3.10. To install, we recommend first setting up a virtual environment of your choice and then installing this package with pip:

pip install -e .

Experiments

Scripts for the experiment runs can be found in sbatch_scripts/. Experiments are launched via main.py, and all options are handled by Hydra. For example:

python main.py seed=42 environment=bit_sequence algo=tb_gfn trainer.max_epochs=1000 environment.max_len=128 algo.replay_buffer.cutoff_distance=25 algo.reward_temperature=0.3333 logger.wandb.name="prioritized-len-128"
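
To launch several runs that differ only in a few options, it can help to generate the command lines programmatically. The sketch below formats a dictionary of key=value overrides, using only option names from the example above, into one command per seed. The helper `build_command` and the seed values are hypothetical and not part of this repository. The commands are only printed, not executed.

```python
import shlex


def build_command(overrides, script="main.py"):
    """Format a dict of Hydra-style key=value overrides as an argv list."""
    return ["python", script] + [f"{key}={value}" for key, value in overrides.items()]


# Options shared by every run (names taken from the example command above).
base = {
    "environment": "bit_sequence",
    "algo": "tb_gfn",
    "trainer.max_epochs": 1000,
    "environment.max_len": 128,
}

# One command per seed; the seed values are arbitrary examples.
commands = [build_command({"seed": seed, **base}) for seed in (1, 2, 3)]
for argv in commands:
    print(shlex.join(argv))
```

Each printed line can be pasted into a shell or passed to `subprocess.run` as a list.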

Datasets

To make some datasets available, add the following to your environment:

#!/bin/bash
export CHUNKGFN_DATA="/path/to/code/chunk-gfn/data"

To download those datasets, see /path/to/code/chunk-gfn/data/${dataset}/download.sh, where ${dataset} is the name of the dataset.
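
A quick way to check that the variable is visible from Python is to resolve the download script for a given dataset. This is a minimal sketch, assuming the `${dataset}/download.sh` layout described above. The helper `download_script` and the dataset name `my_dataset` are hypothetical placeholders.

```python
import os
from pathlib import Path


def download_script(dataset):
    """Return the path to a dataset's download.sh under $CHUNKGFN_DATA."""
    root = os.environ.get("CHUNKGFN_DATA")
    if root is None:
        raise RuntimeError("CHUNKGFN_DATA is not set; export it first.")
    return Path(root) / dataset / "download.sh"


# Falls back to the placeholder path from above if the variable is unset.
os.environ.setdefault("CHUNKGFN_DATA", "/path/to/code/chunk-gfn/data")
script = download_script("my_dataset")  # "my_dataset" is a hypothetical name
print(script, "(exists)" if script.exists() else "(not found)")
```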

Logs

The logging directory is set in configs/paths/default.yaml. By default it is log_dir: ${oc.env:PROJECT_DIR}/logs/, and it can be changed to any location in your environment if desired.

When using SLURM, the scheduler automatically defines the environment variables SLURM_JOB_ID and SLURM_JOB_NAME, and our code expects them to be defined. When not using SLURM, both variables are generated automatically. Their values determine the log directory.
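
The fallback behaviour can be pictured with the sketch below, which fills in the two variables only when SLURM has not already set them. How the repository actually generates these values is not documented here, so the timestamp-based ID, the fixed name "local", and the helper `ensure_job_env` are all assumptions made for illustration.

```python
import os
import time


def ensure_job_env(environ=os.environ):
    """Fill in SLURM_JOB_ID / SLURM_JOB_NAME when SLURM has not set them."""
    # Assumption: a timestamp-based ID and a fixed name; the real defaults may differ.
    environ.setdefault("SLURM_JOB_ID", str(int(time.time())))
    environ.setdefault("SLURM_JOB_NAME", "local")
    return environ["SLURM_JOB_ID"], environ["SLURM_JOB_NAME"]


job_id, job_name = ensure_job_env()
print(f"job name: {job_name}, job id: {job_id}")
```

Values already provided by SLURM are left untouched, since `setdefault` only writes keys that are missing.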
