uniMASK is a generalization of BERT models with flexible abstractions for performing inference on subportions of sequences. Masking and prediction can occur both at the token level (as in a traditional transformer) and on subportions of individual tokens.

You can find the full paper here.
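As a rough illustration of this idea, the sketch below builds a combined mask that hides some whole tokens and, independently, some sub-portions (factors) of the remaining tokens. This is a minimal numpy sketch with made-up names, not uniMASK's actual API (the real abstractions are the `FactorSeq`/`TokenSeq` classes described below):

```python
import numpy as np

rng = np.random.default_rng(0)

def make_mask(seq_len, factor_dim, p_token=0.3, p_factor=0.5):
    """Hypothetical BERT-style mask: hide whole tokens with prob p_token
    and, independently, individual factors of each token with prob
    p_factor. True = position is masked (i.e., to be predicted)."""
    token_mask = rng.random(seq_len) < p_token                   # (T,)
    factor_mask = rng.random((seq_len, factor_dim)) < p_factor   # (T, D)
    # A position is hidden if its whole token is masked OR its factor is.
    return token_mask[:, None] | factor_mask                     # (T, D)

mask = make_mask(seq_len=10, factor_dim=4)
```

A model trained this way can answer many inference queries at test time simply by choosing which positions to mask.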
To install uniMASK, run:

```shell
conda create -n uniMASK python=3.7
conda activate uniMASK
pip install -e .
```
uniMASK requires D4RL. You can install it as described in the D4RL repo, e.g. by running:

```shell
pip install git+https://github.com/Farama-Foundation/d4rl@master#egg=d4rl
```
For CUDA support, you may need to reinstall PyTorch in CUDA mode, for example:

```shell
pip install torch --extra-index-url https://download.pytorch.org/whl/cu116
```
To verify that the installation was successful, run `pytest`.
Note: Reproducing all runs can take a long time. We recommend parallelizing runs. In each script, the first line (a comment) contains an example of how to use GNU Parallel for this purpose.
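For example, assuming GNU Parallel is installed, the non-comment commands in one of these scripts can be dispatched several at a time (the stand-in script, file paths, and job count below are illustrative):

```shell
# Illustrative stand-in for a repro script: each non-comment line is an
# independent command (here just echoes).
printf '# example comment\necho run-a\necho run-b\n' > /tmp/demo_repro.sh

# Run the non-comment commands four at a time with GNU Parallel.
grep -v '^#' /tmp/demo_repro.sh | parallel --jobs 4
```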
- Run the commands found in `minigrid_repro.sh`.
- Fine-tune the pre-trained models generated in the previous step by running the commands in `minigrid_ft_repro.sh`.
- Generate the heatmaps from these runs by running `minigrid_heatmap.sh` (no parallelization here).
- You will then find the heatmap in `uniMASK/scripts`.
To reproduce the Maze2D table in the paper:
- Run the wandb sweeps `medium_maze_sweep_all.yaml` and `medium_maze_sweep_DT.yaml`.
- Run the fine-tuning runs in `maze_ft_final.sh`.
- Parse the results with `Parse wandb Maze Experiments.ipynb`.
- `scripts/train.py`: the main script for running uniMASK; start here.
- `data/`: where rollouts (`datasets`) and trained models (`transformer_runs`) are stored.
- `envs/`: data-handling and evaluation for each supported environment.
- `scripts/`: scripts for reproducing results from the paper, and for running uniMASK in general.
- `batches.py`, `sequences.py`: the data-pipeline processing classes (`FactorSeq`, `TokenSeq`, `FullTokenSeq`, `Batch`, `SubBatch`).
- `trainer.py`: the `Trainer` class, which handles the training loop for all models.
- `transformer.py`: contains the transformer model class itself.
- `transformer_train.py`: interface and config setting for training a transformer through the `Trainer` class.
- `utils.py`: miscellaneous utilities: math functions, GPU handling, profiling, etc.
- `transformer_eval.py`: interface for getting predictions from a transformer (currently empty).
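Since `trainer.py` owns the training loop, its core objective can be sketched generically as a loss computed only on masked (to-be-predicted) positions. The numpy sketch below uses made-up names and is not uniMASK's actual code:

```python
import numpy as np

def masked_mse(pred, target, mask):
    """Mean squared error over only the masked entries (mask True =
    position was hidden from the model and must be predicted)."""
    diff = (pred - target) ** 2
    return diff[mask].mean()

# Tiny worked example: predictions are all wrong by 1.0 on the masked row.
pred = np.zeros((5, 3))
target = np.ones((5, 3))
mask = np.zeros((5, 3), dtype=bool)
mask[0] = True  # only the first token's factors are masked
loss = masked_mse(pred, target, mask)  # → 1.0
```

Restricting the loss to masked positions is what lets one architecture serve many inference tasks: changing the mask changes the objective without changing the model.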