This repository contains the code used for the paper
This repository is no longer active. However, you are welcome to email the lead author at [email protected] with questions regarding the code.
The env.yml file contains the necessary packages. You can easily build all dependencies with Mamba
mamba env create -f env.yml
We introduce three samplers in the paper (NCG, AVG & PAVG), which are all represented as python classes. The NCG class can be found in samplers/regular_samplers.py
, whilst the (P)AVG classes are in samplers/auxiliary_samplers.py
. All of these classes have a .step()
method that implements one step of the respective transition operator. These step methods all use caching and gpu-friendly vectorisation to reduce memory & runtime.
Each python script in the main directory (except networks.py
) corresponds to an experiment reported in the paper, as we elaborate in the sections below.
Figure 1 in the paper can be reproduced by running plotting_scripts/make_paper_overview_figure.py
The results shown in Figure 2 of the paper were created by running the following two commands
python sample_ordinal --model_name=mixture50_poly2
python sample_ordinal --model_name=mixture50_poly4
The results from the first command are placed in results/ordinal/dim20/mixture50_poly2_ssize50/TIMESTAMP
, and analgously for the poly4
results.
python sample_sparse_bayes_linear.py
Results are placed in results/sbl/20_100/TIMESTAMP
.
Table 1 was generated by first running
python run_all_ising_script.py
and then running
python plotting_scripts/make_tables_for_paper.py --plot_type=ising_lattice_table
Table 2 was generated by first running
python run_all_neural_ising_script.py
and then running
python plotting_scripts/make_tables_for_paper.py --plot_type=usps_table
python run_ising_stepsize_sensitivity.py
and then running
python plotting_scripts/stepsize_sensitivity_ising_plots.py
you should find Figure 7 at results/ising_lattice_sigma0.2_stepsize_sensitivity/step_size_sensitivity.pdf
python analyse_higher_order_sensitivity_ordinal.py
The results from this run, including Figure 12, will be placed in results/ising/sigma0.2/dim16/TIMESTAMP
. If we then run
python plotting_scripts/higher_order_sensitivity_plots.py --timestamp=TIMESTAMP
then Figure 11 we be placed in the same directory as Figure 12.
We have tried to structure the code to be relatively re-usable.
All of the "pure" sampling scripts (Ordinal + bayesian regression + Ising model with higher-order interactions) have the same structure that can serve as a template if you wish to alter our code by e.g. adding a new MCMC sampler or defining a new target distribution. The main()
of each script essentially does the following:
- defines a
target_dist
which is a callable that computes the (unnormalised) log probability of the target distribution - defines the initial batch of chains i.e.
chain_init
- defines a
methods
list-of-dicts, where each dict specifies a particular MCMC operator (including any hyperparameters like step-size) - defines a callable
metric_fn
, which will be repeatedly called during sampling to compute convergence metrics.metric_fn
takes inx_all
, which is the history of the MCMC chains up until that point, computes arbitrary metrics of interest using that history, and saves these metrics tometrics_dict
. - defines a callable
plot_and_save_fn
, which takes all of the data accumulated during sampling (such as the data stored inmetrics_dict
), and creates plots from them. - All of the above data/functions are then fed into
run_sampling_procedure
, which is a generic function for running a set of MCMC methods for many iterations and collating the results into figures.