InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences (ICLR 2025 spotlight)
Hongkai Zheng*, Wenda Chu*, Bingliang Zhang*, Zihui Wu*, Austin Wang, Berthy T. Feng, Caifeng Zou, Yu Sun, Nikola Kovachki, Zachary E. Ross, Katherine L. Bouman, Yisong Yue
(*: Equal contribution)
Abstract: Plug-and-play diffusion priors (PnPDP) have emerged as a promising research direction for solving inverse problems.
However, current studies primarily focus on natural image restoration, leaving the performance of these algorithms in scientific inverse problems largely unexplored. To address this gap, we introduce InverseBench, a framework that evaluates diffusion models across five distinct scientific inverse problems. These problems present unique structural challenges that differ from existing benchmarks, arising from critical scientific applications such as optical tomography, medical imaging, black hole imaging, seismology, and fluid dynamics. With InverseBench, we benchmark 14 inverse problem algorithms that use plug-and-play diffusion priors against strong, domain-specific baselines, offering valuable new insights into the strengths and weaknesses of existing algorithms.
- We recommend Linux with 64-bit Python 3.11 for the best compatibility.
- At least one high-end GPU for inference. All our experiments were conducted on A100 GPUs.
- The main dependencies are `pytorch`, `scipy`, `hydra-core`, `lmdb`, `piq`, `wandb`, `accelerate`, `devito`, `dask`, `ehtim`, `ehtplot`, `h5py`, `sigpy`, `pynfft`, and `pytest`. Note that `pynfft` is required for the black hole imaging task and is compatible only with `numpy 1.xx`.
- We also provide a Dockerfile under `Docker`, offering the best compatibility for all five tasks. It can be used as follows:
```bash
# Build docker image
docker build -t [image tag] --build-arg USER_ID=$(id -u) --build-arg GROUP_ID=$(id -g) .

# Run docker container
docker run --gpus all -it --rm --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 -v [path to the top of this git repo]:/inverse-bench -v [path to data]:/data [image tag]
```
Breakdown of the `docker run` command:
- `--gpus all -it --rm`: enable all GPUs, run an interactive session, and automatically remove the container when it exits.
- `--ipc=host --ulimit memlock=-1 --ulimit stack=67108864`: flags recommended by NVIDIA to lift resource constraints.
- `-v [path to the top of this repo]:/inverse-bench -v [path to data]:/data`: mount the repository root to `/inverse-bench` and the data directory to `/data`.
Pre-trained model weights can be found in the GitHub release page.
| Problem | Pretrained diffusion model | Training config |
|---|---|---|
| Full waveform inversion | `fwi-5m.pt` | `configs/pretrain/fwi.yaml` |
| Linear inverse scattering | `in-scatter-5m.pt` | `configs/pretrain/inv-scatter.yaml` |
| 2D Navier-Stokes | `ns-5m.pt` | `configs/pretrain/navier-stokes.yaml` |
| Black hole | `blackhole.pt` | `configs/pretrain/blackhole.yaml` |
| MRI (brain) | `mri-brain.pt` | `configs/pretrain/brain-knee-mvue.yaml` |
| MRI (knee) | `mri-knee.pt` | `configs/pretrain/mri-knee-mvue.yaml` |
| FFHQ256 | `ffhq256.pt` | `configs/pretrain/ffhq256.yaml` |
Note that `ffhq256.pt` is converted from the DPS repository. We used this model to verify that our reimplementation matches the results reported in the original paper.
You can also train your own diffusion prior via

```bash
accelerate launch --multi_gpu train.py -cn [specify config_name]
```
The test and validation data for each problem are available at the data page. A detailed description of each dataset can be found in `Dataset_Card.md`.
By default, `configs/config.yaml` is loaded for inference. You can override config values with

```bash
python3 main.py problem=[inverse problem config name] algorithm=[algorithm config name] pretrain=[pretrained model config name]
```
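Under the hood, Hydra merges such `key=value` overrides into the loaded config. A minimal stdlib sketch of the idea (simplified for illustration; this is not the actual Hydra implementation, and real Hydra also converts value types):

```python
import copy

def apply_overrides(cfg: dict, overrides: list[str]) -> dict:
    """Merge Hydra-style key=value overrides into a nested config dict."""
    cfg = copy.deepcopy(cfg)
    for item in overrides:
        key, _, value = item.partition("=")
        # Dotted keys like "algorithm.lr" address nested config groups.
        node = cfg
        *parents, leaf = key.split(".")
        for p in parents:
            node = node.setdefault(p, {})
        node[leaf] = value  # note: values stay strings in this sketch
    return cfg

base = {"problem": "inv-scatter", "algorithm": "dps", "seed": 0}
print(apply_overrides(base, ["seed=42", "pretrain=inv-scatter"]))
```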
The structure of the inference config is explained below.
| Key | Description |
|---|---|
| `problem` | Name of the inverse problem configuration (see `configs/problem`). |
| `algorithm` | Name of the algorithm configuration (see `configs/algorithm`). |
| `pretrain` | Name of the pre-trained model configuration (see `configs/pretrain`). |
| `tf32` | (bool) Enables TF32 mode for improved performance on Ampere+ GPUs. |
| `compile` | (bool) Enables `torch.compile` (recommended for ensemble methods). |
| `seed` | (int) Random seed. |
| `inference` | (bool) If False, skips inference and only runs evaluation. |
| `exp_name` | (string) Sets the experiment name for logging and saving results. |
| `wandb` | (bool) Enables logging to Weights & Biases (WandB). |
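For context, enabling TF32 in PyTorch typically amounts to the backend switches below. This is a sketch of what a `tf32` flag usually toggles, not necessarily this repo's exact implementation:

```python
import torch

def set_tf32(enabled: bool) -> None:
    # TF32 trades a few mantissa bits for large matmul/conv speedups on
    # Ampere and newer GPUs; outputs differ slightly from full FP32.
    torch.backends.cuda.matmul.allow_tf32 = enabled
    torch.backends.cudnn.allow_tf32 = enabled

set_tf32(True)
```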
To run hyperparameter sweeps with WandB, follow these steps:

1. Run `wandb login` in the command line and follow the prompts to authenticate.
2. Navigate to the `configs/sweep` directory, where you'll define sweep configuration files. These files specify the hyperparameters to be tuned and their ranges for each problem and algorithm combination. See `configs/sweep/inv-scatter/dps.yaml` for a concrete example.
3. To start a sweep, use the following command:
   ```bash
   wandb sweep configs/sweep/[problem]/[algorithm].yaml
   ```
   For example, `wandb sweep configs/sweep/inv-scatter/dps.yaml`.
4. WandB will generate a sweep ID in the form `[your-entity]/[your-project]/[sweep-id]`. Use this ID to start sweep agents:
   ```bash
   wandb agent [your-entity]/[your-project]/[sweep-id] --count 10
   ```
   The optional `--count` flag sets the maximum number of runs the agent will try.
5. You can start multiple agents to run the sweep in parallel by opening new terminals and running the same `wandb agent` command.
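Sweep files follow the standard WandB sweep schema. A hypothetical example is sketched below; the metric name and hyperparameter are illustrative and may not match the actual contents of `configs/sweep/inv-scatter/dps.yaml`:

```yaml
program: main.py
method: random
metric:
  name: val/psnr              # hypothetical metric name
  goal: maximize
parameters:
  algorithm.guidance_scale:   # hypothetical hyperparameter
    values: [0.5, 1.0, 2.0]
command:
  - ${env}
  - python3
  - ${program}
  - problem=inv-scatter
  - algorithm=dps
  - ${args_no_hyphens}        # passes swept params as Hydra-style key=value
```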
This project is licensed under the MIT License - see the LICENSE file for details.
```bibtex
@inproceedings{
zheng2025inversebench,
title={InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences},
author={Hongkai Zheng and Wenda Chu and Bingliang Zhang and Zihui Wu and Austin Wang and Berthy Feng and Caifeng Zou and Yu Sun and Nikola Borislavov Kovachki and Zachary E Ross and Katherine Bouman and Yisong Yue},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=U3PBITXNG6}
}
```
This section contains solutions to common issues.

**Devito getting stuck?** If your code gets stuck at `codepy/jit.py`, run the script `scripts/fwi/clear_devito_cache.py` to make sure the Devito cache is in a consistent state.