This repository contains a standalone research framework for short to long (20+ hour) experiments, targeted at iterative algorithms such as gradient descent.
It also ships three algorithms for training shallow ReLU networks:
- Convex reformulation, as explained in my ICLR 2024 blog post, following "Neural Networks are Convex Regularizers" (Pilanci, M. and Ergen, T.); a sketch follows this list.
- Wasserstein gradient flow discretization, either:
  - as a proximal point algorithm, on the shallow networks described in Chizat, L. and Bach, F., NIPS 2018;
  - as a JKO step on a fixed grid, a variant of the Sinkhorn algorithm of Peyré, G., SIAM 2015.
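To give a taste of the convex reformulation, here is a minimal self-contained sketch of the sampled convex program in CVXPY. It is not the repository's algo_convex_cvxpy.py; the problem sizes, the number of sampled activation patterns, and the regularization strength are illustrative assumptions.

```python
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)
n, d, m = 40, 2, 20                       # samples, input dim, sampled ReLU patterns
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)
beta = 1e-3                               # group-lasso strength (illustrative)

# Sample diagonal arrangement patterns D_j = 1[X g_j >= 0] from random directions g_j.
D = (X @ rng.standard_normal((d, m)) >= 0).astype(float)   # shape (n, m)

V, W = cp.Variable((d, m)), cp.Variable((d, m))
# Network output in the convex parameterization: sum_j D_j X (v_j - w_j).
pred = cp.sum(cp.multiply(D, X @ (V - W)), axis=1)
penalty = cp.sum(cp.norm(V, 2, axis=0) + cp.norm(W, 2, axis=0))
constraints = [cp.multiply(2 * D - 1, X @ V) >= 0,         # keep each v_j on its pattern
               cp.multiply(2 * D - 1, X @ W) >= 0]
cp.Problem(cp.Minimize(0.5 * cp.sum_squares(pred - y) + beta * penalty),
           constraints).solve()
print("training loss:", 0.5 * np.sum((pred.value - y) ** 2))
```

Enumerating every activation pattern is combinatorial in general, so sampling a subset of patterns, as above, is the standard practical compromise.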
To run an experiment, set what should be saved and how often, when the experiment should stop, and what to log in real time. Each experiment is stored, along with all of its parameters, in a file for later analysis and exploitation. Once it is finished, check the file that was created.
Run plot.py without arguments to create plots for the latest experiment.
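For illustration, an experiment configuration might look like the following; every field name below is a hypothetical placeholder, not the actual schema read by config.py and runner.py.

```python
# Hypothetical experiment configuration; the real field names may differ.
algo = "gradient_descent"          # which algorithm to run
data = {"n_samples": 1000, "input_dim": 2}
learning_rate = 1e-3

max_hours = 20                     # when to stop the experiment
save_every = 100                   # how often to snapshot the iterate
log_live = ["loss"]                # what to log in real time
```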
- config.py: configurations are Python files (algorithm choice, data setup, hyperparameters, ...)
- runner.py: the different experiment loops (animation, loss display, ...)
- postprocess.py: computes indicators
- utils.py: helper functions
This project depends on NumPy. PyTorch and CVXPY are used by some algorithms but are optional dependencies.
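One common way to keep such dependencies optional is a guarded import; this is a generic Python pattern, not necessarily how this repository implements it.

```python
# Generic soft-dependency pattern (illustrative, not this repo's actual code).
try:
    import torch
    HAS_TORCH = True
except ImportError:
    torch = None
    HAS_TORCH = False

def require_torch():
    """Fail with a clear message when a PyTorch-backed algorithm is requested."""
    if not HAS_TORCH:
        raise ImportError("this algorithm requires PyTorch: pip install torch")
```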
- algo_GD_torch.py: PyTorch implementation of gradient descent on a 2-layer ReLU network (a minimal sketch follows this list)
- algo_convex_cvxpy.py: CVXPY solver for the convex reformulation of 2-layer ReLU training
- algo_prox.py: proximal point algorithm
- proxdistance.py: implements the Frobenius, Wasserstein, and sliced Wasserstein distances
- algo_jko.py: mean-field discretization using JKO; replaces the Wasserstein proximal operator with a KL-divergence one
- jko_proxf_scipy.py: proximal step solved with SciPy
- jko_proxf_cvxpy.py: proximal step solved with CVXPY
- jko_proxf_pytorch.py: proximal step solved by PyTorch gradient descent
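For reference, here is a minimal sketch of gradient descent on a 2-layer ReLU network in PyTorch, in the spirit of algo_GD_torch.py; the width, step size, iteration count, and synthetic data are illustrative assumptions, not the repository's settings.

```python
import torch

torch.manual_seed(0)
n, d, width = 200, 2, 50                      # illustrative sizes
X = torch.randn(n, d)
y = torch.randn(n, 1)

# Two-layer ReLU network: x -> W2 relu(W1 x + b1) + b2.
model = torch.nn.Sequential(
    torch.nn.Linear(d, width),
    torch.nn.ReLU(),
    torch.nn.Linear(width, 1),
)
opt = torch.optim.SGD(model.parameters(), lr=1e-2)

for step in range(1000):
    loss = torch.nn.functional.mse_loss(model(X), y)
    opt.zero_grad()
    loss.backward()
    opt.step()
    if step % 100 == 0:
        print(step, loss.item())               # real-time loss logging
```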