Replicating some experiments from the Git Re-Basin paper
Below you can see one of the experiments motivating the paper. We train two models (Model A and Model B) on a dataset (here: MNIST) until convergence (~98% test accuracy). When we then linearly interpolate between the parameters of these two trained models, we can see that the accuracy drops sharply towards the midpoint of the interpolation path.
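A minimal sketch of the interpolation step, assuming two PyTorch models with identical architectures. The trained checkpoints and the evaluation loop are placeholders here (random weights stand in for Model A and Model B):

```python
import copy
import torch
import torch.nn as nn

def interpolate_state_dicts(sd_a, sd_b, lam):
    """Return (1 - lam) * sd_a + lam * sd_b, key by key."""
    return {k: (1 - lam) * sd_a[k] + lam * sd_b[k] for k in sd_a}

# Two independently trained models would go here; random weights stand in.
model_a = nn.Sequential(nn.Linear(784, 512), nn.ReLU(), nn.Linear(512, 10))
model_b = copy.deepcopy(model_a)
for p in model_b.parameters():
    p.data = torch.randn_like(p)

model = copy.deepcopy(model_a)
for lam in torch.linspace(0, 1, 11):
    sd = interpolate_state_dicts(model_a.state_dict(),
                                 model_b.state_dict(), lam.item())
    model.load_state_dict(sd)
    # evaluate(model, test_loader) would report test accuracy at this lam
    # (hypothetical helper, not shown)
```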
While linear interpolation between the parameters does not work by default, the paper suggests that there are certain permutations we can apply to the parameters of model B to match them to the parameters of model A, without changing the function that model B computes (see the sketch below).
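The key observation is that the hidden units of an MLP can be reordered freely: permuting the rows of one layer's weights and biases, and correspondingly the columns of the next layer's weights, leaves the network's output unchanged. A small NumPy sketch of this invariance, with hypothetical layer sizes matching the MNIST setup:

```python
import numpy as np

rng = np.random.default_rng(0)
w1, b1 = rng.normal(size=(512, 784)), rng.normal(size=512)
w2, b2 = rng.normal(size=(10, 512)), rng.normal(size=10)

def forward(x, w1, b1, w2, b2):
    h = np.maximum(w1 @ x + b1, 0.0)   # ReLU hidden layer
    return w2 @ h + b2

perm = rng.permutation(512)            # any permutation of the hidden units

# Permute the rows of w1/b1 and, to compensate, the columns of w2.
w1_p, b1_p = w1[perm], b1[perm]
w2_p = w2[:, perm]

x = rng.normal(size=784)
assert np.allclose(forward(x, w1, b1, w2, b2),
                   forward(x, w1_p, b1_p, w2_p, b2))
```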
The first of the three proposed matching algorithms, activation matching, works by matching the activations of the two models on the same data. After using this method to permute the parameters of model B to align with those of model A, we can see that interpolating between the parameters of A and the permuted parameters of B works much better.
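A minimal sketch of the activation-matching idea for a single hidden layer: correlate every hidden unit of A with every hidden unit of B on a shared batch, then solve a linear assignment problem to pick the permutation with the highest total correlation. The activations here are random placeholders; in the real experiment they would be recorded from the trained models on MNIST inputs:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

rng = np.random.default_rng(0)

# Hidden activations of the two models on the same batch (placeholder data);
# shape: (num_examples, num_hidden_units).
acts_a = rng.normal(size=(1000, 512))
acts_b = rng.normal(size=(1000, 512))

# Pearson correlation between every unit of A and every unit of B.
a = (acts_a - acts_a.mean(0)) / acts_a.std(0)
b = (acts_b - acts_b.mean(0)) / acts_b.std(0)
corr = a.T @ b / len(a)                # (512, 512) cross-correlation matrix

# Find the permutation of B's units that maximizes total correlation.
row, col = linear_sum_assignment(corr, maximize=True)
perm = col                             # perm[i]: B's unit matched to A's unit i

# `perm` is then applied to model B's weights exactly as in the
# permutation-invariance sketch above, before interpolating.
```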