-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support type-2 Fisher in KFAC #49
Comments
Question: Is this already implemented in |
I don't think so, because curvlinops.GGNLinearOperator computes the exact GGN/Fisher, whereas this issue request a KFAC approximation using one backward pass per output dimension (see the implementation in ASDL as an example) to compute the gradients. I realize that the naming of the issue is a bit confusing, since it says "exact Fisher/GGN". |
Oh okay, so this is sampling gradients versus using the columns of the loss-output Hessian's matrix square root? (I think in BackPACK this corresponds to KFLR) |
Yeah, that sounds correct. |
I was trying to find a meaningful title for this. I could not find it on the internet, but believe the gradient-sampling-based estimation of the Fisher is called 'type-1', whereas the loss-Hessian-based estimation of the Fisher is called 'type-2'. Maybe @yorkerlin can help us out on this one. |
As opposed to using MC samples from the model's predictive distribution.
The text was updated successfully, but these errors were encountered: