Skip to content

psgd-jax 0.2.9

Latest
Compare
Choose a tag to compare
@evanatyourservice evanatyourservice released this 31 Dec 22:16

What's Changed

  • swapped normalize_grads out for clipping outputs by RMS. This is more stable, more accurate, and will work in a wider variety of situations. normalizing input grads is worse due to getting rid of valuable info for preconditioners.