- Add option for learned bias vectors
- Add CLI arguments for scripts
- Save state dict instead of model
- Give example results and saved models
- Reparameterize to initialize with all zeros to fix weight decay
- Add requirements.txt
- Add comments
- Match the method used in the (IA)^3 paper