Popular repositories
-
torchtune (Public), forked from meta-pytorch/torchtune
PyTorch native post-training library
Python
-
grokking (Public)
Demonstration of grokking using a Transformer trained to compute modulo 97 division. Despite a small test set (50%), the model generalizes only after extended training, showcasing the grokking pheno…
Python