Skip to content

Releases: deepakn94/Megatron-LM

SuperComputing 2021

11 Aug 17:06
Compare
Choose a tag to compare

Code and scripts accompanying the SuperComputing 2021 paper "Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM".