[Feature] Use CUDA event for measuring elasped time #88
Labels
enhancement
New feature or request
good first issue
Good for newcomers
help wanted
Extra attention is needed
As mentioned in the MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs, page 8
Use
torch.cuda.Event
for measuring elapsed time minimize CUDA synchronization compared totime.time()
[link]The text was updated successfully, but these errors were encountered: