You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The codebase has provided the training code. But how the reproduce the eval result in the paper 'DeepNet: Scaling Transformers to 1,000 Layers'. Could you please provide the code to reproduce the results in table 6 and table 7 of the paper 'DeepNet: Scaling Transformers to 1,000 Layers'.
The text was updated successfully, but these errors were encountered:
The codebase has provided the training code. But how the reproduce the eval result in the paper 'DeepNet: Scaling Transformers to 1,000 Layers'. Could you please provide the code to reproduce the results in table 6 and table 7 of the paper 'DeepNet: Scaling Transformers to 1,000 Layers'.
The text was updated successfully, but these errors were encountered: