Skip to content

Pull requests: bigcode-project/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Updated Megatron version
#85 opened Dec 21, 2023 by jlamypoirier Draft
Diff with nvidia main
#84 opened Dec 21, 2023 by jlamypoirier Draft
Create pretrain_starcoder2_1b.slurm
#82 opened Nov 10, 2023 by loubnabnl Loading…
re-merge from NVIDIA main
#68 opened Jun 27, 2023 by RaymondLi0 Loading…
fix missing world_size in args_to_keep
#66 opened Jun 23, 2023 by mayank31398 Loading…
Add Deepspeed integration [WIP]
#62 opened Jun 14, 2023 by mayank31398 Draft
Fix mqa parallelization
#51 opened May 11, 2023 by thomasw21 Loading…
Mtf
#47 opened Apr 14, 2023 by Muennighoff Loading…
WIP: UL2 merge
#23 opened Feb 7, 2023 by RaymondLi0 Loading…
From NVIDIA Megatron-LM for visibility
#18 opened Jan 24, 2023 by RaymondLi0 Loading…
ProTip! no:milestone will show everything without a milestone.