Skip to content

Pull requests: pytorch/torchtitan

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[MoE][PoC] Expert Parallel: dp2ep CLA Signed This label is managed by the Meta Open Source bot.
#732 opened Dec 12, 2024 by tianyu-l Draft
[MoE][PoC] Expert Parallel: tp and tp2ep CLA Signed This label is managed by the Meta Open Source bot.
#731 opened Dec 12, 2024 by tianyu-l Draft
[MoE][PoC] model code CLA Signed This label is managed by the Meta Open Source bot.
#730 opened Dec 12, 2024 by tianyu-l Draft
[Not for land] Show replicated fp32 norm weights CLA Signed This label is managed by the Meta Open Source bot.
#717 opened Dec 4, 2024 by awgu Draft
First draft Auto-SAC workflow CLA Signed This label is managed by the Meta Open Source bot.
#710 opened Dec 2, 2024 by sanketpurandare Draft
[WIP] Allow benchmark between multiple configs CLA Signed This label is managed by the Meta Open Source bot.
#703 opened Nov 26, 2024 by H-Huang Loading…
[WIP] Adding OBELICS DataLoader CLA Signed This label is managed by the Meta Open Source bot.
#663 opened Oct 30, 2024 by TJ-Solergibert Loading…
[not for land] torch.compile individual linears CLA Signed This label is managed by the Meta Open Source bot.
#661 opened Oct 29, 2024 by vkuzo Loading…
Use enable_gqa in place of repeat_kv CLA Signed This label is managed by the Meta Open Source bot.
#641 opened Oct 22, 2024 by awgu Draft
Init weights only if not loading a checkpoint CLA Signed This label is managed by the Meta Open Source bot.
#628 opened Oct 18, 2024 by carmocca Draft
[DO NOT REVIEW] gaps to enable FDSP2 cpu offloading CLA Signed This label is managed by the Meta Open Source bot.
#622 opened Oct 16, 2024 by weifengpy Loading…
[Not for land] Settings to make Llama3-8B on 8 GPUs faster CLA Signed This label is managed by the Meta Open Source bot.
#615 opened Oct 14, 2024 by awgu Draft
[not for land] TE experiments, take 2 CLA Signed This label is managed by the Meta Open Source bot.
#614 opened Oct 14, 2024 by vkuzo Loading…
[DO NOT REVIEW] --experimental.fsdp_sharding_on_largest_dim CLA Signed This label is managed by the Meta Open Source bot.
#607 opened Oct 9, 2024 by weifengpy Loading…
fix mixed precision for replicate / pure DDP CLA Signed This label is managed by the Meta Open Source bot.
#591 opened Sep 29, 2024 by 152334H Loading…
[not for land yet] hack max and abs out of ops eligible for AC CLA Signed This label is managed by the Meta Open Source bot.
#580 opened Sep 17, 2024 by vkuzo Loading…
add pp validation for schedule CLA Signed This label is managed by the Meta Open Source bot.
#568 opened Sep 5, 2024 by H-Huang Loading…
3d with fp8 in test runner CLA Signed This label is managed by the Meta Open Source bot.
#564 opened Aug 29, 2024 by H-Huang Draft
[WIP] zero bubble CLA Signed This label is managed by the Meta Open Source bot.
#546 opened Aug 20, 2024 by H-Huang Draft
[DO NOT REVIEW] Runtime estimation with FakeTensor + TorchDispatchMode CLA Signed This label is managed by the Meta Open Source bot.
#536 opened Aug 20, 2024 by weifengpy Loading…
[Not for land] Added changes for GPT-2 perf CLA Signed This label is managed by the Meta Open Source bot.
#533 opened Aug 19, 2024 by awgu Draft
[Not for land] Added GPT-2-like config CLA Signed This label is managed by the Meta Open Source bot.
#532 opened Aug 19, 2024 by awgu Draft
[Not for land] GaLore example CLA Signed This label is managed by the Meta Open Source bot.
#488 opened Jul 29, 2024 by awgu Draft
[torchtitan][debug] integrated CommDebugMode into TorchTitan CLA Signed This label is managed by the Meta Open Source bot.
#480 opened Jul 24, 2024 by sinhaanshul Loading…
[not for land] TE experiments CLA Signed This label is managed by the Meta Open Source bot.
#477 opened Jul 23, 2024 by vkuzo Loading…
ProTip! Follow long discussions with comments:>50.