-
Notifications
You must be signed in to change notification settings - Fork 333
Issues: AI-Hypercomputer/maxtext
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
moe_lb_loss should be divided by gradient_accumulation_steps for reporting.
#1483
opened Mar 26, 2025 by
bzantium
When using dcn-DP and dcn-FSDP together got error when saving checkpoint.
#1434
opened Mar 20, 2025 by
jiagaoxiang
load_balance_loss for dense_matmul should take whole whole weights not topk weights
#1432
opened Mar 20, 2025 by
bzantium
The default setting of
param_scan_axis=1
hurts performance and memory consumption on GPUs
#1382
opened Mar 12, 2025 by
jaro-sevcik
MFU drops significantly when using megablox with more experts
#1256
opened Feb 9, 2025 by
rodrigo-f-nogueira
llama GPU model with dcn fsdp + ici tp + cudnn flash attention broken
#1093
opened Dec 10, 2024 by
wang2yn84
Support nsys profiler upload in all cases
bug
Something isn't working
good first issue
Good for newcomers
#911
opened Sep 24, 2024 by
gobbleturk
Move maxtext docker images being built to artifact registry
enhancement
New feature or request
#904
opened Sep 20, 2024 by
parambole
Unable to recover after checkpoint saving
bug
Something isn't working
#868
opened Sep 6, 2024 by
peregilk
Cannot see multiple GPUs when using Slurm (with proposed fix)
feature request
good first issue
Good for newcomers
#865
opened Sep 4, 2024 by
gabeweisz
converting Gemma maxtext compatible checkpoint to Hugging Face format
feature request
#829
opened Aug 16, 2024 by
salrowili
Multihost training collapses from time to time when loading the next batch
bug
Something isn't working
#786
opened Jul 18, 2024 by
YUE-FAN
Previous Next
ProTip!
Adding no:label will show everything without a label.