forked from NVIDIA/Megatron-LM
-
Notifications
You must be signed in to change notification settings - Fork 50
From NVIDIA Megatron-LM for visibility #18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
RaymondLi0
wants to merge
4,603
commits into
bigcode-project:multi-query-attention
Choose a base branch
from
NVIDIA:main
base: multi-query-attention
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Fix `post_training/test_get_gpt_modelopt_spec_interface` See merge request ADLR/megatron-lm!3118
Co-authored-by: Shanmugam Ramasamy <[email protected]>
Remove legacy bert tests See merge request ADLR/megatron-lm!3023
Co-authored-by: Ali Taghibakhshi <[email protected]> Co-authored-by: Mcore Bot <[email protected]>
Alit/config mamba head See merge request ADLR/megatron-lm!2601
Update CODEOWNERS to make modelopt review only for QAT. See merge request ADLR/megatron-lm!3125
Run nemo2 tests instead of nemo1 See merge request ADLR/megatron-lm!3119
…attn for dynamic batching. Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: Vijay Korthikanti <[email protected]> Co-authored-by: Mcore Bot <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: root <[email protected]>
Integrating paged attention feature of flash_attn for dynamic batching. See merge request ADLR/megatron-lm!2955
Co-authored-by: Mcore Bot <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: Chenhan Yu <[email protected]>
add l2 norm in torch_norm.py for LLAMA-4 support See merge request ADLR/megatron-lm!2960
fix: Improvements to the auto-reminder bot See merge request ADLR/megatron-lm!3126
Fix Gemma TRTLLM export See merge request ADLR/megatron-lm!2475
Co-authored-by: Yuzhong Wang <[email protected]> Co-authored-by: Shunkang <[email protected]>
Fix MLA THD format support See merge request ADLR/megatron-lm!2691
…t load strictness.
Dynamic inference example | Control checkpoint load strictness. See merge request ADLR/megatron-lm!2914
Co-authored-by: jianbinc <[email protected]>
patch for fp8 primary weight custom fsdp support See merge request ADLR/megatron-lm!3057
ci: Track info about MR See merge request ADLR/megatron-lm!3129
ci: Handle nargs See merge request ADLR/megatron-lm!3105
…h --no-optim-load Co-authored-by: jianbinc <[email protected]> Co-authored-by: 胡凯文 <[email protected]>
This reverts commit d87ba91.
ci: Run on multiple clusters See merge request ADLR/megatron-lm!3292
ci: Allow specific TE-ref See merge request ADLR/megatron-lm!3302
ci(fix): Write logs to log_dir See merge request ADLR/megatron-lm!3299
Address dist checkpointing PyT 24.08 failure See merge request ADLR/megatron-lm!3253
ci(hotfix): Downstream pipeline See merge request ADLR/megatron-lm!3307
…nal argparse flag to clear GPU... Co-authored-by: Szymon Migacz <[email protected]>
MR feedback: added units for arguments, optional argparse flag to clear GPU... See merge request ADLR/megatron-lm!3308
…mamba class constructor Co-authored-by: Zhiyu Li <[email protected]>
Allow process group as optional argument for mamba class constructor See merge request ADLR/megatron-lm!2966
Add NVTX ranges to categorize execution See merge request ADLR/megatron-lm!2588
Move fsdp 2 import from _composable to public See merge request ADLR/megatron-lm!3116
ci: Add nemo-image to `ci-rebuild-mcore-nemo-image` See merge request ADLR/megatron-lm!3321
ci: Re-enable tests that failed on memory See merge request ADLR/megatron-lm!3197
Signed-off-by: oliver könig <[email protected]>
Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]>
Engine updates See merge request ADLR/megatron-lm!3254
Co-authored-by: Mcore Bot <[email protected]>
ci: Onboard mr-slim to h100 See merge request ADLR/megatron-lm!3312
chore: Deprecate T5 tests See merge request ADLR/megatron-lm!3334
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.