Skip to content

Pull requests: PaddlePaddle/PaddleFormers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Feat/model unittest ci action contributor
#2683 opened Sep 25, 2025 by huanghengheng Loading…
2 tasks
GLM4.5 support sp + moe aux loss contributor
#2682 opened Sep 24, 2025 by WYB27 Loading…
[DSv3]: Add Tokenizer Config for DSv3
#2650 opened Sep 22, 2025 by hushenwei2000 Loading…
Glm4Moe: fix attn_mask && fused_loss contributor
#2648 opened Sep 20, 2025 by WYB27 Loading…
Update CODE_OF_CONDUCT.md contributor
#2636 opened Sep 18, 2025 by Jagdish2810 Draft
2 tasks done
[dsv3]Move dsv3 model from paddlenlp-dsv3-sft
#2593 opened Sep 11, 2025 by Difers Loading…
1 of 7 tasks
【Bug】Fix attn_mask_startend_row_indices shape mismatch
#2564 opened Sep 8, 2025 by cheng221 Loading…
2 tasks
【FlexCP】add Flexcp for trainer
#2541 opened Sep 4, 2025 by xiaoguoguo626807 Loading…
2 tasks
feat(dsv3):Runnable N1C8 configs
#2525 opened Sep 1, 2025 by hushenwei2000 Loading…
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2524 opened Aug 31, 2025 by chen2016013 Loading…
2 tasks
feat(dsv3):Runnable N1C8 configs
#2523 opened Aug 31, 2025 by chen2016013 Loading…
2 tasks
add moe
#2510 opened Aug 28, 2025 by a31413510 Loading…
fix bug support download ernie model contributor
#2509 opened Aug 28, 2025 by fjjF77 Loading…
fix typos contributor
#2500 opened Aug 28, 2025 by co63oc Loading…
2 tasks
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2496 opened Aug 27, 2025 by chen2016013 Loading…
2 tasks
Update lora layer source contributor
#2489 opened Aug 27, 2025 by emmanuel-ferdman Loading…
1 of 2 tasks
Merge dsv3 tainer part
#2487 opened Aug 27, 2025 by hushenwei2000 Draft
change deepseekv2 model
#2486 opened Aug 26, 2025 by chen2016013 Loading…
2 tasks
add pre_train entrance
#2483 opened Aug 26, 2025 by chen2016013 Loading…
2 tasks
Support GPT-OSS contributor
#2478 opened Aug 25, 2025 by WYB27 Loading…
support general pp model
#2473 opened Aug 25, 2025 by cheng221 Loading…
2 tasks
add moe
#2467 opened Aug 25, 2025 by a31413510 Loading…
2 tasks
ProTip! What’s not been updated in a month: updated:<2025-09-03.