add qwen2/3 old fused qkv/ffn #3150

llbdyiu66 · 2025-12-10T07:07:25Z

No description provided.

paddle-bot · 2025-12-10T07:07:31Z

Thanks for your contribution!

lugimzzz

LGTM

JunnYu

Changes needed.

JunnYu · 2025-12-10T09:41:32Z

paddleformers/transformers/qwen2/modeling.py

-                )

-            return actions
+            if config.tie_word_embeddings:


There's no concept of tie_word_embeddings here anymore, is there? The formers default should be base_actions["lm_head.weight"] = partial(fn, is_column=False), right?
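
For readers following the `is_column` discussion, here is a minimal sketch of the difference between a column-wise and a row-wise tensor-parallel split. It is not the PaddleFormers implementation: `split_weight` is a hypothetical helper, weights are plain nested lists, and it assumes `is_column=True` means "split along the last axis" while `is_column=False` means "split along the first axis". Which of the two is right for `lm_head.weight` depends on how the layer stores its weight, which this sketch does not settle.

```python
from typing import List

Matrix = List[List[int]]


def split_weight(weight: Matrix, tp_degree: int, rank: int, is_column: bool) -> Matrix:
    """Return the shard of `weight` owned by `rank` (hypothetical helper)."""
    rows, cols = len(weight), len(weight[0])
    if is_column:
        # Column-wise: every rank keeps all rows and a slice of the columns.
        assert cols % tp_degree == 0, "columns must divide evenly across ranks"
        step = cols // tp_degree
        return [row[rank * step:(rank + 1) * step] for row in weight]
    # Row-wise: every rank keeps all columns and a slice of the rows.
    assert rows % tp_degree == 0, "rows must divide evenly across ranks"
    step = rows // tp_degree
    return weight[rank * step:(rank + 1) * step]


# A 4x4 toy weight whose entries encode their own position.
weight = [[r * 4 + c for c in range(4)] for r in range(4)]

col_shard = split_weight(weight, tp_degree=2, rank=0, is_column=True)
row_shard = split_weight(weight, tp_degree=2, rank=1, is_column=False)

print(col_shard)  # [[0, 1], [4, 5], [8, 9], [12, 13]]
print(row_shard)  # [[8, 9, 10, 11], [12, 13, 14, 15]]
```

The two shards have different shapes (4x2 versus 2x4), so passing the wrong flag for a given weight produces a tensor that no longer matches the parallel layer that consumes it.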

JunnYu · 2025-12-10T09:42:16Z

paddleformers/transformers/qwen2_moe/modeling.py

+            final_actions = {}
+
+            base_actions = {
+                "lm_head.weight": partial(fn, is_column=True),


partial(fn, is_column=False)

JunnYu · 2025-12-10T09:42:36Z

paddleformers/transformers/qwen3/modeling.py

-
-        mappings = make_base_actions()
+
+            if config.tie_word_embeddings:


JunnYu · 2025-12-10T09:42:47Z

paddleformers/transformers/qwen3_moe/modeling.py


-        LAYER_ROWWISE = ["self_attn.o_proj.weight"]
+            base_actions = {
+                "lm_head.weight": partial(fn, is_column=True),


JunnYu · 2025-12-10T09:44:12Z

Please flesh out the PR description, add tests, and so on.

JunnYu · 2025-12-11T08:38:33Z

paddleformers/transformers/qwen3_moe/modeling.py

+                            {
+                                f"{cls.base_model_prefix}.layers.{layer_idx}.mlp.experts.{e}.{k}": partial(
+                                    fn, is_column=True
+                                )


When fuse_attention_ffn is enabled, is_naive_2fuse=True needs to be passed.
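
A short sketch of why a fused gate/up weight needs special handling when it is split for tensor parallelism. This is an illustration under assumptions, not the library's code: it assumes that "naive 2-fuse" means the fused tensor is the plain concatenation `[gate | up]` along the last axis, and `split_plain` and `split_naive_2fuse` are hypothetical helpers operating on nested lists.

```python
from typing import List

Matrix = List[List[str]]


def split_plain(weight: Matrix, tp_degree: int, rank: int) -> Matrix:
    """Ordinary column split: one contiguous slice of columns per rank."""
    step = len(weight[0]) // tp_degree
    return [row[rank * step:(rank + 1) * step] for row in weight]


def split_naive_2fuse(weight: Matrix, tp_degree: int, rank: int) -> Matrix:
    """Split each half of a [gate | up] weight separately, then re-join.

    Each rank receives its slice of `gate` followed by its slice of `up`.
    """
    cols = len(weight[0])
    assert cols % (2 * tp_degree) == 0, "each half must divide evenly across ranks"
    half = cols // 2
    step = half // tp_degree
    lo, hi = rank * step, (rank + 1) * step
    return [row[lo:hi] + row[half + lo:half + hi] for row in weight]


# One row of a fused weight: four gate columns followed by four up columns.
fused = [["g0", "g1", "g2", "g3", "u0", "u1", "u2", "u3"]]

plain_rank0 = split_plain(fused, tp_degree=2, rank=0)
fused_rank0 = split_naive_2fuse(fused, tp_degree=2, rank=0)
fused_rank1 = split_naive_2fuse(fused, tp_degree=2, rank=1)

print(plain_rank0)  # [['g0', 'g1', 'g2', 'g3']]
print(fused_rank0)  # [['g0', 'g1', 'u0', 'u1']]
print(fused_rank1)  # [['g2', 'g3', 'u2', 'u3']]
```

With the plain split, rank 0 ends up holding only gate columns and rank 1 only up columns, so neither rank can compute its share of the gated activation. The 2-fuse-aware split gives every rank matching slices of both halves.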

JunnYu · 2025-12-11T08:39:49Z

paddleformers/transformers/qwen2_moe/modeling.py

-                        for k in LAYER_COLWISE
-                    }
-                )
+                # colwise


The fused FFN is missing here; also remember to add is_naive_2fuse.

JunnYu · 2025-12-11T08:40:28Z

paddleformers/transformers/qwen2/modeling.py

            "mlp.gate_proj.weight",
        ]

+        FUSE_LAYER_COLWISE = [


is_naive_2fuse is missing from all of these.

codecov-commenter · 2025-12-11T16:04:51Z

Codecov Report

❌ Patch coverage is 55.20833% with 86 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@4e7b961). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| paddleformers/transformers/qwen2_moe/modeling.py | 53.84% | 24 Missing ⚠️ |
| paddleformers/transformers/qwen3_moe/modeling.py | 56.00% | 22 Missing ⚠️ |
| paddleformers/transformers/qwen3/modeling.py | 54.34% | 21 Missing ⚠️ |
| paddleformers/transformers/qwen2/modeling.py | 56.81% | 19 Missing ⚠️ |

❌ Your patch status has failed because the patch coverage (55.20%) is below the target coverage (75.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files

@@            Coverage Diff             @@
##             develop    #3150   +/-   ##
==========================================
  Coverage           ?   36.74%           
==========================================
  Files              ?      416           
  Lines              ?    71324           
  Branches           ?        0           
==========================================
  Hits               ?    26207           
  Misses             ?    45117           
  Partials           ?        0
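
As a sanity check on the report, the per-file figures can be reconciled with the headline patch coverage. The sketch below copies the percentages and missing-line counts from the report; the number of changed lines per file is not stated there and is inferred here (an assumption) by rounding `missing / (1 - coverage)` to the nearest integer.

```python
# (patch coverage %, missing lines) per file, copied from the Codecov report.
report = {
    "qwen2_moe/modeling.py": (53.84, 24),
    "qwen3_moe/modeling.py": (56.00, 22),
    "qwen3/modeling.py": (54.34, 21),
    "qwen2/modeling.py": (56.81, 19),
}


def infer_changed_lines(patch_pct: float, missing: int) -> int:
    """Infer the changed-line count from coverage and missing lines (assumption)."""
    return round(missing / (1 - patch_pct / 100))


total_missing = sum(missing for _, missing in report.values())
total_changed = sum(infer_changed_lines(pct, missing) for pct, missing in report.values())
overall_pct = 100 * (total_changed - total_missing) / total_changed

print(total_missing)          # 86
print(total_changed)          # 192
print(round(overall_pct, 5))  # 55.20833
```

Under that assumption the four files account for all 86 missing lines, and 106 covered lines out of 192 changed reproduces the reported 55.20833%, well short of the 75.00% target.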

add qwen2/3 old fused qkv/ffn

de46dcf

paddle-bot bot added the contributor label Dec 10, 2025

lugimzzz previously approved these changes Dec 10, 2025

change fuse key up_gate_proj

fd4cb62

llbdyiu66 dismissed lugimzzz’s stale review via fd4cb62 December 10, 2025 08:57

JunnYu suggested changes Dec 10, 2025

fix

147c322

JunnYu reviewed Dec 11, 2025

fix

98a6f8d
