[wgmma] Insert commit_group and wait_group after mma_async #3573

jacobhinkle · 2024-12-11T14:21:39Z

jacobhinkle · 2024-12-11T17:35:42Z

!test

rdspring1 · 2025-01-03T02:00:01Z

@jacobhinkle I ran into #3561 when working with register sharing and warp specialization. I wonder if I can use this.

rdspring1

There can be multiple wgmma operations per circular buffer stage, e.g., when the K dimension is a multiple of the mma macro.

It should be safe to wait for all operations per stage.

I made the following change in WarAsyncWaitInserter.
https://github.com/NVIDIA/Fuser/pull/3616/files#diff-49bea61a8cde014ec0396c89d0654813136e0bd3ffe8b3a5974ee9ccf3a5fbb8R1010-R1024

It didn't make any performance difference. 🤷🏼
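To illustrate the point about several wgmma operations sharing one stage, here is a toy host-side model of commit/wait bookkeeping. It is only a sketch: `AsyncGroupModel` and its methods are hypothetical names, not nvFuser or PTX APIs, and it assumes the usual semantics that a commit bundles all uncommitted operations into one group and that a wait with `keep` retires the oldest groups until at most `keep` remain pending.

```cpp
#include <cstddef>
#include <cstdint>
#include <deque>

// Toy model only; hypothetical class, not part of nvFuser.
class AsyncGroupModel {
 public:
  // Record one asynchronous operation (e.g. one mma_async) as issued
  // but not yet committed to a group.
  void issue() { ++uncommitted_ops_; }

  // Bundle every uncommitted operation into a single pending group.
  // Committing with nothing issued creates an empty group.
  void commit() {
    pending_groups_.push_back(uncommitted_ops_);
    uncommitted_ops_ = 0;
  }

  // Retire the oldest groups until at most `keep` groups remain pending.
  void wait(std::size_t keep) {
    while (pending_groups_.size() > keep) {
      retired_ops_ += pending_groups_.front();
      pending_groups_.pop_front();
    }
  }

  std::size_t pendingGroups() const { return pending_groups_.size(); }
  std::int64_t retiredOps() const { return retired_ops_; }

 private:
  std::int64_t uncommitted_ops_ = 0;
  std::int64_t retired_ops_ = 0;
  std::deque<std::int64_t> pending_groups_;
};
```

In this model, issuing two operations for a stage followed by one `commit()` and `wait(0)` retires both operations together, which matches the idea of waiting for all operations in a stage.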

rdspring1 · 2025-01-03T02:50:46Z

csrc/device_lower/pass/inline_ptx.cpp

+    auto* commit = IrBuilder::create<kir::AsyncCommit>(AsyncOpType::WgMma);
+    auto* wait = IrBuilder::create<kir::AsyncWait>(
+        AsyncOpType::WgMma,
+        /*keep_stages=*/cb_opts.stage - cb_opts.prefetch - 1);


If cb_opts.stage - cb_opts.prefetch == 0, then keep_stages will be -1, which is invalid.

Suggested change

-        /*keep_stages=*/cb_opts.stage - cb_opts.prefetch - 1);
+        std::max(0LL, /*keep_stages=*/cb_opts.stage - cb_opts.prefetch - 1));
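The arithmetic behind this comment can be checked in isolation. The sketch below is a standalone helper, not code from the PR: `clampedKeepStages` is a hypothetical name, and the `std::int64_t` parameter types are an assumption about `cb_opts.stage` and `cb_opts.prefetch`. Enforcing a lower bound of zero calls for `std::max`, since `std::min` with 0 would never return a positive value.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helper: number of stages to keep in flight, never negative.
// Parameter types are assumed; the real option fields may differ.
constexpr std::int64_t clampedKeepStages(
    std::int64_t stage,
    std::int64_t prefetch) {
  return std::max<std::int64_t>(0, stage - prefetch - 1);
}

// stage - prefetch == 0 would give -1 without the clamp.
static_assert(clampedKeepStages(4, 4) == 0, "negative result is clamped");
// stage - prefetch == 1 gives exactly 0.
static_assert(clampedKeepStages(3, 2) == 0, "zero is preserved");
// Larger gaps pass through unchanged.
static_assert(clampedKeepStages(4, 2) == 1, "positive result is unchanged");
```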

[wgmma] Insert commit_group and wait_group after mma_async

e6ea681

jacobhinkle requested a review from zasdfgbnm December 11, 2024 14:21

jacobhinkle added the Matmuls label Dec 11, 2024

rdspring1 reviewed Jan 3, 2025
