Skip to content

feat(example): Add flashdecoding kernel [stage-1].#89

Closed
KuangjuX wants to merge 13 commits intomicrosoft:masterfrom
KuangjuX:flashdecoding
Closed

feat(example): Add flashdecoding kernel [stage-1].#89
KuangjuX wants to merge 13 commits intomicrosoft:masterfrom
KuangjuX:flashdecoding

Conversation

@KuangjuX
Copy link
Contributor

@KuangjuX KuangjuX commented Mar 18, 2025

Progress:

  • Add storing of LSE vector to GMEM from ke_flash_decoding_split_kv_fwd.
  • Add loading of LSE vector from global memory to shared memory.
  • Add allocation of LSE vector bundle shared memory to registers and transposition.

@KuangjuX KuangjuX marked this pull request as draft March 18, 2025 07:36
@KuangjuX KuangjuX changed the title feat(example): Add flashdecoding. feat(example): Add flashdecoding kernel [stage-1]. Mar 28, 2025
@haruhi55 haruhi55 closed this Jun 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants