Skip to content

Add efficient Triton decode attention kernel with fused skip-softmax …#1624

Draft
kaix-nv wants to merge 1 commit into
mainfrom
kaix/triton_decode_skip_softmax
Draft

Add efficient Triton decode attention kernel with fused skip-softmax …#1624
kaix-nv wants to merge 1 commit into
mainfrom
kaix/triton_decode_skip_softmax

Commits

Commits on Jun 3, 2026