-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CUTEDSL] Update example code nvvm API usage from nvvm enum to str
#2985
opened Jan 26, 2026 by
XiaoSong9905
Loading…
Fix redundant tile copies in wgmma_sm90 tutorial pipeline loop
#2982
opened Jan 25, 2026 by
Johnsonms
Loading…
Fix error in Blackwell document of referring to Mxf4 format as NVF4
#2977
opened Jan 23, 2026 by
zianglih
Loading…
fix(examples): fix device compatibility check for Ada FP8 GEMM
#2954
opened Jan 13, 2026 by
w1ndseeker
Loading…
cutlass profiler - align emitted SFA/SFB kernel naming with typical convention
#2942
opened Jan 10, 2026 by
aidando73
Loading…
Fix Warp Memory Access Arrangement in Epilogue: Upper Bound memory access width by output tile width
#2938
opened Jan 8, 2026 by
lukas-ruettgers
Loading…
Refactor binary_op functions to remove unused result parameter
#2919
opened Jan 2, 2026 by
pbelevich
Loading…
docs: Add FP16 GEMM documentation to sgemm_sm80.cu - Fixes #1686
#2870
opened Dec 10, 2025 by
blueberrycongee
Loading…
[WIP]Unit tests for Kernels that perform BF16 x BF16 = MXFP8 and MXFP8 x MXFP8 = BF16
#2857
opened Dec 8, 2025 by
Shreya-gaur
Loading…
use cp.async.bulk for per-row data; quiets synccheck
inactive-30d
#2850
opened Dec 5, 2025 by
v0i0
Loading…
[FIX] Update nvidia-cutlass-dsl
requirements version from 4.3.0 to 4.3.1
inactive-30d
#2823
opened Nov 29, 2025 by
jeromeku
Loading…
[CuTeDSL] Feature/fp8e4m3 to fp16 conversion
inactive-30d
#2822
opened Nov 28, 2025 by
arseniivanov
Loading…
Fix processing of relative imports in AST preprocessing
#2821
opened Nov 28, 2025 by
danieldk
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.