Pinned Loading
-
tilelang
tilelang PublicForked from tile-ai/tilelang
Domain-specific language designed to streamline the development of high-performance GPU/CPU kernels
C++
-
tile-ai/tilelang
tile-ai/tilelang PublicDomain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
207 contributions in the last year
Day of Week | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | April Apr | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Contributed to
tile-ai/tilelang,
chengyupku/Ladder,
chengyupku/chengyupku.github.io
and 11 other
repositories
Loading
Contribution activity
April 2025
Created 16 commits in 3 repositories
Created 3 repositories
-
chengyupku/cafe_plot
Python
This contribution was made on Apr 17
-
chengyupku/DeepGEMM
Cuda
This contribution was made on Apr 9
-
chengyupku/Triton-distributed
MLIR
This contribution was made on Apr 5
Opened 12 pull requests in 1 repository
tile-ai/tilelang
12
merged
-
[Enhancement] Add TMA+WS support in pipeline planning logic
This contribution was made on Apr 22
-
[Refactor] Enhance layout inference logic in ParallelOp
This contribution was made on Apr 22
-
[Enhancement] Report Error Body in ParallelOp Layout Inference
This contribution was made on Apr 15
-
[Refactor] Refactor warp_specialized_rewriter to support multiple acquire/release patterns
This contribution was made on Apr 14
-
[Example] Introduce autotuning example for GEMM with enhanced configuration options
This contribution was made on Apr 9
-
[Enhancement] Update group_per_split_token_cast_to_fp8 to support multiple data types
This contribution was made on Apr 8
-
[Bugfix] Fix X_amax Correctness Issue in Group Cast FP8
This contribution was made on Apr 6
-
[Dev] Add Group Cast FP8 Example
This contribution was made on Apr 5
-
[Refactor] Optimize RMS normalization kernel in rms_norm.py
This contribution was made on Apr 4
-
[Dev] Add FP8 Quantization Examples and Absolute Maximum Reduction Operation Support
This contribution was made on Apr 3
-
[Bugfix] Fix logic error in ReduceOp when handling CUDA architecture
This contribution was made on Apr 1
-
[Bugfix] Fixed the handling logic of IfThenElseNode in if_stmt_binding
This contribution was made on Apr 1
1
contribution
in private repositories
Apr 5