Update Add Docker Support for CUTLASS FP8 GEMM #36

SamirMoustafa · 2024-12-12T15:33:36Z

This pull request includes changes to the cutlass_gemm kernel, aimed at improving the setup, installation, and usage of the kernel. The most important changes include the addition of a Dockerfile for containerized builds, updates to the readme.md for clearer installation instructions, and improvements to the setup.py for better path handling and CUDA library linking.

kernels/cuda/cutlass_gemm/Dockerfile: Added a Dockerfile to facilitate building and running the project in a containerized environment. This includes instructions for building the image and running the container.
kernels/cuda/cutlass_gemm/readme.md: Updated the README with detailed installation instructions for both Docker and non-Docker environments, including prerequisites and steps for building and running the project.
kernels/cuda/cutlass_gemm/setup.py: Modified the setup script to dynamically determine the current location for include directories and to use CUDA_HOME for library directories, enhancing portability and ease of setup. [1] [2]

Minor Code Adjustments:

kernels/cuda/cutlass_gemm/test_cutlass_gemm.py: Fixed import formatting for cutlass_scaled_mm to maintain consistency.

This reverts commit 119e41f.

This reverts commit eccdb04.

SamirMoustafa · 2024-12-12T17:10:01Z

I can also add a benchmark script that compares against PyTorch data types, but this would make the PR slightly longer. Please let me know if it is needed.

SamirMoustafa added 9 commits December 12, 2024 13:44

remove personal paths

601d20c

Added cutlass 3.5.1 as submodule

eccdb04

remove cutlass 3.6 from the submodules

119e41f

change the import order to avoid libc10.so not found

87497f9

add docker to specify the cutlass version

2126b7b

Revert "remove cutlass 3.6 from the submodules"

81ecc8a

This reverts commit 119e41f.

Revert "Added cutlass 3.5.1 as submodule"

f0a84f8

This reverts commit eccdb04.

install cutlass_gemm within the Dockerfile

e288b7f

add a readme for the cuda/cutlass fp8 gemm kernel

aa618a3

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 12, 2024

minor update for the setup without Docker

25e87fc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Add Docker Support for CUTLASS FP8 GEMM #36

Update Add Docker Support for CUTLASS FP8 GEMM #36

SamirMoustafa commented Dec 12, 2024

SamirMoustafa commented Dec 12, 2024

Update Add Docker Support for CUTLASS FP8 GEMM #36

Are you sure you want to change the base?

Update Add Docker Support for CUTLASS FP8 GEMM #36

Conversation

SamirMoustafa commented Dec 12, 2024

Minor Code Adjustments:

SamirMoustafa commented Dec 12, 2024