Update Add Docker Support for CUTLASS FP8 GEMM #36
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request includes changes to the
cutlass_gemm
kernel, aimed at improving the setup, installation, and usage of the kernel. The most important changes include the addition of aDockerfile
for containerized builds, updates to thereadme.md
for clearer installation instructions, and improvements to thesetup.py
for better path handling and CUDA library linking.kernels/cuda/cutlass_gemm/Dockerfile
: Added a Dockerfile to facilitate building and running the project in a containerized environment. This includes instructions for building the image and running the container.kernels/cuda/cutlass_gemm/readme.md
: Updated the README with detailed installation instructions for both Docker and non-Docker environments, including prerequisites and steps for building and running the project.kernels/cuda/cutlass_gemm/setup.py
: Modified the setup script to dynamically determine the current location for include directories and to useCUDA_HOME
for library directories, enhancing portability and ease of setup. [1] [2]Minor Code Adjustments:
kernels/cuda/cutlass_gemm/test_cutlass_gemm.py
: Fixed import formatting forcutlass_scaled_mm
to maintain consistency.