-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add TMA example for Hopper H100 #214
base: master
Are you sure you want to change the base?
Conversation
Maybe modify the printf to state that the following code will fail. You have Care must be taken to ensure that the coordinates result in a memory offset Maybe add the following functions should fail due to .... |
Thanks! I agree that seeing the end of the example fail is confusing. I have taken your suggestion and also added
|
Thanks, run first, ask questions later, that's my motto... when I see errors I tend to look for problems instead of thinking, learning exercise :) |
Haha no problem. As they say "you don't have to prepare to win the lottery, but the lottery has to prepare for someone to win". There will always be somebody running the samples in a hurry, even if many people won't. Better to have them covered from the start :) |
so what to do with this? |
No MakeFile in this folder, copy one from folder globalToShmemAsyncCopy and change globalToShmemAsyncCopy to globalToShmemTMACopy works! |
This sample code shows how to create a TMA descriptor using the driver API and how to initiate a TMA transfer using inline PTX.
I have not yet gotten the chance to copy over the Makefile from other directories. What is the preferred solution here? Also for creating the Visual Studio solution files?
This example can only be compiled with
-arch sm_90
. Previous architectures are not supported.