[Bug] H800 run UT failed. #6

Open
Ageliss opened this issue Jan 22, 2024 · 3 comments

Comments

@Ageliss

Ageliss commented Jan 22, 2024

image
This setup cannot pass the unit tests (UT). Could you please check it?
@efrantar
Member

Hi, unfortunately, I don't have access to any H800s (or any Hopper GPUs for that matter), so it is a bit hard to test. Which of the matrix shapes are failing and by how much? Can you perhaps print the result of this line for all test cases, i.e., what is the relative average error?
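For reference, a minimal sketch of what printing the relative average error per test case could look like. It assumes the metric is the mean absolute difference divided by the mean absolute reference value; the function names and the example case labels are hypothetical, and plain Python lists stand in for the flattened output and reference matrices.

```python
def relative_average_error(result, reference):
    """Mean absolute error divided by the mean absolute reference value.

    Assumed definition, not necessarily the one used by the test in question.
    With PyTorch tensors the equivalent expression would be
    (C - C_ref).abs().mean() / C_ref.abs().mean().
    """
    if len(result) != len(reference):
        raise ValueError("result and reference must have the same length")
    if not reference:
        raise ValueError("inputs must be non-empty")
    n = len(reference)
    mean_abs_err = sum(abs(r - e) for r, e in zip(result, reference)) / n
    mean_abs_ref = sum(abs(e) for e in reference) / n
    if mean_abs_ref == 0:
        raise ZeroDivisionError("reference is all zeros; relative error is undefined")
    return mean_abs_err / mean_abs_ref


def report(cases):
    """Format one line per (label, result, reference) test case."""
    lines = []
    for label, result, reference in cases:
        err = relative_average_error(result, reference)
        lines.append(f"{label}: relative average error = {err:.6f}")
    return lines


if __name__ == "__main__":
    # Hypothetical labels and values, only to show the output format.
    cases = [
        ("case A", [1.0, 2.0, 3.0], [1.0, 2.0, 3.0]),
        ("case B", [1.0, 2.0, 3.0], [1.0, 2.0, 4.0]),
    ]
    for line in report(cases):
        print(line)
```

Reporting the value for every shape, rather than only pass or fail, shows whether a failing configuration is slightly above the tolerance or completely wrong.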

@Ageliss
Author

Ageliss commented Jan 23, 2024

> Hi, unfortunately, I don't have access to any H800s (or any Hopper GPUs for that matter), so it is a bit hard to test. Which of the matrix shapes are failing and by how much? Can you perhaps print the result of this line for all test cases, i.e., what is the relative average error?

Yes. With thread_shape = [64, 256], I get the correct result:
image

However, with thread_shape = [128, 128], I get an error:
image

@Qubitium

Qubitium commented Mar 29, 2024

@Ageliss Which CUDA version was the failed test run on? Can you retest on the latest CUDA 12.4 and/or PyTorch 2.2.2?
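One way to collect the version details for a report like this is sketched below. The helper name `environment_report` is hypothetical; it relies only on standard PyTorch attributes and returns `None` for anything that is unavailable, including the case where PyTorch is not installed.

```python
def environment_report():
    """Collect PyTorch/CUDA version details as a dict (None where unavailable)."""
    report = {"torch": None, "cuda": None, "device": None, "capability": None}
    try:
        import torch
    except ImportError:
        # PyTorch is not installed; return the empty report.
        return report

    report["torch"] = torch.__version__
    # CUDA version PyTorch was built against; None on CPU-only builds.
    report["cuda"] = torch.version.cuda
    if torch.cuda.is_available():
        report["device"] = torch.cuda.get_device_name(0)
        # (major, minor) compute capability of device 0.
        report["capability"] = torch.cuda.get_device_capability(0)
    return report


if __name__ == "__main__":
    for key, value in environment_report().items():
        print(f"{key}: {value}")
```

Note that `torch.version.cuda` is the toolkit version PyTorch was built with, which can differ from the driver or the system-wide toolkit, so it may be worth posting the `nvidia-smi` output as well.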
