-
Notifications
You must be signed in to change notification settings - Fork 238
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem with AMD benchmark #105
Comments
Hi @computingdolas , could you help provide the versions on your ROCm software environment? |
Thanks @sunway513 for your response. Here is the what rocminfo says 👍 =====================
|
Thanks @computingdolas , can you share more information? e.g. the log of: |
Hey @sunway513 See this :) rocm-clang-ocl-0.3.0_7997136-1.x86_64 |
Great, so you are on the latest ROCm, thanks :-) |
Here are my results for gemm benchmark in code/amd folder. matrix flops approximately is 2mn*k and TFLOPS = (flops/time)/10^12. For the first case in this I am getting somewhere around 0.15 TFLOPs but I should according to results folder get 1.5 TFLOPs. Please find the data below 👍 Thank you for your support @sunway513 |
Hi @computingdolas, my numbers are very different (much faster) than yours, please find it here: To reproduce my number, please use the following command to run the docker image I've prepared:
|
Hey @sunway513 Thank you for your response. Those are nice numbers for AMD GPUs. Why I am getting this issue any idea ? |
Just to confirm you are still using AMD Vega gfx900. I am using AMD Pro SSG ? |
Is it the driver problem because I am really confused now ? I am looking non-docker solution. I want to know what happened that these numbers are so bad ? |
Hi @computingdolas , my test GPU is MI25, it's GFX900 based. |
Please also clarify which GPU are you really using. I saw some reference of AMD Pro SSG, and that is NOT one of the supported deep learning AMD GPUs. |
hi @sunway513 Ok let's try the docker solution and I will update you in that :) @dagamayank Hey I am using AMD SSG-PRO which is Vega 10 XT architecture. Are you saying we have good ROCm support for that GPU ? I saw the white paper and the data sheet and I saw many references where they mentioned about this capabilities for deep learning stuff. Can you let me know more about this. Thanks :) |
Correction @dagamayank Are you saying we don't have good ROCm support for this GPU ? |
@sunway513 Is it possible to provide me remote access to your AMD mi25 GPU ? |
@computingdolas , I'm not able to provide public access to the MI25 node. However, you can alternatively try with third-party cloud services using VegaFE: |
Hey @sunway513 but https://www.gpueater.com/ don't have MI25 GPUs although having Vega frontier edition |
Yes, VegaFE should run ROCm fine with the similar performance as what I've provided in my log for MI25. |
I am getting the 1/10 flops/s on the AMD Vega architecture as compared to one mentioned in the results folder. Anybody know why ???
The text was updated successfully, but these errors were encountered: