set `-mllvm -amdgpu-function-calls=true` for HIP builds #569
Comments
I like the alternative you have considered.
On a more serious note, this is easy to do for a single machine (like
Can you open an issue on the AMReX GitHub repo for this problem? Allowing downstream codes to change this is something they need to fix in their CMake settings.
### Description
This updates the Setonix profile to use the latest available module versions. Pawsey updated the software stack in mid-June 2024 to Cray PE 23.09. I've compiled and tested the ShockCloud problem on Setonix GPUs. It appears to work.

### Related issues
#569

### Checklist
_Before this pull request can be reviewed, all of these tasks should be completed. Denote completed tasks with an `x` inside the square brackets `[ ]` in the Markdown source below:_
- [x] I have added a description (see above).
- [x] I have added a link to any related issues (see above).
- [x] I have read the [Contributing Guide](https://github.com/quokka-astro/quokka/blob/development/CONTRIBUTING.md).
- [ ] I have added tests for any new physics that this PR adds to the code.
- [ ] I have tested this PR on my local computer and all tests pass.
- [ ] I have manually triggered the GPU tests with the magic comment `/azp run`.
- [x] I have requested a reviewer for this PR.

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
The underlying problem is that the AMD GPU compiler does register allocation incorrectly: https://discourse.llvm.org/t/the-current-state-of-spilling-function-calls-and-related-problems/2863 . Unfortunately, there is nothing we can do about this until they rewrite their compiler.
@psharda This issue is supposed to be fixed in ROCm 6.3.1, which is now on moth. Do you have a short test case for PopIII that I can run that didn't work before?
Mike says it works now for their nuclear networks without any of the old workarounds.
It looks like this is no longer needed as of ROCm 6.3.1, so I'll close this issue. |
Describe the proposal
We should automatically add `-mllvm -amdgpu-function-calls=true` to the compiler flags when `-DAMREX_GPU_BACKEND=HIP` (AMD GPUs). This works around compiler bugs for large GPU kernels (e.g., chemistry networks).
Describe alternatives you've considered
Alternatively, don't use AMD GPUs.
Additional context
See also: AMReX-Astro/Microphysics#1489.
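For illustration, the proposal could be sketched in a project's `CMakeLists.txt` roughly as follows. This is only a minimal sketch under stated assumptions, not the change that was actually made: it assumes the backend is selected through a cache variable spelled as in the proposal (`AMREX_GPU_BACKEND`), that the snippet runs before the GPU targets are defined, and that applying the flag to every compile line is acceptable.

```cmake
# Hypothetical sketch: add the workaround flag only for HIP (AMD GPU) builds.
# The variable name follows the spelling used in this issue; adjust it if the
# project defines the backend option under a different name.
if(AMREX_GPU_BACKEND STREQUAL "HIP")
  # "SHELL:" (CMake >= 3.12) keeps "-mllvm <arg>" together as a pair, so that
  # option de-duplication cannot drop a repeated "-mllvm".
  add_compile_options("SHELL:-mllvm -amdgpu-function-calls=true")
endif()
```

Per the later comments in this thread, the workaround should no longer be needed as of ROCm 6.3.1, so a guard like this would only matter for older ROCm releases.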