-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Perlmutter/Frontier Test Failure using CUDA aware MPI #102
Comments
I ran into the same error, below you'll find my stack trace.
I am running this test (https://github.com/SCOREC/meshFieldsDist/blob/performance_testing/testDist.cpp) with Cuda aware enabled and submitting my job with the following script:
Note: I've noticed this only occurs for meshes over approximately 600k elements. |
I also tried using |
@Angelyr - after interfacing with NERSC support, it appears that the kokkos parameter |
Note, there was a significant performance penalty with |
warp_test_parallel & rc_field_testp failing with the following error:
Both tests output vaild results for a while before failing. Follow these instructions to reproduce:
environment script:
install script:
allocation script:
The text was updated successfully, but these errors were encountered: