cudarc fails to load libraries on official nvidia ubuntu images #274
Comments
Hmm, I've always used the CUDA devel Docker images (e.g. 12.5.1-cudnn-devel-ubuntu20.04), and those have worked for me. Can you try the devel images? If you need the runtime images, I can look into why they behave differently (I suspect the .12 at the end of the library name is tripping up the dynamic-loading search). Alternatively, you can disable dynamic loading in favor of dynamic linking, and that will likely work.
That could be the reason, because creating symlinks for the libraries above resolves the issue on the runtime image. That workaround works, but I would love to get rid of it :-) We use the devel image for the first build stage and then move the binary to the runtime image to keep the image size small.
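For anyone who needs the same workaround, here is a minimal sketch of it in Rust, assuming the runtime image ships only version-suffixed files such as `libcublas.so.12` (the exact file names and directory are assumptions and should be checked on the image). The helpers `unversioned_name` and `link_unversioned` are hypothetical names for illustration, not part of cudarc.

```rust
use std::fs;
use std::io;
use std::os::unix::fs::symlink;
use std::path::Path;

/// Strip a trailing numeric version from a shared-object file name,
/// e.g. "libcublas.so.12" -> Some("libcublas.so").
/// Returns None when the name has no numeric version postfix.
fn unversioned_name(file_name: &str) -> Option<&str> {
    let idx = file_name.find(".so.")?;
    let version = &file_name[idx + 4..];
    let numeric = !version.is_empty()
        && version
            .split('.')
            .all(|part| !part.is_empty() && part.chars().all(|c| c.is_ascii_digit()));
    if numeric {
        Some(&file_name[..idx + 3])
    } else {
        None
    }
}

/// For every versioned library in `dir`, create an unversioned symlink next to
/// it (like `ln -s libfoo.so.N libfoo.so`) unless that name already exists.
/// Returns the number of links created. If several versioned files share one
/// stem, whichever is listed first becomes the link target.
fn link_unversioned(dir: &Path) -> io::Result<usize> {
    // Collect the names first so the new links do not affect the iteration.
    let mut names = Vec::new();
    for entry in fs::read_dir(dir)? {
        if let Some(name) = entry?.file_name().to_str() {
            names.push(name.to_owned());
        }
    }

    let mut created = 0;
    for name in &names {
        let Some(short) = unversioned_name(name) else { continue };
        let link = dir.join(short);
        // symlink_metadata fails only when nothing exists at that path.
        if link.symlink_metadata().is_err() {
            symlink(name, &link)?; // relative target inside the same directory
            created += 1;
        }
    }
    Ok(created)
}
```

Calling `link_unversioned` on the directory that holds the CUDA libraries in the runtime stage would recreate the unversioned names that the devel image already provides. It needs write access to that directory, so it fits better as a build step than as something done at application start.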
We used I would have tested cudarc
I've just tried
How can I do that? We use
Hmm, looks like the main branch of candle is using dynamic linking already. Are y'all on an older version or a branch? Also, FYI, there was a bug in 0.11.7, so I recommend either upgrading to 0.11.8 or downgrading to 0.11.6 (the version candle is targeting). I'll play around and see if I can get the dynamic loader to account for postfixes to the path. I'm not sure we have that much control over prefixes and postfixes, though (e.g. adding a
BTW I don't see the driver library Did y'all see any errors related to not finding
I've upgraded to 0.11.8. In the logs, I see only the following message, the same one as before:
thread 'main' panicked at /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/cudarc-0.11.8/src/lib.rs:98:5:
Unable to dynamically load the "cublas" shared library - searched for library names: ["cublas", "cublas64", "cublas64_12", "cublas64_125", "cublas64_125_0", "cublas64_120_5", "cublas64_10", "cublas64_120_0", "cublas64_9"]. Ensure that `LD_LIBRARY_PATH` has the correct path to the installed library. If the shared library is present on the system under a different name than one of those listed above, please open a GitHub issue.
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
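The names listed in the panic are bare stems with no version postfix, which fits the theory that a file ending in `.12` is never tried. Below is a rough sketch of how a candidate list could be extended with postfixed file names on Linux. This is not cudarc's actual implementation: `candidate_file_names` and `find_first_existing` are hypothetical helpers, and the major versions passed in are assumptions.

```rust
use std::path::{Path, PathBuf};

/// Build Linux file names for one library stem: the plain name first,
/// then one postfixed name per major version,
/// e.g. ("cublas", [12]) -> ["libcublas.so", "libcublas.so.12"].
fn candidate_file_names(stem: &str, major_versions: &[u32]) -> Vec<String> {
    let base = format!("lib{stem}.so");
    let mut names = vec![base.clone()];
    for major in major_versions {
        names.push(format!("{base}.{major}"));
    }
    names
}

/// Return the first candidate that exists in any of `dirs`,
/// checking directories in order and candidates in order within each.
fn find_first_existing(dirs: &[&Path], candidates: &[String]) -> Option<PathBuf> {
    for dir in dirs {
        for name in candidates {
            let path = dir.join(name);
            if path.exists() {
                return Some(path);
            }
        }
    }
    None
}
```

With a list like this, a loader could try `libcublas.so.12` when `libcublas.so` is missing, which is the case the symlink workaround currently covers.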
Ah yeah, sorry for the miscommunication: upgrading to 0.11.8 won't fix the error in this issue.
I've got you. Just wanted to clarify that I'm on the latest version in case you want me to test something :-)
@coreylowman Wondering if LibreCuda might help?
I've built @coreylowman Are you planning on making a release?
Docker image: nvidia/cuda:12.5.1-runtime-ubuntu24.04
cudarc version: 0.11.7
Error message:
Location of the libraries on nvidia/cuda:12.5.1-runtime-ubuntu24.04: