The NVIDIA ICD JSON occasionally goes missing from 'nvidia-ctk cdi generate' #767
Comments
I forgot to mention the NVIDIA Container Toolkit version:
$ nvidia-ctk --version
NVIDIA Container Toolkit CLI version 1.16.1
$ rpm -qf $(which nvidia-ctk)
golang-github-nvidia-container-toolkit-1.16.1-1.fc39.x86_64
Note that the NVIDIA Container Toolkit version didn't change between the NVIDIA ICD JSON for Vulkan being listed and not listed. What changed was that I pulled in the RPM updates for the rest of the Fedora host.
@debarshiray the host path you mention (Looking at some older internal documentation it seems as if this has been the case for a while).
Thanks for looking into it, @elezar! Meanwhile, I reinstalled different versions of Fedora a few times to see if the problem is specific to a particular combination of package versions. I could reproduce it reliably on Fedora 40 and 41, which was surprising because this used to work. :) Now with Fedora 41 Workstation and the proprietary NVIDIA driver from RPM Fusion, I see:
$ rpm --query --file /usr/share/vulkan/icd.d/nvidia_icd.x86_64.json
xorg-x11-drv-nvidia-libs-560.35.03-5.fc41.x86_64
If I force In all cases, Vulkan support is available through the proprietary driver on the host operating system, as shown in the
Who is the publisher of the
It's RPM Fusion. That's where I got the proprietary NVIDIA driver from.
The issue is that the driver package definition (see https://pkgs.rpmfusion.org/cgit/nonfree/xorg-x11-drv-nvidia.git/tree/xorg-x11-drv-nvidia.spec#n294) renames the ICD file, so it no longer has the name that the NVIDIA tooling expects. This means that the NVIDIA Container Toolkit can't locate the expected ICD. My suggestion would be to file a bug against RPM Fusion so that the packaging maintains the behaviour of the NVIDIA driver. As a workaround you could rename / copy the
or add an additional mount to your CDI spec. I will look into what would be required for a more stable workaround, but can't commit to a specific timeline.
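For illustration, the "additional mount" workaround could look roughly like the Python sketch below. It is not part of the NVIDIA Container Toolkit. The helper name `with_extra_mount` is hypothetical, the spec is assumed to be already loaded as a dictionary (for example from a JSON copy of the CDI spec), and both the mount options and the choice to mount the file at the same path inside the container are assumptions.

```python
import copy

# Host path of the ICD JSON as reported earlier in this thread.
ICD_HOST_PATH = "/usr/share/vulkan/icd.d/nvidia_icd.x86_64.json"


def with_extra_mount(spec, host_path, container_path=None,
                     options=("ro", "nosuid", "nodev", "bind")):
    """Return a copy of a CDI spec with one extra top-level bind mount.

    The default options are an assumption about what a read-only bind
    mount in a generated spec typically looks like.
    """
    patched = copy.deepcopy(spec)  # leave the caller's spec untouched
    edits = patched.get("containerEdits") or {}
    mounts = edits.get("mounts") or []
    # Assumption: mount at the same path in the container unless told otherwise.
    target = container_path or host_path
    if not any(m.get("hostPath") == host_path for m in mounts):
        mounts.append({
            "hostPath": host_path,
            "containerPath": target,
            "options": list(options),
        })
    edits["mounts"] = mounts
    patched["containerEdits"] = edits
    return patched


# Minimal hand-written spec, for illustration only.
spec = {"cdiVersion": "0.5.0", "kind": "nvidia.com/gpu", "devices": []}
patched = with_extra_mount(spec, ICD_HOST_PATH)
print(patched["containerEdits"]["mounts"])
```

A manual edit like this would presumably be lost whenever the spec is regenerated, so it is a stopgap rather than a fix.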
I have been playing with the NVIDIA Container Toolkit on Fedora 39 Workstation and the proprietary NVIDIA driver from RPM Fusion. I have noticed that the NVIDIA installable client driver (or ICD) JSON for Vulkan occasionally goes missing from nvidia-ctk cdi generate:
...
even though the file is present on the host operating system at /usr/share/vulkan/icd.d/nvidia_icd.x86_64.json and Vulkan support on the host is confirmed by:
This means that Podman containers don't have Vulkan support through the proprietary NVIDIA driver, and can only use LLVMpipe.
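One way to check for the symptom described above is to scan a generated CDI spec for mounts that point into a Vulkan `icd.d` directory. The Python sketch below assumes the spec is available as JSON and follows the usual CDI layout (`containerEdits.mounts[].hostPath`, both at the top level and per device). The helper name `icd_mounts` and the sample spec are made up for illustration.

```python
import json


def icd_mounts(spec, needle="icd.d"):
    """Return the hostPath of every mount in a CDI spec that contains `needle`."""
    # Collect the top-level containerEdits plus the per-device ones.
    all_edits = [spec.get("containerEdits") or {}]
    for device in spec.get("devices") or []:
        all_edits.append(device.get("containerEdits") or {})

    found = []
    for edits in all_edits:
        for mount in edits.get("mounts") or []:
            host = mount.get("hostPath", "")
            if needle in host:
                found.append(host)
    return found


# Hand-written sample spec (hypothetical): it mounts one library but no
# Vulkan ICD JSON, which mirrors the situation reported in this issue.
sample = json.loads("""
{
  "cdiVersion": "0.5.0",
  "kind": "nvidia.com/gpu",
  "devices": [{"name": "all", "containerEdits": {}}],
  "containerEdits": {
    "mounts": [
      {"hostPath": "/usr/lib64/libcuda.so.1",
       "containerPath": "/usr/lib64/libcuda.so.1",
       "options": ["ro", "nosuid", "nodev", "bind"]}
    ]
  }
}
""")

if not icd_mounts(sample):
    print("No Vulkan ICD JSON is mounted by this spec")
```

An empty result would match the case where containers fall back to LLVMpipe.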
Right now, I am observing this problem with: