Commit 63d98ea
authored
Pin nvidia-container-toolkit to version 1.16.2 (#5852)
Yesterday's nvidia-container-toolkit v1.17.0
[release](https://github.com/NVIDIA/nvidia-container-toolkit/releases/tag/v1.17.0)
seems to have broken some of our domain images, causing `docker run
--gpus all [image]" to fail with the error:
```
$ docker run --gpus all [IMAGE]
docker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: error parsing IMEX info: unsupported IMEX channel value: all: unknown.
ERRO[0000] error waiting for container: context canceled
```
Pinning the toolkit to the previous version to mitigate the failure for
now
Testing:
- Validated locally
- TBD: Currently testing on a domain repo1 parent 49fb39b commit 63d98ea
1 file changed
+2
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
43 | | - | |
| 43 | + | |
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | | - | |
| 54 | + | |
55 | 55 | | |
56 | 56 | | |
57 | 57 | | |
| |||
0 commit comments