-
Notifications
You must be signed in to change notification settings - Fork 306
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NVIDIA_DRIVER_CAPABILITIES=graphics is broken on Jetson devices (1.17.1 or later) #795
Comments
I can confirm this behaviour on the following additional env:
Both on Ubuntu 20.04, docker version 27.3.1 The failing symlink happens to be the first I suspect that somewhere in v1.16.2...v1.17.1 is a change to the handling of symlinks that has broken the functionality. |
Facing the same issue with:
Also, I think it's worth mentioning that I have both Temporary Fix Downgraded container toolkit to
|
I am also experiencing this issue on my Jetson device. Here are the details of my setup: Hardware: Jetson Orin 16GB
After downgrading to nvidia-container-toolkit version 1.15.0-1, the container runs successfully:
To resolve the issue, I only needed to purge nvidia-container-toolkit-base and nvidia-container-toolkit, and install version 1.15.0-1 of both. Here are the steps I followed:
After downgrading, the containers are running correctly using the NVIDIA runtime. Summary of my findings: Hardware: Jetson Orin 16GB I hope this information helps in identifying and fixing the bug. |
Facing the same issue. |
I don't think the problem is in the environment variable. I can run a vanilla ubuntu image with any value of NVIDIA_DRIVER_CAPABILITIES and it runs without a problem. It seems more likely that the issue is with the specific symlink to I suspect that there's some more general fix to symlink handling that needs to happen in order to prevent the issue from just moving on the next file that needs a symlink. |
Using |
I chased this a bit harder, put a bunch of print statements into the createLink code in
For some reason, the symlink that fails has a target that's just the symlink itself, rather than the real target. My understanding stops at that point. |
Summary
On Jetson(
aarch64
,Tegra SoC
) devices, version1.17.1
is not creating containers properly, if environment variableNVIDIA_DRIVER_CAPABILITIES
contains any ofdisplay
,graphics
,all
value.This could be mitigated by overriding container env, for example
docker run -e NVIDIA_DRIVER_CAPABILITIES=compute nvcr.io/...
.Steps to reproduce
Get a Jetson device. I tested with {Xavier, Orin} AGX DevKit as a reference.
Install
Docker runtime
andnvidia-container-runtime=1.17.1-1
Ensure nvidia container runtime has configured. To configure, run
sudo nvidia-ctk runtime configure --set-as-default
Try running a container. For example, l4t-base image could be used. For example:
OR, even with non-jetson base images:
Result
Example of error message
The text was updated successfully, but these errors were encountered: