Parsing default IMEX info fails for legacy images #797

Open
astefanutti opened this issue Nov 14, 2024 · 8 comments
Labels
bug Issue/PR to expose/discuss/fix a bug

Comments

@astefanutti

Since the latest 1.17.x versions, containers using images detected as "legacy" that do not have the NVIDIA_IMEX_CHANNELS environment variable set fail to start with the following error:

Error: container create failed: time="2024-11-13T16:24:41Z" level=error msg="runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: nvidia-container-cli.real: error parsing IMEX info: unsupported IMEX channel value: all\n" 

It seems the NVIDIA_IMEX_CHANNELS environment variable defaults to all here for "legacy" images:

return NewVisibleDevices("all")

That value cannot be parsed by https://github.com/NVIDIA/libnvidia-container/blob/63d366ee3b4183513c310ac557bf31b05b83328f/src/cli/common.c#L446.

One occurrence of this issue has been reported in pytorch/test-infra#5852, for example.

This case should ideally be handled more gracefully.
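
For illustration only, here is a minimal Go sketch (with hypothetical names, not the toolkit's actual API) of the kind of graceful handling requested: accept comma-separated numeric channel IDs, and treat "all" or an empty value as "no explicit IMEX channels" instead of aborting container start.

    package main

    import (
        "fmt"
        "strconv"
        "strings"
    )

    // parseIMEXChannels converts an NVIDIA_IMEX_CHANNELS value into channel IDs.
    // "all" and "" are treated as "no explicit channels requested" rather than
    // being rejected, so legacy images without the variable keep starting.
    func parseIMEXChannels(value string) ([]int, error) {
        if value == "" || value == "all" {
            return nil, nil
        }
        var channels []int
        for _, field := range strings.Split(value, ",") {
            id, err := strconv.Atoi(strings.TrimSpace(field))
            if err != nil || id < 0 {
                return nil, fmt.Errorf("unsupported IMEX channel value: %q", field)
            }
            channels = append(channels, id)
        }
        return channels, nil
    }

    func main() {
        // "all" and "" parse to no channels; only malformed values error out.
        for _, v := range []string{"all", "", "0", "0,1", "bogus"} {
            ids, err := parseIMEXChannels(v)
            fmt.Printf("%q -> %v, err=%v\n", v, ids, err)
        }
    }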

@higi

higi commented Nov 15, 2024

Can anyone help me with this?

2024-11-15T11:55:12Z create container gshaibi/gpu-burn:latest
2024-11-15T11:55:13Z latest Pulling from gshaibi/gpu-burn
2024-11-15T11:55:13Z Digest: sha256:ed07993b0581228c2bd7113fae0ed214549547f0fa91ba50165bc2473cfaf979
2024-11-15T11:55:13Z Status: Image is up to date for gshaibi/gpu-burn:latest
2024-11-15T11:55:14Z start container for gshaibi/gpu-burn:latest: begin
2024-11-15T11:55:14Z error starting container: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: error parsing IMEX info: unsupported IMEX channel value: all: unknown
2024-11-15T11:55:30Z start container for gshaibi/gpu-burn:latest: begin
2024-11-15T11:55:30Z error starting container: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: error parsing IMEX info: unsupported IMEX channel value: all: unknown

Tested on NVIDIA Container Toolkit CLI version 1.17.1.

@markjolah

Possible workaround (WAR): set NVIDIA_IMEX_CHANNELS to 0 or an empty string.

docker run ... -e NVIDIA_IMEX_CHANNELS=0 ...

Or, for a Kubernetes Pod spec, set:

    env:
    - name: NVIDIA_IMEX_CHANNELS
      value: "0"

@elezar
Member

elezar commented Nov 15, 2024

We have just released v1.17.2 that should address this issue. Please let us know if the problem persists.
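
If you are unsure which version is installed, on apt-based systems you can check and upgrade roughly like this (package names assume the standard NVIDIA apt repository setup):

    apt list --installed 2>/dev/null | grep nvidia-container-toolkit
    sudo apt-get update && sudo apt-get install --only-upgrade nvidia-container-toolkit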

elezar added the bug label on Nov 15, 2024
@higi

higi commented Nov 16, 2024

> We have just released v1.17.2 that should address this issue. Please let us know if the problem persists.

I am now on v1.17.2, but I think this is a different problem. nvidia-smi shows all GPUs without any problem.

2024-11-16T09:26:40.108376033Z Failed to initialize NVML: Unknown Error
2024-11-16T09:26:40.207661945Z terminate called after throwing an instance of 'std::string'
2024-11-16T09:26:40.302441418Z No CUDA devices
2024-11-16T09:26:45.770657675Z Failed to initialize NVML: Unknown Error
2024-11-16T09:26:45.855912077Z terminate called after throwing an instance of 'std::string'
2024-11-16T09:26:45.957591526Z No CUDA devices

@higi

higi commented Nov 16, 2024

Fixed it by editing /etc/nvidia-container-runtime/config.toml (sudo vim) and changing no-cgroups to false, then saving.
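
For reference, the relevant setting lives under the [nvidia-container-cli] section of that file; after the change it should look roughly like this (other settings omitted):

    [nvidia-container-cli]
    no-cgroups = false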

I think version 1.17.2 fixed the problem with the IMEX channel. Many thanks for the quick fix!

@elezar
Member

elezar commented Nov 29, 2024

@higi do you know why no-cgroups was set to true?

@higi

higi commented Nov 29, 2024

> @higi do you know why no-cgroups was set to true?

I think it was set just for an NVML test, for AI tools. The NVML test doesn't work without setting this.

This script fixed it: wget https://raw.githubusercontent.com/jjziets/vasttools/main/nvml_fix.py. Anyway, the IMEX channel error was fixed by your fix.

@liatamax

liatamax commented Feb 4, 2025

I'd like to confirm that this IMEX issue still exists with my nvidia-container-toolkit version 1.17.4-1; I ran into it while installing NVIDIA Cloud Native Stack v14.

And /etc/nvidia-container-runtime/config.toml has "no-cgroups = false" commented out.

(base) user@h100:~$ apt list --installed | grep toolkit

WARNING: apt does not have a stable CLI interface. Use with caution in scripts.

nvidia-container-toolkit-base/unknown,now 1.17.4-1 amd64 [installed,automatic]
nvidia-container-toolkit/unknown,now 1.17.4-1 amd64 [installed]

I am following the instructions from NVIDIA - https://github.com/NVIDIA/cloud-native-stack/blob/master/install-guides/Ubuntu-22-04_Server_Developer-x86-arm64_v14.0.md#Validate-NVIDIA-Cloud-Native-Stack-with-an-application-from-NGC

And the second validation task runs into this IMEX error with the Kubernetes YAML file suggested at the link above:

(base) user@h100:~$ cat k8-pod-cuda-vector-add.yaml
apiVersion: v1
kind: Pod
metadata:
  name: cuda-vector-add-imex
spec:
  restartPolicy: OnFailure
  containers:
  - name: cuda-vector-add
    image: "k8s.gcr.io/cuda-vector-add:v0.1"

And the pod fails with the following error:

user@h100:~/Downloads/NVIDIA/Github/ACE/workflows/tokkio/scripts/one-click/baremetal$ kubectl describe pod cuda-vector-add
Name:             cuda-vector-add
Namespace:        default
Priority:         0
Service Account:  default
Node:             h100/172.30.1.74
Start Time:       Tue, 04 Feb 2025 01:24:44 +0000
Labels:
Annotations:      cni.projectcalico.org/containerID: fcd7cde8524532b96fa03d662cc2fbd7cb1fbcb7d2e70b53d03c2a77917094d4
                  cni.projectcalico.org/podIP: 192.168.35.61/32
                  cni.projectcalico.org/podIPs: 192.168.35.61/32
Status:           Running
IP:               192.168.35.61
IPs:
  IP:  192.168.35.61
Containers:
  cuda-vector-add:
    Container ID:  containerd://0fa1cd4479d54d83b96fafc2d26000eb3001bd7cef2ef8d83bc89423bbf06c99
    Image:         k8s.gcr.io/cuda-vector-add:v0.1
    Image ID:      k8s.gcr.io/cuda-vector-add@sha256:0705cd690bc0abf54c0f0489d82bb846796586e9d087e9a93b5794576a456aea
    Port:
    Host Port:
    State:          Waiting
      Reason:       CrashLoopBackOff
    Last State:     Terminated
      Reason:       StartError
      Message:      failed to create containerd task: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: nvidia-container-cli.real: error parsing IMEX info: unsupported IMEX channel value: all: unknown
      Exit Code:    128
      Started:      Thu, 01 Jan 1970 00:00:00 +0000
      Finished:     Tue, 04 Feb 2025 01:25:28 +0000
    Ready:          False
    Restart Count:  3
    Environment:
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-cctr4 (ro)
Conditions:
  Type                        Status
  PodReadyToStartContainers   True
  Initialized                 True
  Ready                       False
  ContainersReady             False
  PodScheduled                True
Volumes:
  kube-api-access-cctr4:
    Type:                    Projected (a volume that contains injected data from multiple sources)
    TokenExpirationSeconds:  3607
    ConfigMapName:           kube-root-ca.crt
    ConfigMapOptional:
    DownwardAPI:             true
QoS Class:        BestEffort
Node-Selectors:
Tolerations:      node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                  node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type     Reason     Age                From               Message
  ----     ------     ---                ----               -------
  Normal   Scheduled  60s                default-scheduler  Successfully assigned default/cuda-vector-add to h100
  Normal   Pulled     16s (x4 over 59s)  kubelet            Container image "k8s.gcr.io/cuda-vector-add:v0.1" already present on machine
  Normal   Created    16s (x4 over 59s)  kubelet            Created container cuda-vector-add
  Warning  Failed     16s (x4 over 59s)  kubelet            Error: failed to create containerd task: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: nvidia-container-cli.real: error parsing IMEX info: unsupported IMEX channel value: all: unknown
  Warning  BackOff    1s (x6 over 57s)   kubelet            Back-off restarting failed container cuda-vector-add in pod cuda-vector-add_default(431d0013-c58e-4ece-b69a-a79cfb579438)

The fix is to add the following env section to the Kubernetes YAML file, under the container entry:

    env:
    - name: NVIDIA_IMEX_CHANNELS
      value: "0"
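
Putting it together, the patched manifest would look roughly like this (the same Pod as above, with only the env section added):

    apiVersion: v1
    kind: Pod
    metadata:
      name: cuda-vector-add-imex
    spec:
      restartPolicy: OnFailure
      containers:
      - name: cuda-vector-add
        image: "k8s.gcr.io/cuda-vector-add:v0.1"
        env:
        - name: NVIDIA_IMEX_CHANNELS
          value: "0"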

(Side note unrelated to this project: this particular pod still fails with exit code 137 when I launch it, executing the /usr/local/cuda-8.0/samples/0_Simple/vectorAdd command.)

root@cuda-vector-add-imex:/usr/local/cuda-8.0/samples/0_Simple/vectorAdd# env | grep NVIDIA
NVIDIA_IMEX_CHANNELS=0
root@cuda-vector-add-imex:/usr/local/cuda-8.0/samples/0_Simple/vectorAdd# command terminated with exit code 137
