ImagePull Problems With Nvidia Toolkit

Neardy_Neanderthal · July 22, 2024, 4:25pm

Hello everyone, has anyone experienced issues with vGPU and the nvidia-toolkit container? I am trying to pull the image from a self-hosted Artifactory instance, but the pod can't pull it. This host doesn't have internet access, yet it still tries to reach docker.io, which it can't.

nvidia-driver-runtime-22gtq                            0/1     ImagePullBackOff   0               3d18h

Events:
  Type     Reason   Age                      From     Message
  ----     ------   ----                     ----     -------
  Warning  Failed   60m (x260 over 3d18h)    kubelet  Failed to pull image "rancher/harvester-nvidia-driver-toolkit:v1.3-20240613": rpc error: code = DeadlineExceeded desc = failed to pull and unpack image "docker.io/rancher/harvester-nvidia-driver-toolkit:v1.3-20240613": failed to resolve reference "docker.io/rancher/harvester-nvidia-driver-toolkit:v1.3-20240613": failed to do request: Head "https://registry-1.docker.io/v2/rancher/harvester-nvidia-driver-toolkit/manifests/v1.3-20240613": dial tcp 3.219.239.5:443: i/o timeout
  Warning  Failed   15m (x243 over 3d17h)    kubelet  Failed to pull image "rancher/harvester-nvidia-driver-toolkit:v1.3-20240613": rpc error: code = DeadlineExceeded desc = failed to pull and unpack image "docker.io/rancher/harvester-nvidia-driver-toolkit:v1.3-20240613": failed to resolve reference "docker.io/rancher/harvester-nvidia-driver-toolkit:v1.3-20240613": failed to do request: Head "https://registry-1.docker.io/v2/rancher/harvester-nvidia-driver-toolkit/manifests/v1.3-20240613": dial tcp 54.196.99.49:443: i/o timeout
  Normal   BackOff  37s (x21816 over 3d18h)  kubelet  Back-off pulling image "rancher/harvester-nvidia-driver-toolkit:v1.3-20240613"

tserong · July 25, 2024, 2:47am

Did you update the Image Repository and Image Tag settings on the nvidia-driver-toolkit screen to point to your private registry? (See the "Nvidia Driver Toolkit" page in the Harvester documentation.)
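
For context on why the pull goes to docker.io: in the events above, the image is requested as "rancher/harvester-nvidia-driver-toolkit:v1.3-20240613" with no registry host, and the runtime expands it to "docker.io/rancher/...". Here is a rough Python sketch of that Docker-style name expansion. It is a simplified illustration, not the runtime's actual code; the `qualify` function and the `artifactory.example.internal` host are made up for the example.

```python
DEFAULT_REGISTRY = "docker.io"  # default used when a name carries no registry host


def qualify(image: str) -> str:
    """Expand an image name the way Docker-style runtimes do (simplified)."""
    first, sep, _rest = image.partition("/")
    # The first path component counts as a registry host only if it looks
    # like one: it contains a dot or a port, or it is exactly "localhost".
    has_registry = bool(sep) and ("." in first or ":" in first or first == "localhost")
    if has_registry:
        return image
    if not sep:
        # Single-component names such as "nginx" live under "library/".
        image = "library/" + image
    return f"{DEFAULT_REGISTRY}/{image}"


# Name from the events above: no registry host, so docker.io is assumed.
print(qualify("rancher/harvester-nvidia-driver-toolkit:v1.3-20240613"))

# Hypothetical private registry host: the name is left untouched.
print(qualify("artifactory.example.internal/rancher/harvester-nvidia-driver-toolkit:v1.3-20240613"))
```

So, assuming the Image Repository setting is used as the image name as-is, it likely needs to include your Artifactory host in front of the repository path; otherwise the node will keep trying Docker Hub.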
