Infrastructure
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/troubleshooting.html
更新環境:
sudo apt update && sudo apt upgrade
設定變數:
distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
&& curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - \
&& curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
安裝 nvidia-docker2:
sudo apt update
sudo apt install -y nvidia-docker2
sudo systemctl restart docker
透過 nvidia 的 Docker image,然後下 nvidia-smi 指令確認該容器是否有讀取到GPU:
sudo docker run --rm --runtime=nvidia --gpus all nvidia/cuda:11.6.2-base-ubuntu20.04 nvidia-smi