NVIDIA Tesla Driver 安裝筆記
前言
- Tesla 系列的 GPU 通常用於 Data Center、HPC 等大型機群的情境
NVIDIA 針對 Tesla 的 driver 特別拉出一份文件說明,並有不同 support 週期
- production 環境應避免從 CUDA 安裝包安裝 driver,而是從 NVIDIA driver 頁面下載安裝
- 在 CUDA release note 搜尋
Tesla
,有此說明
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
檢查 GPU 型號
- 列出系統的 NVIDIA 設備,並確認 GPU 是 Tesla 型號
查閱 Release Notes
選擇 Tesla Driver 版本
- 參考 NVIDIA Tesla Driver Lifecycle
- 建議選擇 Long Term Service Branch (LTS),目前建議選擇
R450
系列
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
下載 Tesla Driver
- 下載連結 : https://www.nvidia.com/Download/Find.aspx
- 語言選
Chinse(Traditional)
,下載連結會是 tw.
開頭
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
放置 Driver 安裝檔到 Local Yum Repository
- 維運大型機群,建議在內部環境建立 local yum repository,維持機群的 driver 版本一致,也能避免因操作失誤等行為誤升級 driver
- 這邊是已有 local yum repository,下載及放置 driver rpm 到指定路徑的步驟
- local yum repository server
- client 的 yum 設定
/etc/yum.repos.d/nvidia-local.repo
安裝 Driver
啟用 Service 以及檢查測試
- 安裝完 driver,建議啟用 service
nvidia-persistenced
ref : https://stackoverflow.com/a/13127714
檢查 RPM script
- 檢查 rpm 的 pre/post 的 script,掌握裝 driver 的 rpm 時,額外執行了什麼動作
- 比如有
- 新增 nvidia-persistenced user (nvidia-persistenced-latest*.rpm)
- 新增 kernel cmd (nvidia-driver-latest-*.rpm)
檢查 nvidia-persistenced-latest-dkms
的 script
rpm -qp --scripts nvidia-persistenced-latest-dkms-450.119.04-1.el7.x86_64.rpm
- output
檢查 nvidia-driver-latest-dkms
的 script
rpm -qp --scripts nvidia-driver-latest-dkms-450.119.04-1.el7.x86_64.rpm
- output
Ansible
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
This work is licensed under a CC BY-NC-SA 4.0