问题形容:server重启之后,发现GPU无奈应用,显示如下error info

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

solution:

  1. sudo apt-get install dkms
  2. sudo dkms install -m nvidia -v 460.84

确认已装置NVIDIA driver 版本号

ls -lh /usr/src/nvidia-*** 即可找到对应版本号