I am just going to take a few notes of what I do to start new systems for machine learning.
Get basic tools installed for Ubuntu including Docker:
Download drivers for NVIDIA GPUs: Drivers | GeForce (you want Linux 64-bit). Once you get them on the machine (note need the direct download link):
Run through the prompts. If you installed an x server and you are getting an error you can add--no-x-check
Install nvidia-docker:
Get basic tools installed for Ubuntu including Docker:
Code:
sudo apt-get update && sudo apt-get upgrade -y
sudo apt-get install build-essential autoremove git
wget -qO- https://get.docker.com/ | sh
sudo usermod -aG docker $USER
sudo reboot
Code:
chmod +x NVIDIA-Linux-x86_64-381.22.run
sudo ./NVIDIA-Linux-x86_64-381.22.run
Install nvidia-docker:
Code:
# Install nvidia-docker and nvidia-docker-plugin
wget -P /tmp https://github.com/NVIDIA/nvidia-docker/releases/download/v1.0.1/nvidia-docker_1.0.1-1_amd64.deb
sudo dpkg -i /tmp/nvidia-docker*.deb && rm /tmp/nvidia-docker*.deb
# Test nvidia-smi
nvidia-docker run --rm nvidia/cuda nvidia-smi