Set up DL environment and versions/dependencies

When the server is available, the next step is to set up a compatible and robust environment that supports the DL platform.

The most commonly used system is Ubuntu 20.04 LTS, Ubuntu 18.04 LTS, and 16.04 LTS. What's need to be managed include: PyTorch, TensorFlow, CUDA, cuDNN, and NVIDIA drivers.

Lambda stack

The easiest way is to use lambda stack directly: https://lambdalabs.com/lambda-stack-deep-learning-software#server-installation

It includes the setup with python virtual environments, on ubuntu 20.04/18.04 servers, ubuntu 16.04, or some docker container.

Others:

if you already have a system set up, and here is some information that might be helpful:

  • conda takes care of the version compatibility, even on cudnn. preferred it over pip.

  • be cautious about CUDA 10.1 and PyTorch.

Last updated