forked from GDSL-UL/the_knowledge
-
Notifications
You must be signed in to change notification settings - Fork 1
/
06_Docker.Rmd
173 lines (123 loc) · 8.95 KB
/
06_Docker.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
# Docker
This document describes how to install and use Docker on different platforms.
## Installation
If you are on Mac, Linux or Windows 10 Pro/Student editions, installing Docker
is relatively straightforward:
* [Mac](https://docs.docker.com/docker-for-mac/)
* [Linux official instructions](https://docs.docker.com/install/linux/docker-ce/ubuntu/)
* [Windows 10 Pro/Student](https://docs.docker.com/docker-for-windows/)
It is important to note that, on Mac and Windows, Docker runs under a virtual
machine so it will not use up all of the resources of your machine
(conversely, it'll equate to be working on a more limited machine). This can
be changed. But if you need more firepower, the idea is that you develop on
your laptop and scale to a server (e.g. running out of the lab).
The steps to install Docker include:
- Obtain a copy of Docker and install it:
- `Windows10 Pro/Enterprise`: [Install Docker Desktop for Windows](https://docs.docker.com/docker-for-windows/install/)
- `macOS`: [Get started with Docker Desktop for Mac](https://docs.docker.com/docker-for-mac/)
- Once Docker is successfully installed, make sure to enable access to your main drive (e.g. `C:\\`):
- `Windows10 Pro/Enterprise`: Open the preferences for Docker and click the
"Shared Drives" tab; click on the drive you want to add and then "Apply"
- `macOS`: this feature is automatically enabled
## Image and Container
Docker can be seen as a tool to generate a computer within your computer (the host). This creates a working environment that employs the host's resources but follows its own internal rules.
**Image** and **Container** are two key concepts to understand Docker processes.
**Images** are files containing all the instructions to build a complete and executable version of an application, relying on the host OS kernel.
**Containers** are instantiations of images, meaning that they are instances of the images running in an isolated environment. The same image can be instantiated in multiple containers.
## Useful Docker Commands
List available images :
```shell
docker image ls
```
Pull a new image:
```shell
docker pull image-tag
```
See what containers are running (this also shows you the ID and which port is occupying):
```shell
docker ps
```
The run command instantiates an image in an isolated container. The generic command is:
```shell
docker run [OPTIONS] IMAGE[:TAG] [COMMAND] [ARG...]
```
Options that are generally added are:
```shell
--detach or -d -> Run container in background and print container ID
```
If you do not add this option to the run command you can detach from a running docker session without exiting the shell, the escape sequence *Ctrl + p* followed by *Ctrl + q* can be used.
To re-attach the terminal to the docker session enter:
```shell
docker attach <session_id>
```
Other commonly used flags include:
```shell
--rm -> Automatically remove the container when it exits
```
```shell
--env or -e -> Set environment variables.
```
Environment variables in linux act as placeholders for information stored within the system that passes data to programs launched in shells. A common environment variable is *HOME* which is associated with the path of your home directory (*/home/your-user*). By adding *-e* to the run command you pass the environment variable to the container where a certain image is running. Cases when this option is necessary will be discussed.
```shell
--publish or -p -> Publish a container’s port(s) to the host (This is key when running server-based application through Docker. A network port is the communication enpoint in a client-server model, specifying which program a communication is intended for.)
--volume or -v -> Bind mount a volume
```
All other options are listed [here](https://docs.docker.com/engine/reference/commandline/run/)
To stop the container you have been running you need to detach from the container using the escape sequence (unless you added -d to the run command) and send the following commands:
```shell
docker ps to get the Container ID
docker stop <container-ID>
```
If you have not added -rm option you also need to manually remove the container
```shell
docker rm -f <container-ID>
```
An exemplary run command is the following:
```shell
docker run -d --rm -p 8080:8787 -v ${PWD}:/data rocker/geospatial
```
After the -p option you add the ports mapping: <host port>:<container port>. In this example, docker runs an image with r studio server which uses the port 8787 by default, but the host (the server where you are running docker) has the port 8787 already in use by other services. Therefore, you need to map the port of the image (8787) with a free port of the host (in this example 8080).
After the -v option you add the paths to mount a volume:<host path>:<container path>. In this example, the working directory (the directory where you are running the command from) is mapped to a data directory in the container. \${PWD} stands for **Print Working Directory**.
## Docker GPU Support
For the latest versions of Docker (since 19.03) to access GPUs from a container you will need to have the ```nvidia-container-toolkit``` installed. In addition a ```--gpus``` flag is required to indicate that you want your container to access a GPU. To enable your container to use all GPUs on a machine you can specify ```--gpus all```. For specific GPUs state the device ID ```--gpus '"device=1,2"'```.
To test if the nvidia-container-toolkit is already setup you can run the following example:
```shell
docker run --gpus all nvidia/cuda:10.0-base nvidia-smi
```
The command will first pull the nvidia/cuda:10.0-base image (if it has not been pulled already), and should display the nvidia-smi (System Management Interface):
```shell
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 440.33.01 Driver Version: 440.33.01 CUDA Version: 10.2 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce GTX 108... Off | 00000000:01:00.0 Off | N/A |
| 26% 20C P8 7W / 250W | 10924MiB / 11178MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 1 GeForce GTX 108... Off | 00000000:02:00.0 Off | N/A |
| 22% 25C P8 9W / 250W | 151MiB / 11178MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 2 GeForce GTX 108... Off | 00000000:82:00.0 Off | N/A |
| 27% 31C P8 9W / 250W | 151MiB / 11178MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 25888 G /usr/lib/xorg/Xorg 6MiB |
| 0 29622 C ...fa/agent/tensorflow/bin/python2 10905MiB |
| 1 29622 C ...fa/agent/tensorflow/bin/python2 139MiB |
| 2 29622 C ...fa/agent/tensorflow/bin/python2 139MiB |
+-----------------------------------------------------------------------------+
```
If the above does not work, then you may need to install the nvidia-container-toolkit. You may also need to update your nvidia-drivers for this step to work. On Ubuntu 16.04/18.04/20.04, Debian Jessie/Stretch/Buster the nvidia-container-toolkit can be installed using the following commands:
```shell
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker
```
If you do not have sudo rights you can ask your sys admin to carry out the above steps. Instructions for installing the nvidia-container-toolkit on other operating systems can be found [here](https://github.com/NVIDIA/nvidia-docker).
Having enabled GPU support you can now take advantage of a number of docker images where GPUs can significantly reduce the amount of time required to run your experiments. For instance [TensorFlow](https://www.tensorflow.org/install/docker) and [rapids](https://rapids.ai/start.html).