This image is based on:
- Ubuntu 20.04
- CUDA 11.4
The workflow emulates ssh-ing into a remote machine, as opposed to driving containers through the Docker API (although you can still use it). The focus is on development only, not production. It optimizes for:
- user experience, with no steep learning curve
- simplicity

It does not optimize for (i.e. pays less attention to https://pythonspeed.com/articles/official-docker-best-practices/):
- image size -> no Dockerfile tricks, just plain copy-paste
- security -> runs as root, the Docker default
Features:
- OpenGL and graphics (`glxgears` works)
- desktop GUI via browser (VNC is open at `<this_ip>:8080/vnc.html` by default)
- passwordless ssh access
- GPU training example with PyTorch and Lightning
- Google Cloud setup
You probably already have a repo that you want to dockerize.
First decision point:
(1) Add a Dockerfile right into your repo (easiest)
(2) Make a high-level repo into which you put your existing repo as a submodule or subtree (a bit cleaner, but with some hassle)

Option (1) is suboptimal if you have a `setup.py` in the root of your repo AND you want to install it into a venv inside Docker AND you want to edit files.
Add this repo as a subtree:

```bash
git subtree add --prefix docker_mlgl [email protected]:olegsinavski/docker_mlgl.git main --squash
```

(or as a submodule - not recommended: `git submodule add [email protected]:olegsinavski/docker_mlgl.git docker_mlgl`)
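To pull upstream changes into the subtree later, the standard `git subtree pull` invocation looks like this (same remote and prefix as above):

```bash
# update the docker_mlgl subtree in place; --squash keeps the history compact
git subtree pull --prefix docker_mlgl [email protected]:olegsinavski/docker_mlgl.git main --squash
```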
Create a `sandbox.sh` script with this content:
```bash
#!/usr/bin/env bash
set -e
PROJECT_NAME=<YOUR_REPO_NAME>
PYTHON_VERSION=3.9
# https://stackoverflow.com/questions/59895/how-do-i-get-the-directory-where-a-bash-script-is-located-from-within-the-script
SCRIPT_DIR=$( cd -- "$( dirname -- "${BASH_SOURCE[0]}" )" &> /dev/null && pwd )
./docker_mlgl/stop_sandbox.sh $PROJECT_NAME
# Build parent image
./docker_mlgl/build.sh mlgl_sandbox $PYTHON_VERSION
docker build -t $PROJECT_NAME $SCRIPT_DIR
./docker_mlgl/start_sandbox.sh $PROJECT_NAME $SCRIPT_DIR
SANDBOX_IP="$(docker inspect -f '{{ .NetworkSettings.IPAddress }}' $PROJECT_NAME)"
ssh docker@$SANDBOX_IP
```
Make it executable: `chmod +x sandbox.sh`
Create a `Dockerfile` in the root with this content:

```dockerfile
FROM mlgl_sandbox
```
Install Docker if needed: `./install_docker.sh`
Run `./sandbox.sh`. It should build a Docker image with your project name and then drop you into a developer sandbox.
The sandbox runs in Docker, and you can always exit and then ssh back into it.
You can also rerun `./sandbox.sh` if you don't want to ssh; it rebuilds quickly since Docker caches build stages.
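If you just want another shell in the already-running sandbox, you can ssh in directly, the same way `sandbox.sh` does at the end (a sketch; `<YOUR_REPO_NAME>` is the same value as `PROJECT_NAME` in `sandbox.sh`):

```bash
# look up the container IP and ssh in as the docker user
ssh docker@"$(docker inspect -f '{{ .NetworkSettings.IPAddress }}' <YOUR_REPO_NAME>)"
```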
Your repo is available under the `~/` directory in the sandbox.
Additionally, a storage folder `~/storage` in the container is mapped to the `~/.${project_name}_storage` folder on your desktop.
Use it for artifacts that you want to persist between rebuilds (e.g. network weights).
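For example (a sketch; the checkpoint path is hypothetical and depends on your training setup):

```bash
# inside the sandbox: anything written to ~/storage survives container rebuilds
mkdir -p ~/storage/checkpoints
cp lightning_logs/version_0/checkpoints/*.ckpt ~/storage/checkpoints/
```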
Currently, your sandbox is not very useful. You need to add your custom setup to the `Dockerfile`:
First, run some `apt-get` installs if needed, and then choose your development environment: conda, venv or system Python (see below).
Note that since the container is completely isolated, you don't have to use conda or venv for isolation.
If it's easier for you, just install things into the system. Here are some examples.
If you have some `apt-get` installs or a system script, simply call it from the Dockerfile.
The parent Dockerfile defines an `APT_INSTALL` variable for installing packages without interactive prompts.
For example, if you need to install `libsfml-dev`, run:

```dockerfile
RUN apt-get update && $APT_INSTALL libsfml-dev
```
If you have a `setup.sh` script, add:

```dockerfile
COPY <path_to_setup_sh>/setup.sh /root/setup.sh
RUN chmod +x /root/setup.sh
RUN /root/setup.sh
```
The easiest and most robust way to install requirements is with requirement locking.
Read here about a similar technique for conda.
- Create `requirements.txt`.
- Ssh into the container (e.g. by running `./sandbox.sh`) and cd into the `~/<YOUR_REPO_NAME>` folder; you should find `requirements.txt` there.
- Run `pip-compile --generate-hashes --output-file=requirements.txt.lock --resolver=backtracking requirements.txt`. NOTE: you'll need at least 16GB RAM for this!
- You should now find the `requirements.txt.lock` file in your repo on the host. Commit both files.
- Add the following to your `Dockerfile`:
```dockerfile
COPY requirements.txt.lock requirements.txt.lock
RUN python -m pip --no-cache-dir install --no-deps --ignore-installed -r requirements.txt.lock
# Add the repo folder to PYTHONPATH; the sandbox mounts your code there
ENV PYTHONPATH "${PYTHONPATH}:~/<YOUR_REPO_NAME>"
```
Also notice that you can change the system Python version with the `PYTHON_VERSION` variable in `sandbox.sh` (tested with 3.8 and 3.9 so far).
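When dependencies change, the update loop looks roughly like this (a sketch; the package name is just an example):

```bash
# inside the sandbox, in ~/<YOUR_REPO_NAME>
echo "pytorch-lightning" >> requirements.txt      # add or change a dependency
pip-compile --generate-hashes \
    --output-file=requirements.txt.lock \
    --resolver=backtracking requirements.txt      # regenerate the lock file
git add requirements.txt requirements.txt.lock    # commit both files
exit                                              # leave the sandbox ...
./sandbox.sh                                      # ... and rebuild; Docker layer caching keeps it fast
```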
If you have an `environment.yml` file in your repo, add the following to your `Dockerfile`:
```dockerfile
USER docker
COPY environment.yml /home/docker/environment.yml
RUN conda env create -f ~/environment.yml
# activate the conda env on login
RUN echo "conda activate <YOUR_ENV_NAME>" >> ~/.bashrc
```
Notice that this does NOT activate the conda environment during the build, only when ssh-ing into the sandbox.
If you want to run `conda` commands inside the env during the build, use the following recipe from [here](https://pythonspeed.com/articles/activate-conda-dockerfile/):
SHELL ["conda", "run", "-n", "<YOUR_ENV_NAME>", "/bin/bash", "-c"]
# this will run inside <YOUR_ENV_NAME> conda env
RUN python setup.py. develop
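After rebuilding, you can quickly confirm that the env auto-activates on login (a sketch; the env name is whatever you defined in `environment.yml`):

```bash
# inside an ssh session into the sandbox
conda env list                                   # the active env is marked with '*'
python -c "import sys; print(sys.executable)"    # should point inside the conda env
```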
Based on [this](https://pythonspeed.com/articles/activate-virtualenv-dockerfile/):
```dockerfile
USER docker
WORKDIR /home/docker/
ENV VIRTUAL_ENV=/home/docker/venv
RUN python3.8 -m venv $VIRTUAL_ENV
ENV PATH="$VIRTUAL_ENV/bin:$PATH"
RUN echo "source venv/bin/activate" >> ~/.bashrc
COPY --chown=docker:docker ./folder_to_install /home/docker/folder_to_install
WORKDIR /home/docker/folder_to_install
RUN pip install -e .
```
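A quick sanity check after ssh-ing back in (a sketch; `your_package` is a hypothetical import name for the editable install):

```bash
which python                      # expected: /home/docker/venv/bin/python
python -c "import your_package"   # the editable install should be importable
```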
When the sandbox is running, you can open `<this_ip>:8080/vnc.html` and see a simple desktop.
You can run GUI utilities there, e.g. `matplotlib`, `opencv` or `pygame`.
The sandbox supports 3D rendering: check that GPU rendering works with `glxgears` (you should see a spinning gear).
So you can run 3D simulators like `gym` or `pybullet` there.
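A couple of quick checks from a terminal in the VNC desktop (a sketch; `glxinfo` ships in the same `mesa-utils` package as `glxgears`, assuming it is installed in the image):

```bash
glxinfo | grep "OpenGL renderer"   # should report your NVIDIA GPU rather than a software renderer
glxgears                           # a spinning gear confirms 3D rendering works
```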
This allows you to run `ssh sandbox` on your laptop and land right in the sandbox on a remote dev server.
Copy your keys from your development laptop to the remote server:

```bash
scp ~/.ssh/id_ed25519 <ssh_name_of_the_server>:~/.ssh/
scp ~/.ssh/id_ed25519.pub <ssh_name_of_the_server>:~/.ssh/
```
On your development laptop, configure a proxy jump to the sandbox (in `~/.ssh/config`):

```
Host sandbox
    Hostname 172.17.0.2
    User <youruser>
    ProxyJump <ssh_name_of_the_server>
    StrictHostKeyChecking no
```
Notice the explicit address `172.17.0.2`; your Docker container's address could be different.
To get the container's IP address, run `docker inspect -f '{{ .NetworkSettings.IPAddress }}' <PROJECT_NAME>`.
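With that in place, the sandbox behaves like any other ssh host (a sketch; `train.py` is just an example file, and `172.17.0.2` is the container IP from the config above):

```bash
ssh sandbox                                            # jump through the dev server straight into the sandbox
scp ./train.py sandbox:~/                              # copy files the usual way
ssh -L 8080:172.17.0.2:8080 <ssh_name_of_the_server>   # then open http://localhost:8080/vnc.html locally
```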
There is a default `docker` user created during the build, and a `root` user.
We recommend using `docker` for all user-level installations, such as venvs and conda.
There is passwordless `sudo` for the `docker` user in case you need it.
`invalid argument <XXX> for "-t, --tag" flag: invalid reference format: repository name must be lowercase`
Docker requires a fully lowercase image name. Use a lowercase `PROJECT_NAME` variable in `sandbox.sh`.
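For example (illustrative values):

```bash
# in sandbox.sh
PROJECT_NAME=my_cool_repo     # OK
# PROJECT_NAME=My_Cool_Repo   # fails: "repository name must be lowercase"
```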
After I run `./sandbox.sh`, I'm still asked for a password to log in to the sandbox.
Solution:

```bash
eval $(ssh-agent)
ssh-add ~/.ssh/<FIRST_PRIVATE_KEY_IN_SSH_FOLDER>
```
- Install the deepo Docker container dependencies so that it works (you don't need deepo itself): https://github.com/ufoym/deepo
- Enable default GPU support by picking the nvidia runtime (see the sketch after this list): https://stackoverflow.com/questions/59652992/pycharm-debugging-using-docker-with-gpus
- Make the Docker daemon available on a fixed port: https://dockerlabs.collabnix.com/beginners/components/daemon/access-daemon-externally.html
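A sketch of what the nvidia default-runtime change typically looks like (assuming `nvidia-container-runtime` is installed at its standard path; the linked answer covers the details):

```bash
# make the nvidia runtime the default in /etc/docker/daemon.json, then restart the daemon
# (merge by hand if you already have other settings in daemon.json)
sudo tee /etc/docker/daemon.json > /dev/null <<'EOF'
{
    "default-runtime": "nvidia",
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
EOF
sudo systemctl restart docker
```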
- `xvfb` - creates a virtual X11 display
- `fluxbox` - uses the virtual X11 display and runs a window manager (`xterm` adds a terminal)
- `x11vnc` - exposes all that via a VNC server (makes it available to VNC clients)
- `websockify` - translates WebSockets traffic to normal socket traffic so it is available via the browser
1. Install https://github.com/docker/buildx
2. Clone https://gitlab.com/nvidia/container-images/cuda
3. Run `build.sh` from https://gitlab.com/nvidia/container-images/cuda/-/blob/master/build.sh

This is an example (you can add `--push` to push the image):

```bash
./build.sh -d --image-name <yourname>/cudagl --cuda-version 11.6.1 --os ubuntu --os-version 20.04 --arch x86_64 --cudagl
```