To evaluate a submission, we mount the ground truth directory into the container and run the evaluation script.

TODO: we should avoid publicizing this, to prevent name conflicts.

General form:

    docker run -v data/:evaluation_data/ [other_options] -it sk_smoke

Here data/ and evaluation_data/ stand in for the host and container directories; Docker expects absolute paths for both, as in the example below.

e.g.

    sudo docker run -v ~/projects/stability-benchmark/data:/dataset/evaluation_data/ -it sk_smoke /bin/bash

TODO (Thiago): script to wire up and run the submitted container with the private evaluation dir.
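
The mount-and-run command above could be assembled by a small helper. The following is only a minimal sketch, not the script planned in the TODO: the function name `build_run_cmd` and the variable `CONTAINER_DATA_DIR` are hypothetical, and the container path `/dataset/evaluation_data/` and the `sk_smoke` image name are taken from the example. It prints the command rather than executing it, and it assumes the host path contains no spaces.

```shell
#!/bin/sh
# Sketch only: prints the docker command instead of running it.
# CONTAINER_DATA_DIR mirrors the example path; adjust it if the image
# expects the data elsewhere (assumption, not confirmed by the project).
CONTAINER_DATA_DIR="/dataset/evaluation_data/"

# build_run_cmd HOST_DATA_DIR IMAGE   (hypothetical helper)
# Prints a `docker run` command that bind-mounts HOST_DATA_DIR
# into the container at CONTAINER_DATA_DIR.
build_run_cmd() {
    host_dir="$1"
    image="$2"
    # Docker bind mounts need an absolute host path; reject anything else.
    case "$host_dir" in
        /*) ;;
        *)
            echo "error: host data dir must be an absolute path: $host_dir" >&2
            return 1
            ;;
    esac
    printf 'docker run -v %s:%s -it %s /bin/bash\n' \
        "$host_dir" "$CONTAINER_DATA_DIR" "$image"
}
```

For instance, `build_run_cmd "$HOME/projects/stability-benchmark/data" sk_smoke` prints a command equivalent to the example; the output can be reviewed first and then run, with `sudo` if the local Docker setup requires it.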