Repo to build the docker image for Google ChAII
Download the data files from Kaggle and put them inside the mount/dataset folder. We will attach this directory to our Docker container in order to use the dataset and the Jupyter notebook provided.
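If you use the Kaggle CLI, the download can be scripted roughly as below. This is only a sketch: the competition slug is an assumption (take it from the ChAII competition page), and the CLI must already be installed and authenticated with an API token.

# Sketch: download the data into mount/dataset with the Kaggle CLI.
# The competition slug below is an assumption; replace it with the one on the competition page.
mkdir -p mount/dataset
kaggle competitions download -c chaii-hindi-and-tamil-question-answering -p mount/dataset
unzip -o mount/dataset/chaii-hindi-and-tamil-question-answering.zip -d mount/dataset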
sh setupDocker.sh
This script sets up Docker using the official convenience script and installs the nvidia-container-toolkit so that the host machine's GPU can be used inside the container. The documentation for this script can be found here.
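A rough sketch of what such a setup script typically contains is shown below, assuming a Debian/Ubuntu host; the exact steps in setupDocker.sh may differ.

# Install Docker via the official convenience script.
curl -fsSL https://get.docker.com -o get-docker.sh
sudo sh get-docker.sh
# Install the NVIDIA container toolkit (after adding NVIDIA's package repository
# as described in their installation guide), then restart Docker.
sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker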
docker build -t <name:tag> .
docker run --gpus all -it -v <absolute path of directory to be mounted from local>:/root/mount -p 8081:8081 <name:tag>
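For example, with a placeholder image name and a placeholder local mount path:

# chaii:latest and /home/user/chaii/mount are placeholders; substitute your own values.
docker build -t chaii:latest .
docker run --gpus all -it -v /home/user/chaii/mount:/root/mount -p 8081:8081 chaii:latest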
- TASK : chaii_hi / chaii_ta
- DATA_DIR : the absolute directory path containing the dataset files (no suffix will be added for chaii tasks)
- OUT_DIR : the absolute directory path to where the output files need to be written
- MODEL : the pretrained model to use
  - default : bert-base-multilingual-cased
- GPU : the GPU device number to use
  - default : 0
- TRAIN_FILE_NAME : the name of the file containing the training data (inside DATA_DIR)
  - default : for TASK == chaii_hi, train.hi.qa.jsonl; for TASK == chaii_ta, train.ta.qa.jsonl
- PREDICT_FILE_NAME : the name of the file containing the validation data (inside DATA_DIR)
  - default : for TASK == chaii_hi, dev.hi.qa.jsonl; for TASK == chaii_ta, dev.ta.qa.jsonl
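The parameters above configure training. A minimal sketch of how they might be supplied is shown below; the entrypoint script name (train.sh) and the environment-variable style are assumptions, since the actual script name is not given here.

# Hypothetical training invocation inside the container.
# train.sh is a placeholder name; the environment-variable style is an assumption.
TASK=chaii_hi \
DATA_DIR=/root/mount/dataset \
OUT_DIR=/root/mount/output \
MODEL=bert-base-multilingual-cased \
GPU=0 \
TRAIN_FILE_NAME=train.hi.qa.jsonl \
PREDICT_FILE_NAME=dev.hi.qa.jsonl \
bash train.sh

The second block of parameters, beginning with MODEL_PATH, configures prediction.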
- MODEL_PATH : the absolute path to the model file
- TASK : chaii_hi / chaii_ta
- DATA_DIR : the absolute directory path containing the dataset files (no suffix will be added for chaii tasks)
- PREDICTIONS_DIR : the absolute directory path to where the predictions need to be written (no suffix will be added for chaii tasks)
- MODEL : the pretrained model to use
  - default : bert-base-multilingual-cased
- MODEL_TYPE : the model architecture type
  - default : bert
- GPU : the GPU device number to use
  - default : 0
- PREDICT_FILE_NAME : the name of the file containing the prediction data (inside DATA_DIR)
  - default : for TASK == chaii_hi, dev.hi.qa.jsonl; for TASK == chaii_ta, dev.ta.qa.jsonl
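A matching sketch for prediction, under the same assumptions; predict.sh and the MODEL_PATH value are placeholders.

# Hypothetical prediction invocation inside the container.
# predict.sh and the MODEL_PATH value below are placeholders.
TASK=chaii_hi \
DATA_DIR=/root/mount/dataset \
PREDICTIONS_DIR=/root/mount/predictions \
MODEL=bert-base-multilingual-cased \
MODEL_TYPE=bert \
MODEL_PATH=/root/mount/output/pytorch_model.bin \
GPU=0 \
PREDICT_FILE_NAME=dev.hi.qa.jsonl \
bash predict.sh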