End-to-end multitask CNN architecture for object instance segmentation, human pose detection and multi-person tracking, based on Facebook AI's Detectron system. This framework was developed as part of Orestis Zambounis' master's thesis *Multitask CNN Architecture for Online 3D Human Pose Estimation and Multi-person Tracking*.
Links: Presentation, Report
In the following we provide the most important command-line snippets for training, validation and inference, as a starting point for familiarizing yourself with our extension of the original Detectron framework. Please refer to Detectron's original README for setup instructions and more.
The MOT17 Benchmark dataset (or a symlink to it) has to be placed under `detectron/datasets/data`.
Next, we need to convert the MOT ground-truth annotations to a COCO-compatible format:

```
python tools/convert_mot_to_coco.py --dataset-dir path/to/MOT17/dataset
```
If we want to use the proposals provided by the benchmark, we need to convert those to a COCO-compatible format as well:

```
python tools/convert_mot_detetections_to_proposals.py --dataset-dir path/to/MOT17/dataset
```
In order to run evaluations in Python with the provided MOT devkit, we need to install the MATLAB Engine API for Python. Follow the official instructions at www.mathworks.com/help/matlab/matlab_external/install-the-matlab-engine-for-python.html.
We can use TensorBoard to visualize the training progress. For this we need to install c2board; follow the official instructions at github.com/endernewton/c2board.
Set both `MODEL.FASTER_RCNN` and `MODEL.RPN_ONLY` to `True` if training exclusively for tracking. This will disable class and bounding-box regression training. We can further disable the RPN and Fast R-CNN losses by setting `RPN.LOSS_ON` and `FAST_RCNN.LOSS_ON` to `False`.
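For reference, here is a minimal sketch of how these switches could look in the training configuration. The `RPN.LOSS_ON` and `FAST_RCNN.LOSS_ON` keys are additions of this fork; the nesting below follows Detectron's usual YAML layout and may need to be adapted:

```yaml
MODEL:
  FASTER_RCNN: True  # keep the Faster R-CNN wiring in place
  RPN_ONLY: True     # but train exclusively for tracking
RPN:
  LOSS_ON: False     # optionally drop the RPN losses as well
FAST_RCNN:
  LOSS_ON: False     # optionally drop the Fast R-CNN losses as well
```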
Finally, execute the training with:

```
python tools/train_net.py --cfg path/to/cfg.yaml
```
Inspect the training progress with TensorBoard:

```
tensorboard --logdir outputs
```
We can further download the TensorBoard visualizations as a CSV file and plot them with matplotlib:

```
python visualize_tensorboard.py --file path/to/file.csv
```
Run validation on all the test sequences specified in `TEST.DATASETS` to calculate MOT metrics on all saved models from a specific configuration. The following command evaluates the sequences specified in `TEST.DATASETS` by interfacing with the MOT devkit in MATLAB. When using custom proposals, we have to set `MODEL.FASTER_RCNN` to `False` in the configuration file.

```
python tools/test_tracking.py --cfg path/to/cfg.yaml proposals eval
```
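For illustration, the corresponding configuration entries might look like the sketch below. The dataset names are assumptions following the `mot17_train_frcnn_XX` naming used in `scripts/save_tracking_blobs.sh`; substitute your own validation split:

```yaml
MODEL:
  FASTER_RCNN: False  # required when evaluating with custom proposals
TEST:
  # Hypothetical validation sequences; adapt to your split.
  DATASETS: ('mot17_train_frcnn_02', 'mot17_train_frcnn_04')
```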
The validation results are written to the evaluation directory (`outputs/test/...`) and can be visualized using:

```
python tools/visualize_mot_val.py --eval-dir path/to/evaluation/directory
```
Set `MODEL.FASTER_RCNN` to `False` if using custom proposals. Set `MODEL.RPN_ONLY` to `False`.
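A minimal sketch of these two inference settings in the configuration file:

```yaml
MODEL:
  FASTER_RCNN: False  # only needed when using custom proposals
  RPN_ONLY: False
```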
Simple image sequence inference (visualized results in `outputs/infer_track_sequence`):

```
python tools/infer_track_sequence.py --wts path/to/weights.pkl --cfg path/to/cfg.yaml --im-dir path/to/image/sequence show-track
```
Isolated image pair object associations (visualized results in `outputs/infer_track_pairs`):

```
python tools/infer_track_pairs.py --wts path/to/weights.pkl --cfg path/to/cfg.yaml --im-dir path/to/image/sequence show-track
```
Merging weights from multiple files for multi-task inference with a sibling backbone:

```
python tools/infer_track_sequence.py --wts path/to/weights/tracking.pkl path/to/weights/kps.pkl --cfg path/to/multitask-cfg.yaml --preffixes "" sib --im-dir path/to/image/sequence show-track
```
*Figure: Example of a sibling backbone for human pose keypoint estimation. Additions to the original Mask R-CNN architecture are highlighted in blue. The dotted box / arrow indicates features from the previous frame.*
As we are using custom proposals, we need to set `MODEL.FASTER_RCNN` to `False` in the configuration file. The following command infers submission results for all sequences specified in `TEST.DATASETS` and stores the detection files under `outputs/MOT17/`:

```
python tools/test_tracking.py --cfg path/to/cfg.yaml --model model_iterX.pkl proposals
```
The following examples use the Princeton Tracking Benchmark (tracking.cs.princeton.edu/dataset.html) for 3D human pose inference. We define the path to the desired sequence as `PRINCETON_SEQ`.
Uncomment line 238 in `detectron/utils/tracking.py` to enable the custom image sequence sorting required by the Princeton Tracking Benchmark.
Run inference on the image sequence:

```
python tools/infer_track_sequence.py --wts path/to/weights/tracking.pkl path/to/weights/kps.pkl --cfg path/to/multitask-cfg.yaml --preffixes "" sib --im-dir "${PRINCETON_SEQ}/rgb" --n-colors 2 --output-dir ${PRINCETON_SEQ}/dets --output-file "${PRINCETON_SEQ}/detections.pkl" all-dets show-track
```
Map the keypoints to the depth image and transform them to world coordinates (saves results to `kps_3d.npy`):

```
python tools/3D_inference/vis_rgbd.py --datadir "${PRINCETON_SEQ}" --dataset princeton --mode 1 --k-size 1 auto-play record-kps no-plot
```
Filter the keypoints with a median or Gaussian filter:

```
python2 tools/3D_inference/filter_kps.py --kps-3d "${PRINCETON_SEQ}/kps_3d.npy" --output-dir "${PRINCETON_SEQ}" --filter-var 5
```
Visualize using the filtered keypoints:

```
python tools/3D_inference/vis_rgbd.py --datadir "${PRINCETON_SEQ}" --dataset princeton --mode 1 --kps-3d "${PRINCETON_SEQ}/kps_3d.npy" auto-play
```
Simple network visualization:

```
python tools/visualize_net.py --cfg path/to/cfg.yaml
```
Include blob shapes:

```
python tools/visualize_net.py --cfg path/to/cfg.yaml --model path/to/weights.pkl shapes
```
Save selected blobs to storage to possibly speed up training. Please refer to the following issue first before considering using this script: github.com/facebookresearch/Detectron/issues/808. Adapted from `scripts/save_tracking_blobs.sh`:
```
for seq in "02" "04" "05" "09" "10" "11" "13"; do
  python tools/save_blobs.py --wts path/to/weights.pkl --cfg path/to/cfg.yaml --blobs [blob-list] --output-dir path/to/output/${seq}/ --dataset mot17_train_frcnn_${seq}
done
```
Test tracking-specific operations, losses and outputs with:

```
python detectron/tests/${test}.py
```