Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos
We use the PyTorch framework.
- Python version: 3.7.0
- PyTorch version: 1.4.0
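To confirm that an environment matches the versions listed above, a quick check along these lines may help. This is a minimal sketch, not part of the repository: the helper names `matches_version` and `report` are hypothetical, and the comparison ignores a local build suffix, so a version string such as `1.4.0+cu100` is treated as a match.

```python
import sys

EXPECTED_PYTHON = (3, 7)   # major.minor from the list above
EXPECTED_TORCH = "1.4.0"   # from the list above


def matches_version(found, expected):
    """Hypothetical helper: compare versions, ignoring a local build suffix like '+cu100'."""
    return str(found).split("+")[0] == expected


def report():
    """Return a list of mismatch messages (empty if everything matches)."""
    problems = []
    if sys.version_info[:2] != EXPECTED_PYTHON:
        problems.append("Python %d.%d found, 3.7 expected" % sys.version_info[:2])
    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
    else:
        if not matches_version(torch.__version__, EXPECTED_TORCH):
            problems.append(
                "PyTorch %s found, %s expected" % (torch.__version__, EXPECTED_TORCH)
            )
    return problems


if __name__ == "__main__":
    for line in report() or ["Environment matches the listed versions."]:
        print(line)
```

Other versions may also work; the check only reports differences from the versions listed here.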
Clone the repository:
git clone https://github.com/zmzhang2000/MIGCN.git
cd MIGCN
- Download the preprocessed annotations and features of Charades-STA with I3D features.
- Save them in data/charades.
- Download the preprocessed annotations of ActivityNet.
- Download the C3D features of ActivityNet.
- Process the C3D features according to process_activitynet_c3d() in data/preprocess/preprocess.py.
- Save them in data/activitynet.
- Download the checkpoints of Charades-STA and ActivityNet.
- Save them in checkpoints.
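Once the downloads are in place, the directories named above can be checked before launching a run. The following is a minimal sketch, assuming it is run from the repository root. The helper name `missing_dirs` is hypothetical, and it only verifies that each directory exists and is non-empty, not that the files inside are the expected ones.

```python
from pathlib import Path

# Directories named in the steps above, relative to the repository root.
EXPECTED_DIRS = ["data/charades", "data/activitynet", "checkpoints"]


def missing_dirs(root=".", expected=EXPECTED_DIRS):
    """Hypothetical helper: return the expected directories that are absent or empty."""
    root = Path(root)
    missing = []
    for rel in expected:
        path = root / rel
        # A directory counts as present only if it holds at least one entry.
        if not path.is_dir() or not any(path.iterdir()):
            missing.append(rel)
    return missing


if __name__ == "__main__":
    missing = missing_dirs()
    if missing:
        print("Missing or empty:", ", ".join(missing))
    else:
        print("All expected directories are populated.")
```

If only one dataset is needed, pass a shorter list as `expected`.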
We provide the generation procedure of all MIGCN data.
- The raw data is listed in data/raw_data/download.sh.
- The preprocessing code is in data/preprocess.
Train MIGCN on Charades-STA with I3D features:
python main.py --dataset charades --feature i3d
Train MIGCN on ActivityNet with C3D features:
python main.py --dataset activitynet --feature c3d
Test MIGCN on Charades-STA with I3D features:
python main.py --dataset charades --feature i3d --test --model_load_path checkpoints/$MODEL_CHECKPOINT
Test MIGCN on ActivityNet with C3D features:
python main.py --dataset activitynet --feature c3d --test --model_load_path checkpoints/$MODEL_CHECKPOINT
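The four commands above differ only in the dataset/feature pair and the optional test flags, so they can be assembled programmatically when scripting several runs. A minimal sketch, assuming it is run from the repository root: the helper `build_command` is hypothetical, it uses only the flags shown above, and the checkpoint file name is left to the caller because it depends on which file was downloaded.

```python
import sys

# Dataset/feature pairs taken from the commands above.
FEATURES = {"charades": "i3d", "activitynet": "c3d"}


def build_command(dataset, checkpoint=None):
    """Hypothetical helper: argv for training, or for testing when `checkpoint` is given."""
    cmd = [sys.executable, "main.py", "--dataset", dataset, "--feature", FEATURES[dataset]]
    if checkpoint is not None:
        # Testing adds --test and the checkpoint path under checkpoints/.
        cmd += ["--test", "--model_load_path", "checkpoints/" + checkpoint]
    return cmd


if __name__ == "__main__":
    # Print the training command for each dataset instead of running it.
    for name in FEATURES:
        print(" ".join(build_command(name)))
```

The returned list can be passed to `subprocess.run(..., check=True)` to launch the run.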
List the other hyper-parameters with:
python main.py -h
Please cite the following paper if MIGCN is helpful for your research:
@ARTICLE{9547801,
author={Zhang, Zongmeng and Han, Xianjing and Song, Xuemeng and Yan, Yan and Nie, Liqiang},
journal={IEEE Transactions on Image Processing},
title={Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos},
year={2021},
volume={30},
number={},
pages={8265-8277},
doi={10.1109/TIP.2021.3113791}}