Data setup

Part-1: Directly using ActivityNet-SRL-QA and Charades-SRL-QA

Download the relevant dataset files and the vocabularies from Google Drive:

https://drive.google.com/file/d/1PqHYty4D71dakC1a95p4PbAr6WXYYCYq/view?usp=sharing

For ActivityNet (Anet) features:

File 1: https://drive.google.com/file/d/14GTjt3wuifK6GhaTsRYxMVXhBc4rREk2/view?usp=sharing

File 2: RGB-Motion features; see dwn_datasets.sh

For Charades SRL features:

File 1: https://drive.google.com/file/d/1bJgzyVue8GhbaImjiVhfNgHk3O9d8Apc/view?usp=sharing
After unzipping all files you should have the following structure:

data
├── anet
│   ├── anet_detection_vg_fc6_feat_100rois_resized.h5
│   ├── anet_detection_vg_fc6_feat_gt5_rois.h5
│   ├── csv_dir
│   ├── fc6_feat_5rois
│   └── rgb_motion_1d
├── anet_vocab
│   ├── aphr_nosw_vocab_10k.pkl
│   ├── aphr_nosw_vocab.pkl
│   ├── aphr_vocab_10k.pkl
│   ├── aphr_vocab.pkl
│   ├── arg_vocab.pkl
│   ├── qarg_vocab.pkl
│   ├── qword_vocab_old.pkl
│   ├── qword_vocab.pkl
│   └── word_vocab.pkl
├── asrl_vidqap_files
│   ├── anet_captions_all_splits.json
│   ├── anet_ent_cls_bbox_trainval.json
│   ├── trn_srlqa_trim.json
│   └── val_srlqa_trim.json
├── charades
│   ├── s3d_charades_fps1_frm_list.csv
│   └── s3d_charades_rgb_mix5c_ftdir
├── charades_vidqap_files
│   ├── trn_srlqa_trim.json
│   └── val_srlqa_trim.json
└── charades_vocab
    ├── arg_vocab.pkl
    ├── ch_aphr_nosw_vocab_10k.pkl
    ├── ch_aphr_nosw_vocab.pkl
    ├── ch_aphr_vocab_10k.pkl
    ├── ch_aphr_vocab.pkl
    ├── qarg_vocab.pkl
    ├── qword_vocab.pkl
    └── word_vocab.pkl
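
A quick way to confirm that the unzipped files landed in the right place is to check a few of the paths from the tree above. The following is a minimal sketch, not part of the repository: the function name `missing_paths` and the `EXPECTED` list are hypothetical, and the list covers only a subset of the tree, so extend it as needed.

```python
from pathlib import Path

# Relative paths copied from the directory tree above (a subset, not exhaustive).
EXPECTED = [
    "anet/anet_detection_vg_fc6_feat_100rois_resized.h5",
    "anet/anet_detection_vg_fc6_feat_gt5_rois.h5",
    "anet/rgb_motion_1d",
    "anet_vocab/word_vocab.pkl",
    "asrl_vidqap_files/trn_srlqa_trim.json",
    "asrl_vidqap_files/val_srlqa_trim.json",
    "charades/s3d_charades_fps1_frm_list.csv",
    "charades/s3d_charades_rgb_mix5c_ftdir",
    "charades_vidqap_files/trn_srlqa_trim.json",
    "charades_vidqap_files/val_srlqa_trim.json",
    "charades_vocab/word_vocab.pkl",
]


def missing_paths(data_root, expected=EXPECTED):
    """Return the expected relative paths that do not exist under data_root."""
    root = Path(data_root)
    return [rel for rel in expected if not (root / rel).exists()]


if __name__ == "__main__":
    # Assumes the script is run from the directory that contains `data`.
    missing = missing_paths("data")
    if missing:
        print("Missing entries:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All checked entries are present.")
```

An empty result only means the listed entries exist; it does not verify file contents or the entries left out of `EXPECTED`.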

Annotation Structure

The main question/answer files are located at {asrl/charades}_vidqap_files/{trn/val}_srlqa_trim.json
Each file contains many keys, but only a few are needed; the rest are kept for legacy reasons. In particular:
1. qsrl_ind: Identifier of the sample within its set.
2. vt_split: The split the sample belongs to (train/valid/test).
3. vid_seg: Video segment of the sample.
4. qa_pair: Dict with the keys question, answer, and question_type. During evaluation, the question_type value is replaced with the prediction and the ground-truth answers.
5. cs_qsrl_inds: List of retrieved contrastive samples. For the validation and test sets, use the first one.
6. For the other keys, refer to https://github.com/TheShadow29/vognet-pytorch/tree/master/data#annotation-file-structure
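
The keys listed above could be read from a single annotation record roughly as follows. This is a sketch under stated assumptions: it treats one record as a plain Python dict carrying those keys, the helper names `first_contrastive_index` and `describe_record` are hypothetical, and the sample values are made up for illustration. How records are arranged at the top level of the JSON file is not described here, so inspect the loaded object before iterating over it.

```python
def first_contrastive_index(record):
    """Return the first entry of cs_qsrl_inds, or None if the list is absent or empty."""
    candidates = record.get("cs_qsrl_inds") or []
    return candidates[0] if candidates else None


def describe_record(record):
    """Collect the fields described above from one record (assumed to be a dict)."""
    qa_pair = record["qa_pair"]
    return {
        "qsrl_ind": record["qsrl_ind"],
        "vt_split": record["vt_split"],
        "vid_seg": record["vid_seg"],
        "question": qa_pair["question"],
        "answer": qa_pair["answer"],
        "question_type": qa_pair["question_type"],
        # For validation and test sets, the first contrastive sample is the one to use.
        "contrastive_qsrl_ind": first_contrastive_index(record),
    }


# Hypothetical record; the values are illustrative and not taken from the dataset.
example_record = {
    "qsrl_ind": 0,
    "vt_split": "valid",
    "vid_seg": "example_video_segment",
    "qa_pair": {
        "question": "Who is holding the cup?",
        "answer": "the man",
        "question_type": "ARG0",
    },
    "cs_qsrl_inds": [7, 3],
}

summary = describe_record(example_record)
```

Real records would come from `json.load` on one of the `*_srlqa_trim.json` files; the exact values stored under vt_split and question_type may differ from the placeholders used here.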

Part-3: Create the dataset from scratch, likely for a new captioning dataset

<WIP: Needs heavy code cleanup and refactoring>
