Instance-aware fine-grained micro-action recognition

This is the official solution of 3rd Place for The MAC 2024 Grand Challenge Track 1

1. Data preparation

Downlaod Track1 dataset in data or other floder as following:

-data
  |-annotations
  |-train
  |-val
  |-test

You need to prepare the virtual environment as follows:

conda create --name mmaction
conda activate mmaction
pip install -r requirements.txt

1.1 Balance data

Those videos less than 100 are copied several times to mitigate the severe data imbalance, which is a commonly used trick.

python data/data_aug.py

The distribution of train set are visualized before/after data balance.

1.2 Instance Detection

Pretrained people detector is employed to locate the interviewed person. Specifically, the bounding box of person instance is detected by YOLOv8m. All the bounding boxes are saved in pickle format.

python data/predict_video.py

The bounding box are visualized as follows:

2. Training

Before training your model following our tutorial, please make sure that the path of instance is right in line 69 of mmaction/datasets/video_dataset.py.
Make sure the path of dataset in config file.

bash tools/dist_train.sh configs/recognition/videomaev2/vit-small-p16_videomaev2-vit-g-dist-k710-pre_16x4x1_ma52.py

3. Testing Inference

With corresponding configuration, you can inference model forward and save the results in pickle format.

bash tools/dist_test.sh configs/recognition/videomaev2/vit-small-p16_videomaev2-vit-g-dist-k710-pre_16x4x1_ma52.py work_dirs/vit-small-p16_videomaev2-vit-g-dist-k710-pre_16x4x1_ma52/best_acc_f1_mean_7053.pth 4 --dump work_dirs/submit/videomaev2_f1mean7053.pickle

Notice that we provide several models as well as corresponding pickle files for final submission in the work_dirs folder.

4. Submission

If you would like to evaluate a single model, you can run generate.py to obtain submission.py

python generate.py

More important, model performance could be improved further by weighting the different predictions via Model ensembling, a simple yet useful trick.

python ensemble.py

5. Tips

Micro-action recognition is a fine-grained classification task. Both of coarse-grained and fine-grained metrics are taken into consideration individually. Therefore, we merge predictions with higher body and action metrics respectively and manually.

For example, we copy the coarse-grained column in work_dirs/submit/submission7017/prediction.csv and the fine-grained column in work_dirs/submit/submission7218/prediction.csv for our final submission.

Name		Name	Last commit message	Last commit date
Latest commit History 2,081 Commits
.circleci		.circleci
.github		.github
configs		configs
data		data
demo		demo
docker		docker
docs		docs
figs		figs
mmaction		mmaction
projects		projects
requirements		requirements
resources		resources
tests		tests
tools		tools
.gitignore		.gitignore
.owners.yml		.owners.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
.readthedocs.yml		.readthedocs.yml
CITATION.cff		CITATION.cff
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
README_zh-CN.md		README_zh-CN.md
dataset-index.yml		dataset-index.yml
model-index.yml		model-index.yml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instance-aware fine-grained micro-action recognition

1. Data preparation

1.1 Balance data

1.2 Instance Detection

2. Training

3. Testing Inference

4. Submission

5. Tips

About

Releases

Packages

Languages

License

ilovepose/instance-aware-fine-grained-micro-action-recognition

Folders and files

Latest commit

History

Repository files navigation

Instance-aware fine-grained micro-action recognition

1. Data preparation

1.1 Balance data

1.2 Instance Detection

2. Training

3. Testing Inference

4. Submission

5. Tips

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages