Skip to content

Latest commit

 

History

History
241 lines (210 loc) · 10.3 KB

README_en.md

File metadata and controls

241 lines (210 loc) · 10.3 KB

简体中文 | English

PaddleVideo

Update:

  • add skeleton-base action recognition model CTR-GCN.
  • add lite action recognition model MoViNet.
  • add temporal segment model MS-TCN, ASRF.

​ 💖 Welcome to scan the code and join the group discussion 💖

  • Scan the QR code below with your Wechat and reply "video", you can access to official technical exchange group. Look forward to your participation.

Introduction

python version paddle version

PaddleVideo is a toolset for video tasks prepared for the industry and academia. This repository provides examples and best practice guildelines for exploring deep learning algorithm in the scene of video area.


Model and Applications

Model zoo

Action recognition method
PP-TSM (PP series) PP-TSN (PP series) PP-TimeSformer (PP series) TSN (2D’) TSM (2D')
SlowFast (3D’) TimeSformer (Transformer') VideoSwin (Transformer’) AttentionLSTM (RNN') MoViNet (Lite‘)
Skeleton based action recognition
ST-GCN (Custom’) AGCN (Adaptive') CTR-GCN (GCN‘)
Sequence action detection method
BMN (One-stage')
temporal segment
MS-TCN ASRF
Spatio-temporal motion detection method
SlowFast+Fast R-CNN
Multimodal
ActBERT (Learning') T2VLAD (Retrieval')
Video target segmentation
CFBI (Semi') MA-Net (Supervised')
Monocular depth estimation
ADDS (Unsupervised‘)

Dataset

Action Recognition
Kinetics-400 (Homepage) (CVPR'2017) UCF101 (Homepage) (CRCV-IR-12-01) ActivityNet (Homepage) (CVPR'2015) YouTube-8M (Homepage) (CVPR'2017)
Action Localization
ActivityNet (Homepage) (CVPR'2015)
Spatio-Temporal Action Detection
AVA (Homepage) (CVPR'2018)
Skeleton-based Action Recognition
NTURGB+D (Homepage) (IEEE CS'2016) FSD (Homepage)
Depth Estimation
Oxford-RobotCar (Homepage) (IJRR'2017)
Text-Video Retrieval
MSR-VTT (Homepage) (CVPR'2016)
Text-Video Pretrained Model
HowTo100M (Homepage) (ICCV'2019)

Applications

Applications Descriptions
FootballAction Football action detection solution
BasketballAction Basketball action detection solution
TableTennis Table tennis action recognition solution
FigureSkating Figure skating action recognition solution
VideoTag 3000-category large-scale video classification solution
MultimodalVideoTag Multimodal video classification solution
VideoQualityAssessment Video quality assessment solution
PP-Care 3DMRI medical image recognition solution
EIVideo Interactive video segmentation tool
Anti-UAV UAV detection solution
AbnormalActionDetection Abnormal action detection solution
PP-Human Action recognition solution for pedestrian analysis scene

Documentation tutorial

Competition

License

PaddleVideo is released under the Apache 2.0 license.

Thanks