PaddleVideo

Update:

add skeleton-base action recognition model CTR-GCN.
add lite action recognition model MoViNet.
add temporal segment model MS-TCN, ASRF.

💖 Welcome to scan the code and join the group discussion 💖

Scan the QR code below with your Wechat and reply "video", you can access to official technical exchange group. Look forward to your participation.

Introduction

PaddleVideo is a toolset for video tasks prepared for the industry and academia. This repository provides examples and best practice guildelines for exploring deep learning algorithm in the scene of video area.

Model and Applications

Model zoo

Please refer to Installation guide and Usage doc before using the model zoo.

Action recognition method
PP-TSM (PP series)	PP-TSN (PP series)	PP-TimeSformer (PP series)	TSN (2D’)	TSM (2D')
SlowFast (3D’)	TimeSformer (Transformer')	VideoSwin (Transformer’)	AttentionLSTM (RNN')	MoViNet (Lite‘)
Skeleton based action recognition
ST-GCN (Custom’)	AGCN (Adaptive')	CTR-GCN (GCN‘)
Sequence action detection method
BMN (One-stage')
temporal segment
MS-TCN	ASRF
Spatio-temporal motion detection method
SlowFast+Fast R-CNN
Multimodal
ActBERT (Learning')	T2VLAD (Retrieval')
Video target segmentation
CFBI (Semi')	MA-Net (Supervised')
Monocular depth estimation
ADDS (Unsupervised‘)

Dataset

Action Recognition
Kinetics-400 (Homepage) (CVPR'2017)	UCF101 (Homepage) (CRCV-IR-12-01)	ActivityNet (Homepage) (CVPR'2015)	YouTube-8M (Homepage) (CVPR'2017)
Action Localization
ActivityNet (Homepage) (CVPR'2015)
Spatio-Temporal Action Detection
AVA (Homepage) (CVPR'2018)
Skeleton-based Action Recognition
NTURGB+D (Homepage) (IEEE CS'2016)	FSD (Homepage)
Depth Estimation
Oxford-RobotCar (Homepage) (IJRR'2017)
Text-Video Retrieval
MSR-VTT (Homepage) (CVPR'2016)
Text-Video Pretrained Model
HowTo100M (Homepage) (ICCV'2019)

Applications

Applications	Descriptions
FootballAction	Football action detection solution
BasketballAction	Basketball action detection solution
TableTennis	Table tennis action recognition solution
FigureSkating	Figure skating action recognition solution
VideoTag	3000-category large-scale video classification solution
MultimodalVideoTag	Multimodal video classification solution
VideoQualityAssessment	Video quality assessment solution
PP-Care	3DMRI medical image recognition solution
EIVideo	Interactive video segmentation tool
Anti-UAV	UAV detection solution
AbnormalActionDetection	Abnormal action detection solution
PP-Human	Action recognition solution for pedestrian analysis scene

Documentation tutorial

AI-Studio Tutorial
Contribute code

Competition

Figure skating action recoginition using skeleton based on PaddlePaddle, AI Studio projects, video course
Table tennis action proposal localization based on PaddlePaddle
CCKS 2021: Knowledge Augmented Video Semantic Understanding

License

PaddleVideo is released under the Apache 2.0 license.

Thanks

Many thanks to mohui37、zephyr-fun、voipchina for contributing the code for prediction.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_en.md

README_en.md

PaddleVideo

Update:

Introduction

Model and Applications

Model zoo

Dataset

Applications

Documentation tutorial

Competition

License

Thanks

Files

README_en.md

Latest commit

History

README_en.md

File metadata and controls

PaddleVideo

Update:

Introduction

Model and Applications

Model zoo

Dataset

Applications

Documentation tutorial

Competition

License

Thanks