GESTURE RECOGNITION TO CONTROL TV - PROBLEM STATEMENT
The objective of this project is to develop a feature for a smart TV that can recognise five different gestures performed by the user, allowing the TV to be controlled without a remote.
The gestures are continuously monitored by a webcam mounted on the TV. Each gesture corresponds to a specific command (a code sketch of this mapping follows the list):
- Thumbs up: Increase the volume
- Thumbs down: Decrease the volume
- Left swipe: 'Jump' backwards 10 seconds
- Right swipe: 'Jump' forward 10 seconds
- Stop: Pause the movie
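For illustration, this gesture-to-command mapping could be expressed as a simple lookup table. A minimal sketch: the gesture names come from the list above, while the command identifiers are hypothetical, since the problem statement does not prescribe an implementation.

```python
# Hypothetical mapping from recognised gestures to playback commands.
# Gesture names follow the problem statement; command identifiers are
# illustrative assumptions, not part of the source.
GESTURE_COMMANDS = {
    'Thumbs up': 'volume_up',
    'Thumbs down': 'volume_down',
    'Left swipe': 'seek_back_10s',
    'Right swipe': 'seek_forward_10s',
    'Stop': 'pause',
}
```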
The training data consists of a few hundred videos, each categorised into one of the five classes. Each video (typically 2-3 seconds long) is divided into a sequence of 30 frames (images). These videos were recorded by various people performing one of the five gestures in front of a webcam, similar to the one the smart TV will use.
All images in a particular video subfolder have the same dimensions, but different videos may have different dimensions. Specifically, videos come in one of two resolutions, 360x360 or 120x160 (depending on the webcam used to record them). Hence, some pre-processing is required to standardise the videos.
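As a rough illustration of that standardisation step, the sketch below loads a video's frames and resizes each one to a common size. It assumes OpenCV and NumPy are available, that frames are stored as image files inside the video's subfolder, and uses an illustrative 120x120 target; none of these specifics are prescribed by the problem statement.

```python
# Minimal pre-processing sketch (target size and helper name are illustrative).
import os
import cv2
import numpy as np

TARGET_SIZE = (120, 120)  # (width, height); handles both 360x360 and 120x160

def load_video(subfolder: str) -> np.ndarray:
    """Load the 30 frames of one video, resized and scaled to [0, 1]."""
    frames = []
    for name in sorted(os.listdir(subfolder)):
        img = cv2.imread(os.path.join(subfolder, name))  # BGR, H x W x 3
        img = cv2.resize(img, TARGET_SIZE)
        frames.append(img.astype(np.float32) / 255.0)
    return np.stack(frames)  # shape: (30, 120, 120, 3)
```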
Each row of the CSV file represents one video and contains three pieces of information: the name of the subfolder containing the video's 30 images, the name of the gesture, and the numeric label (0-4) of the video.
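Reading this index might look like the following sketch. Both the file name `train.csv` and the semicolon delimiter are assumptions; adjust them to match the actual files.

```python
# Sketch of parsing the CSV index described above (file name and delimiter
# are assumptions, not confirmed by the problem statement).
import csv

with open('train.csv') as f:
    for subfolder, gesture, label in csv.reader(f, delimiter=';'):
        label = int(label)  # numeric class label, 0-4
        # subfolder names the directory holding that video's 30 frames
```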
In this project, I have built models using both 3D CNNs and RNN-based architectures (LSTM and GRU). The two architectures have been compared across several models to obtain the most reliable gesture predictions.
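To make the comparison concrete, here is a minimal Keras sketch of the two families, assuming TensorFlow/Keras and the (30, 120, 120, 3) input shape from the pre-processing sketch above. All layer sizes are illustrative placeholders, not the tuned final models.

```python
# Illustrative sketches of the two architecture families (layer sizes are
# assumptions; the tuned models in this project may differ).
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Conv3D, MaxPooling3D, Flatten, Dense,
                                     TimeDistributed, Conv2D, MaxPooling2D, GRU)

def conv3d_model():
    # 3D CNN: convolves over space and time jointly.
    model = Sequential([
        Conv3D(16, (3, 3, 3), activation='relu', input_shape=(30, 120, 120, 3)),
        MaxPooling3D((2, 2, 2)),
        Conv3D(32, (3, 3, 3), activation='relu'),
        MaxPooling3D((2, 2, 2)),
        Flatten(),
        Dense(64, activation='relu'),
        Dense(5, activation='softmax'),  # five gesture classes
    ])
    model.compile(optimizer='adam', loss='categorical_crossentropy',
                  metrics=['accuracy'])
    return model

def cnn_gru_model():
    # CNN + RNN: per-frame 2D CNN features fed to a GRU over the 30 frames.
    # An LSTM layer could be substituted for the GRU in the same position.
    model = Sequential([
        TimeDistributed(Conv2D(16, (3, 3), activation='relu'),
                        input_shape=(30, 120, 120, 3)),
        TimeDistributed(MaxPooling2D((2, 2))),
        TimeDistributed(Conv2D(32, (3, 3), activation='relu')),
        TimeDistributed(MaxPooling2D((2, 2))),
        TimeDistributed(Flatten()),
        GRU(64),
        Dense(5, activation='softmax'),
    ])
    model.compile(optimizer='adam', loss='categorical_crossentropy',
                  metrics=['accuracy'])
    return model
```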
The training and validation data can be found at https://drive.google.com/drive/folders/1QJ5RHBr22ovdHNDENtDpWg59OxsjzrOY?usp=sharing