
VisioPhysioENet: Multimodal Engagement Detection

Implementation for the paper submitted to ICASSP 2025:
VisioPhysioENet: A Multimodal System for Detecting Learner Engagement through Visual and rPPG Signals
Alakhsimar Singh*, Nischay Verma*, Kanav Goyal*, Amritpal Singh, Puneet Kumar, Xiaobai Li

This repository contains the code and resources for VisioPhysioENet, a novel multimodal system for detecting learner engagement using visual and physiological signals. This project combines features extracted from video data, such as facial landmarks and eye metrics, with physiological signals derived from remote photoplethysmography (rPPG) to detect various levels of engagement in learners.

Table of Contents

  • Introduction
  • Features
  • Dataset
  • Installation
  • Usage
  • Results

Introduction

VisioPhysioENet uses a multimodal fusion approach to detect engagement by combining visual and physiological data. It extracts visual features such as facial landmarks, eye metrics, and head pose using the Dlib and OpenCV libraries, and physiological signals such as heart rate via remote photoplethysmography (rPPG). These data streams are processed by machine learning models to classify engagement into discrete levels.
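As a rough illustration of the visual branch, the snippet below computes the Eye Aspect Ratio (EAR) from Dlib's standard 68-point facial landmarks, with OpenCV for frame handling. It is a minimal sketch that assumes the common shape_predictor_68_face_landmarks.dat model file; the actual extraction pipeline lives in the notebooks described under Usage.

    # Minimal EAR sketch with Dlib/OpenCV (illustrative; not the repository's exact code).
    import cv2
    import dlib
    import numpy as np

    detector = dlib.get_frontal_face_detector()
    predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")  # assumed model path

    def eye_aspect_ratio(eye):
        # eye: six (x, y) landmarks; EAR = (|p2-p6| + |p3-p5|) / (2 * |p1-p4|)
        a = np.linalg.norm(eye[1] - eye[5])
        b = np.linalg.norm(eye[2] - eye[4])
        c = np.linalg.norm(eye[0] - eye[3])
        return (a + b) / (2.0 * c)

    def frame_ear(frame):
        # Returns the mean EAR of both eyes for the first detected face, or None.
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        faces = detector(gray)
        if not faces:
            return None
        shape = predictor(gray, faces[0])
        pts = np.array([[shape.part(i).x, shape.part(i).y] for i in range(68)])
        left, right = pts[42:48], pts[36:42]  # standard 68-point eye indices
        return (eye_aspect_ratio(left) + eye_aspect_ratio(right)) / 2.0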

The system was evaluated on the DAiSEE dataset, where it achieved an accuracy of 63.09% and outperformed several state-of-the-art visual-only methods for learner engagement detection.

Features

  • Multimodal Approach: Combines both visual and physiological signals for more accurate engagement detection.
  • Lightweight Architecture: Designed for fast processing with minimal computational load.
  • Advanced Feature Extraction: Uses Dlib and OpenCV for extracting visual features such as Eye Aspect Ratio (EAR), gaze direction, and head position.
  • rPPG-Based Physiological Feature Extraction: Remote photoplethysmography is used to non-invasively monitor heart rate and other physiological signals (a minimal extraction sketch follows this list).
  • Multimodal Fusion: Implements both early and late fusion strategies for integrating visual and physiological data.
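For the rPPG item above, a common approximation is to average the green channel over the face region in each frame, band-pass the resulting signal to the plausible heart-rate band, and read off the dominant frequency. The SciPy-based sketch below illustrates that idea under those assumptions; it is not the exact recovery method used in the paper.

    # Illustrative rPPG heart-rate estimate from per-frame green-channel means (assumed pipeline).
    import numpy as np
    from scipy.signal import butter, filtfilt

    def estimate_bpm(green_means, fps):
        # green_means: 1-D array of mean green values over the face ROI, one per frame.
        x = np.asarray(green_means, dtype=float)
        x = x - x.mean()
        # Band-pass to ~0.7-4.0 Hz (42-240 BPM), the physiologically plausible range.
        b, a = butter(3, [0.7, 4.0], btype="band", fs=fps)
        x = filtfilt(b, a, x)
        # The dominant frequency in the band gives the heart-rate estimate.
        freqs = np.fft.rfftfreq(len(x), d=1.0 / fps)
        power = np.abs(np.fft.rfft(x)) ** 2
        band = (freqs >= 0.7) & (freqs <= 4.0)
        return freqs[band][np.argmax(power[band])] * 60.0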

Dataset

The model is trained and validated using the DAiSEE dataset, which includes videos of individuals annotated with engagement levels (not engaged, barely engaged, engaged, and highly engaged).
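If you need the four levels as integer classes, the mapping below is one plausible encoding (an assumption for illustration; check the labels in the extracted .csv files for the encoding actually used).

    # Assumed DAiSEE engagement label encoding -- verify against the extracted .csv files.
    ENGAGEMENT_LEVELS = {
        0: "not engaged",
        1: "barely engaged",
        2: "engaged",
        3: "highly engaged",
    }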

Download the Dataset

You can download the DAiSEE dataset from its official website.

Installation

  1. Clone the repository:

    git clone https://github.com/MIntelligence-Group/VisioPhysioENet.git
    cd VisioPhysioENet
  2. Install the required Python libraries:

    pip install -r requirements.txt

Usage

To use the code in this repository, follow these steps to perform feature extraction and apply the machine learning models:

1. Feature Extraction

  1. Open the extraction_DAiSEE_.ipynb notebook.
  2. Run all cells by selecting Kernel -> Restart & Run All.
    • This will extract features from the dataset and generate .csv files containing the processed data. These .csv files are used as input for the machine learning models in the next steps (see the loading sketch below).
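For orientation, here is a minimal pandas sketch of loading the generated features; the file and column names are placeholders rather than the notebook's actual output names.

    # Hypothetical loading of the extracted features; "daisee_features.csv" is a placeholder name.
    import pandas as pd

    features = pd.read_csv("daisee_features.csv")
    print(features.shape)
    print(features.columns.tolist())  # e.g. EAR, gaze, head pose, rPPG statistics, engagement label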

2. Applying Machine Learning Models

After extracting the features, the next step is to apply the machine learning models for engagement detection:

  1. Open the fusion_early.ipynb notebook.

    • This notebook implements early fusion techniques using the extracted features. Run all cells to train and evaluate the model.
  2. Open the fusion_late.ipynb notebook.

    • This notebook implements late fusion techniques. Again, run all cells to train and evaluate the model using the extracted features.

By following these steps, you'll be able to extract features and apply the machine learning models for engagement detection. A minimal sketch contrasting the two fusion strategies follows.
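The sketch below contrasts the two strategies on placeholder data: early fusion concatenates visual and physiological feature columns before training a single classifier, while a simple late-fusion variant averages per-class probabilities from one classifier per modality. It is an illustration of the general idea using scikit-learn, not the notebooks' actual models or features.

    # Illustrative early vs. late fusion on placeholder data (not the notebooks' code).
    import numpy as np
    from sklearn.ensemble import RandomForestClassifier

    rng = np.random.default_rng(0)
    X_vis = rng.normal(size=(200, 10))    # placeholder visual features
    X_phys = rng.normal(size=(200, 4))    # placeholder rPPG features
    y = rng.integers(0, 4, size=200)      # four engagement levels

    # Early fusion: concatenate modalities, train one classifier.
    early = RandomForestClassifier(random_state=0).fit(np.hstack([X_vis, X_phys]), y)

    # Late fusion: one classifier per modality, average their class probabilities.
    clf_vis = RandomForestClassifier(random_state=0).fit(X_vis, y)
    clf_phys = RandomForestClassifier(random_state=0).fit(X_phys, y)
    probs = (clf_vis.predict_proba(X_vis) + clf_phys.predict_proba(X_phys)) / 2.0
    late_pred = clf_vis.classes_[probs.argmax(axis=1)]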

Results

The proposed VisioPhysioENet system was evaluated on the DAiSEE dataset, achieving an accuracy of 63.09%. The table below compares it against representative visual-only baselines (V = visual, P = physiological):

    Model                         Modality   Accuracy
    InceptionNet (Frame Level)    V          47.10%
    ResNet + Temporal Conv Net    V          53.70%
    3D CNN + TCN                  V          59.97%
    Proposed VisioPhysioENet      V + P      63.09%
