Comparison of common Machine Learning Algorithms considering a sport activities Classification Task

Group project for Machine Learning class of Galli Davide and Papousek Jiri.

Introduction to the Project

Even if Deep Learning is continuosly growing in terms of importance, classical Machine Learning technquis still represent a cornerstone for people who approach computer science for the first time. There are loads of different techniques, some of them are specifically devoted to tackle given types of problems; some others are instead multi-porpose.

One popular task is the so called classification problem, where the model’s output is a category with a semantic meaning. A classification model attempts to draw some conclusion from observed values. Different methods can be implemented to tackle this problem.

Our focus is a brief comparative study over four different machine learning supervised techniques:

Logistic Regression
K Nearest Neighbors
Decision Trees
Multilayer Perceptron

Brief Dataset Description

The choosen dataset comprises motion sensor data of 19 daily and sports activities performed by 8 subjects (between 20 and 30 years old) in their own style for 5 minutes. Five Xsens MTx units are used on the torso, arms, and legs. This kind of sensor embeds a gyroscope, an accelerometer and a magnetometer.

Since activities are perfomed as the subject desires, there might be inter-subject variations in the speeds and amplitudes of some activities.

The activities are performed at the Bilkent University Sports Hall, in the Electrical and Electronics Engineering Building, and in a flat outdoor area on campus. Sensor units are calibrated to acquire data at 25 Hz sampling frequency. The 5-min signals are divided into 5-sec segments so that 480(=60x8) signal segments are obtained for each activity.

The 19 activities are:

A1 → sitting

A2 → standing

A3 → lying on back

A4 → laying on right side

A5 → ascending stairs

A6 → descending stairs

A7 → standing in an elevator still

A8 → moving around in an elevator

A9 → walking in a parking lot

A10 → walking on a treadmill with a speed of 4 km/h on flat

A11 → walking on a treadmill with a speed of 4 km/h on a 15 deg inclined ositions

A12 → running on a treadmill with a speed of 8 km/h

A13 → exercising on a stepper

A14 → exercising on a cross trainer

A15 → cycling on an exercise bike in horizontal position

A16 → cycling on an exercise bike in vertical positions

A17 → rowing

A18 → jumping

A19 → playing basketball

File Structure

19 activities (a) (in the order given above) 8 subjects (p) 60 segments (s) 5 units:

on torso (T)
right arm (RA)
left arm (LA)
right leg (RL)
left leg (LL)

There are 9 sensors on each unit (x,y,z accelerometers, x,y,z gyroscopes, x,y,z magnetometers).

Folders a01, a02, ..., a19 contain data recorded from the 19 activities. For each activity, the subfolders p1, p2, ..., p8 contain data from each of the 8 subjects. In each subfolder, there are 60 text files s01, s02, ..., s60, one for each segment. In each text file, there are 5 units x 9 sensors = 45 columns and 5 sec x 25 Hz = 125 rows. Each column contains the 125 samples of data acquired from one of the sensors of one of the units over a period of 5 sec. Each row contains data acquired from all of the 45 sensor axes at a particular sampling instant separated by commas.

Columns 1-45 correspond to: T_xacc, T_yacc, T_zacc, T_xgyro, ..., T_ymag, T_zmag, RA_xacc, RA_yacc, RA_zacc, RA_xgyro, ..., RA_ymag, RA_zmag, LA_xacc, LA_yacc, LA_zacc, LA_xgyro, ..., LA_ymag, LA_zmag, RL_xacc, RL_yacc, RL_zacc, RL_xgyro, ..., RL_ymag, RL_zmag, LL_xacc, LL_yacc, LL_zacc, LL_xgyro, ..., LL_ymag, LL_zmag.

Therefore, columns 1-9 correspond to the sensors in unit 1 (T), columns 10-18 correspond to the sensors in unit 2 (RA), columns 19-27 correspond to the sensors in unit 3 (LA), columns 28-36 correspond to the sensors in unit 4 (RL), columns 37-45 correspond to the sensors in unit 5 (LL).

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
models		models
sports_dataset		sports_dataset
Galli_Papoušek-ML-Project Abstract.pdf		Galli_Papoušek-ML-Project Abstract.pdf
README.md		README.md
create_dataset.py		create_dataset.py
requirements.txt		requirements.txt
sport_activity_classification.ipynb		sport_activity_classification.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Comparison of common Machine Learning Algorithms considering a sport activities Classification Task

Introduction to the Project

Brief Dataset Description

File Structure

References

About

Releases

Packages

Contributors 2

Languages

d-galli/SportActivitiesClassification

Folders and files

Latest commit

History

Repository files navigation

Comparison of common Machine Learning Algorithms considering a sport activities Classification Task

Introduction to the Project

Brief Dataset Description

File Structure

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages