Finding the best approach to classify music genre in GTZAN database

Author: Tianhao Liu (20205784)

Email: [email protected] | [email protected]

In this project, I tried two approachs to classify the genres of songs. One is to build deeplearning model from scratch, using CNN, MLP, LSTM, and GRU. Beside this, I also tried to fine tune exsiting pre-trained models (vgg19, resNext, and SqueezeNet)

Train From Scratch

Quick Start

Download the GTZAN dataset to this project folder
Unzip the downloaded database, the name of the downloaded database should be Data
If the database is in somewhere else or with other names, please write the path to config file hparams.yaml. You need to modify the audio_dir and image_dir in it.
main.ipynb is all you need.
After training, a trained model will be saved in folder checkpoints, and its loss record will be saved in folder logs.

Model	Status
MLP	✅
CNN	✅
LSTM	✅
GRU	✅

Fine Tune

install all the packages declared in requirements.txt
I have fine tuned 3 pre-trained models, that are: resnext, vgg19, and squeezenet. You can find them in finetune-resnext.ipynb, finetune-vgg.ipynb, and finetune-squeezenet.ipynb respectively.

Model	Status
VGG	✅
ResNext	✅
SqueezeNet	✅

Techniques in Training

Feature	Status
Early Stop	✅
Batch training	✅
Checkpoint	✅
Log (loss)	✅
Train-test-split	✅
Evaluation	✅

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Finding the best approach to classify music genre in GTZAN database

Train From Scratch

Quick Start

Fine Tune

Techniques in Training

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
checkpoints		checkpoints
logs		logs
models		models
tex		tex
utils		utils
README.md		README.md
clear.sh		clear.sh
finetune-resnext.ipynb		finetune-resnext.ipynb
finetune-squeezenet.ipynb		finetune-squeezenet.ipynb
finetune-vgg.ipynb		finetune-vgg.ipynb
hparams.yaml		hparams.yaml
main.ipynb		main.ipynb
requirements.txt		requirements.txt
train.py		train.py

THLiu55/GTZAN

Folders and files

Latest commit

History

Repository files navigation

Finding the best approach to classify music genre in GTZAN database

Train From Scratch

Quick Start

Fine Tune

Techniques in Training

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages