1st place solution https://www.automl.ai/competitions/2
git clone https://github.com/u1234x1234/AutoSpeech2020.git
class Model
from the file model.py
satisfies the interface, so you could just run the following line in order to reproduce the results:
python run_local_test.py -dataset_dir=path_to_dataset -code_dir=path_to_model_file
Please refer to the official detailed description of the evaluation protocol.
The basic ideas:
- Get a decent result as fast as possible with the simplest models, then train more elaborate ones.
- There's no single model that performs the best on the all datasets, so try different ones.
Used models:
- Logistic Regression on features extracted with pretrained model trained on the speaker recognition task
- Logistic Regression on features extracted with pretrained model trained on the music genre classification task
- Logistic Regression on the combination of features from 1. 2.
- AutoSpeech 2019 1st place solution by Hazza Cheng
- AutoSpeech 2019 3rd place Solution by Kon
- Fine-tuning of pretrained network from 1.
Then average the results from multiple models with geometric mean.
- AutoSpeech 2019 1st place solution by Hazza Cheng, GPL Licence, Code modifications
- AutoSpeech 2019 3rd place Solution by Kon, MIT Licence
- Speaker recognition pretrained model by ClovaAI, MIT Licence
- Musicnn by Jordi Pons, ISC Licence
Each subdirectory from 3rdparty
contains subcomponents with separate copyright notices and license terms.
Please refer to Licence provided in specific subdirectory.