Data Collection
Pre requisite
1. To get original training data: youtube-dl
2. To create labelled training data - Audacity
Configure audacity to point to 64 bit ffmpeg
Mac: https://manual.audacityteam.org/man/installing_ffmpeg_for_mac.html
Windows: https://manual.audacityteam.org/man/installing_ffmpeg_for_windows.html
- Download youtube videos as audio format - m4a https://medium.com/@jsaluja/download-audio-m4a-with-youtube-dl-957791ff6f7a
- Open downloaded file in Audacity, zoom in on waveform
- Play, select clip, export as .wav
- Upload to KhalisMoolMantarClips https://drive.google.com/drive/folders/1wuorveZ00eHpns1vcMk7_CESM2TX5xBo
Training
Pre requisite
1. To convert media files to wav - ffmpeg
- Download KhalisMoolMantarClips from https://drive.google.com/drive/folders/1wuorveZ00eHpns1vcMk7_CESM2TX5xBo
- mkdir dataset
- unzip KhalisMoolMantarClips.zip dataset
- python prepare_dataset
- python train
Inference
- mkdir test
- Create samples in test folder
- python audio_classification_service