This project separates audio from noise using k-means clustering, based on the idea from the paper: L. Marchegiani and I. Posner, "Leveraging the urban soundscape: Auditory perception for smart vehicles," 2017 IEEE International Conference on Robotics and Automation (ICRA), 2017, pp. 6547-6554, doi: 10.1109/ICRA.2017.7989774 [1].
Dependencies:
- numpy
- cv2
- IPython
- matplotlib
- skimage
- sklearn
- pandas
- argparse
- librosa
- scipy
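The imports above map to the following PyPI package names (`argparse` ships with the Python standard library, so it needs no install); a typical setup, assuming `pip` is available:

```shell
# PyPI names for the imported modules: cv2 -> opencv-python,
# skimage -> scikit-image, sklearn -> scikit-learn
pip install numpy opencv-python ipython matplotlib scikit-image scikit-learn pandas librosa scipy
```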
The system consists of two main scripts:
`spec.py`
- This script creates grayscale mel-spectrogram images, with a band-pass filter applied, from audio files listed in a CSV file that you have already created.
`k-means.py`
- This script clusters the grayscale mel-spectrogram images into several clusters, then creates a binary mask based on a threshold that you choose.
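The clustering-and-masking step can be sketched as follows, assuming k-means on pixel intensities with scikit-learn; the helper name, `n_clusters`, and `threshold` values are hypothetical defaults, since the scripts let you choose these yourself:

```python
import numpy as np
from sklearn.cluster import KMeans

def melspec_to_mask(gray_img, n_clusters=3, threshold=128):
    """Cluster pixel intensities with k-means, then binarise by a threshold.

    n_clusters and threshold are assumed values for illustration only.
    """
    h, w = gray_img.shape
    # Treat each pixel's intensity as a 1-D sample for k-means
    pixels = gray_img.reshape(-1, 1).astype(np.float64)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(pixels)

    # Replace each pixel with the centre of the cluster it was assigned to
    quantised = km.cluster_centers_[km.labels_].reshape(h, w)

    # Keep clusters whose centre exceeds the chosen threshold
    return (quantised >= threshold).astype(np.uint8)

# Demo on random grayscale data; the mask contains only 0s and 1s
rng = np.random.default_rng(0)
demo = rng.integers(0, 256, size=(64, 64)).astype(np.uint8)
mask = melspec_to_mask(demo)
print(mask.shape)
```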
- Run `spec.py` first to create the grayscale mel-spectrogram images, then run `k-means.py` to create the binary mask.
- Individual implementations using Jupyter Notebook are also provided in `note_masking.ipynb` and `note_filter.ipynb`.
You can check the spectrogram and k-means results in the image files in the directories Ground_truth/mel_spec and Ground_truth/mask, respectively.