Clean up your spoken audio recordings using VAD (Voice Activity Detection)

nickopicz/BandPass_API


SpecialSound

An audio refinement program that uses speech-to-text and voice activity detection to remove background noise.

1. Install Dependencies.

First, make sure Python 3 is installed on your system and that pip is available on the command line.
After creating a dedicated Python workspace, run the following commands:

pip install SpeechRecognition
pip install soundfile
pip install webrtcvad

The wave and contextlib modules ship with Python's standard library, so they need no separate install.
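
To confirm the install step worked, a short check like the one below can report which modules Python cannot find. This is a minimal sketch and not part of the repository; `missing_modules` and the `REQUIRED` list are hypothetical names, and the list assumes the import names `speech_recognition`, `soundfile`, and `webrtcvad`.

```python
import importlib.util

# Import names (not pip package names) this project is assumed to need.
REQUIRED = ["speech_recognition", "soundfile", "webrtcvad", "wave", "contextlib"]


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All dependencies found.")
```

Any name it reports as missing can then be installed with pip before moving on.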


cd flask
py Main.py

2. Make a Recording.

You can create your own recording within this directory by running the file "make_recording.py". This makes it easy to test, since it generates an uncorrupted WAV file.
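
Before cleaning a recording, it can help to confirm that the WAV file is in a format the voice activity detector accepts. The sketch below is an assumption about what such a check might look like, not code from this repository; `describe_wav` and `vad_compatible` are hypothetical helper names. It relies on webrtcvad's requirement of 16-bit mono PCM audio sampled at 8000, 16000, 32000, or 48000 Hz.

```python
import wave

# Sample rates accepted by webrtcvad.
SUPPORTED_RATES = (8000, 16000, 32000, 48000)


def describe_wav(path):
    """Read the header of a WAV file and return its basic properties."""
    with wave.open(path, "rb") as wf:
        return {
            "channels": wf.getnchannels(),
            "sample_width": wf.getsampwidth(),  # bytes per sample
            "sample_rate": wf.getframerate(),
            "seconds": wf.getnframes() / wf.getframerate(),
        }


def vad_compatible(info):
    """True if the properties describe 16-bit mono audio at a supported rate."""
    return (
        info["channels"] == 1
        and info["sample_width"] == 2
        and info["sample_rate"] in SUPPORTED_RATES
    )
```

For example, `vad_compatible(describe_wav("my_recording.wav"))` would return False for a stereo or 44100 Hz file, where "my_recording.wav" is a placeholder path.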

3. Clean up Audio.

You can either make a short recording within the directory or import an audio file from your system. To clean your own audio, change the "path1" variable in "Refine.py" to the path of your audio file. The algorithm uses webrtcvad, a Python interface to the voice activity detector that Google developed for WebRTC. The detector has many uses, but in this application it only detects when audio contains speech. The cleaned version of your original file is written to "new_file.wav".
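
The general idea of VAD-based cleaning can be sketched as follows: split the audio into short frames, ask the detector whether each frame contains speech, and keep only the frames that do. This is an illustrative sketch under stated assumptions, not the code in "Refine.py"; the helper names (`read_wave`, `write_wave`, `split_frames`, `keep_speech`, `clean_file`) and the aggressiveness value of 2 are hypothetical. It assumes 16-bit mono input at a sample rate webrtcvad supports.

```python
import contextlib
import wave


def read_wave(path):
    """Return (pcm_bytes, sample_rate); assumes a 16-bit mono WAV file."""
    with contextlib.closing(wave.open(path, "rb")) as wf:
        return wf.readframes(wf.getnframes()), wf.getframerate()


def write_wave(path, pcm, sample_rate):
    """Write 16-bit mono PCM bytes to a WAV file."""
    with contextlib.closing(wave.open(path, "wb")) as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)
        wf.setframerate(sample_rate)
        wf.writeframes(pcm)


def split_frames(pcm, sample_rate, frame_ms=30):
    """Split PCM bytes into whole frames of frame_ms milliseconds each."""
    size = int(sample_rate * frame_ms / 1000) * 2  # 2 bytes per sample
    return [pcm[i:i + size] for i in range(0, len(pcm) - size + 1, size)]


def keep_speech(pcm, sample_rate, is_speech, frame_ms=30):
    """Join the frames for which is_speech(frame, sample_rate) is true."""
    frames = split_frames(pcm, sample_rate, frame_ms)
    return b"".join(f for f in frames if is_speech(f, sample_rate))


def clean_file(src, dst="new_file.wav", aggressiveness=2):
    """Drop non-speech frames from src and write the result to dst."""
    import webrtcvad  # third-party; imported here so the helpers work without it

    vad = webrtcvad.Vad(aggressiveness)  # 0 (least) to 3 (most aggressive)
    pcm, rate = read_wave(src)
    write_wave(dst, keep_speech(pcm, rate, vad.is_speech), rate)
```

Passing the detector in as a plain callable keeps the framing logic testable without webrtcvad installed. webrtcvad only accepts frames of 10, 20, or 30 ms, which is why `frame_ms` defaults to 30.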

Next, run the program by executing the following commands from the repository root. The "py" command is the Windows Python launcher; on macOS or Linux, use "python3" instead.


cd flask
py Main.py
