This README describes the Noise Suppression demo application.
On startup, the demo application reads command-line parameters and loads a network to the Inference Engine. It also reads a user-provided sound file containing a mix of speech and noise and feeds it into the network in small sequential patches. The output of the network is also a sequence of audio patches, containing clean speech. The patches are collected together and saved into the output audio file.
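The patch-by-patch flow described above can be sketched as follows. This is a minimal illustration, not the demo's actual code: `process_in_patches`, `identity`, and the patch size of 4 are hypothetical names and values, the identity function stands in for the network, and any state the real network may carry between patches is not modelled.

```python
from typing import Callable, List, Sequence


def process_in_patches(
    samples: Sequence[float],
    patch_size: int,
    denoise_patch: Callable[[List[float]], List[float]],
) -> List[float]:
    """Feed `samples` to `denoise_patch` in sequential fixed-size patches."""
    output: List[float] = []
    for start in range(0, len(samples), patch_size):
        patch = list(samples[start:start + patch_size])
        valid = len(patch)
        # Zero-pad the last patch so every call sees the same input length.
        patch += [0.0] * (patch_size - valid)
        cleaned_patch = denoise_patch(patch)
        # Drop the padding again before collecting the result.
        output.extend(cleaned_patch[:valid])
    return output


def identity(patch: List[float]) -> List[float]:
    """Placeholder for the network (assumption): returns the patch unchanged."""
    return list(patch)


noisy = [0.1 * i for i in range(10)]
cleaned = process_in_patches(noisy, patch_size=4, denoise_patch=identity)
```

With the identity placeholder, `cleaned` has the same length and values as `noisy`; a real model would be plugged in through `denoise_patch`.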
The list of models supported by the demo is in the <omz_dir>/demos/noise_suppression_demo/python/models.lst file.
This file can be used as a parameter for Model Downloader and Converter to download and, if necessary, convert models to OpenVINO Inference Engine format (*.xml + *.bin).
An example of using the Model Downloader:
omz_downloader --list models.lst
An example of using the Model Converter:
omz_converter --list models.lst
Supported models:
- noise-suppression-denseunet-ll-0001
- noise-suppression-poconetlike-0001
NOTE: Refer to the tables Intel's Pre-Trained Models Device Support and Public Pre-Trained Models Device Support for details on model inference support on different devices.
Running the application with an empty list of options yields an error message.
For example, to do inference on a CPU, run the following command:
./noise_suppression_demo \
-m <path_to_model>/noise-suppression-poconetlike-0001.xml \
-d CPU \
-i noisy.wav \
-o cleaned.wav
The application reads the audio wave from the INPUT WAV file. The INPUT file must be mono and have a 16 kHz sampling rate. The MODEL argument is also required.
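The input requirements can be checked before running the demo. The sketch below uses Python's standard `wave` module; `check_input_wav` is a hypothetical helper and is not part of the demo.

```python
import wave

REQUIRED_RATE_HZ = 16000  # 16 kHz sampling rate, as required by the demo
REQUIRED_CHANNELS = 1     # mono


def check_input_wav(path: str) -> None:
    """Raise ValueError if the WAV file is not 16 kHz mono."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    if rate != REQUIRED_RATE_HZ:
        raise ValueError(f"expected {REQUIRED_RATE_HZ} Hz, got {rate} Hz")
    if channels != REQUIRED_CHANNELS:
        raise ValueError(f"expected mono input, got {channels} channels")
```

For example, `check_input_wav("noisy.wav")` returns silently for a valid file and raises `ValueError` otherwise.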
The application writes the cleaned audio to the OUTPUT WAV file. The demo reports:
- Latency: total processing time required to process input data (from reading the data to displaying the results).
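A latency figure of this kind can be measured by timing the whole processing step. The following is a rough sketch only: `timed` is a hypothetical helper, `sum` stands in for the read-process-write pipeline, and the printed format is an assumption, not the demo's actual output.

```python
import time


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed time in milliseconds)."""
    start = time.perf_counter()
    result = fn(*args)
    latency_ms = (time.perf_counter() - start) * 1e3
    return result, latency_ms


# `sum` is only a stand-in for the real processing pipeline (assumption).
result, latency_ms = timed(sum, range(1000))
print(f"Latency: {latency_ms:.1f} ms")
```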