This is a fully quantized version (asymmetrical int8) of the MicroNet Medium model developed by Arm, from the MicroNets paper. This model is trained on the 'Google Speech Commands' dataset.
Apache-2.0
The class labels associated with this model can be downloaded by running the script get_class_labels.sh
.
Platform |
Optimized |
Cortex-A |
✖️ |
Cortex-M |
✔️ |
Mali GPU |
✔️ |
Ethos U |
✔️ |
- ✔️ - Will run on this platform.
- ✖️ - Will not run on this platform.
Dataset: Google Speech Commands Test Set
Metric |
Value |
Accuracy |
0.958 |
Optimization |
Value |
Quantization |
INT8 |
Input Node Name |
Shape |
Description |
input |
(1, 49, 10, 1) |
A one second audio clip, converted to a 2D MFCC computed from a speech frame of length 40ms and stride 20ms. |
Output Node Name |
Shape |
Description |
Identity |
(1, 12) |
The probability on 12 keywords. |