You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to use your segmentation-3.0.onnx for syllable segmentaion(mandarin pinyin),
for sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01_test_wavs_4.wav,
it can correctly segment the first 7 syllables, but the last 5 syllables are not so accurate,
could you help me to improve it?
I guess that since the segmentation-3.0.onnx can segment syllables(mandarin pinyin), maybe a very small model (even a simple SVM, support vector machine) can recognize all the 1300 mono-syllable pinyins after segmentation-3.0.onnx preprocessing. While the segmentation-3.0.onnx is only 5.8MB, amazing small!
The text was updated successfully, but these errors were encountered:
diyism
changed the title
[Need Help] segment syllables (mandarin pinyin) for syllable-level voice recognition
[Need Help] segment syllables (mandarin pinyin) for syllable-level voice recognition or syllable-level VAD
Nov 3, 2024
I'm trying to use your segmentation-3.0.onnx for syllable segmentaion(mandarin pinyin),
for sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01_test_wavs_4.wav,
it can correctly segment the first 7 syllables, but the last 5 syllables are not so accurate,
could you help me to improve it?
https://github.com/diyism/pyannote_segment_syllables
ref: k2-fsa/sherpa-onnx#920
I guess that since the segmentation-3.0.onnx can segment syllables(mandarin pinyin), maybe a very small model (even a simple SVM, support vector machine) can recognize all the 1300 mono-syllable pinyins after segmentation-3.0.onnx preprocessing. While the segmentation-3.0.onnx is only 5.8MB, amazing small!
The text was updated successfully, but these errors were encountered: