New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

语音识别 FBank 和 MFCC 特征 | 拾荒志 #75

Open

murphypei opened this issue Nov 24, 2021 · 0 comments

Labels

085dfc5b7cd7504303ca1324d4b638be Gitalk

Owner

murphypei commented Nov 24, 2021

https://murphypei.github.io/blog/2021/10/asr-fbank-mfcc.html

ASR 流程中，音频特征提取是第一步。和 CV 不同，图片本身的 RGB 数值就是一种特征，但是音频本身无法被用于分析，常常是将一段音频提取 FBank 和 MFCC 特征然后作为模型的输入。

murphypei added Gitalk 085dfc5b7cd7504303ca1324d4b638be labels

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment