Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

语音识别 FBank 和 MFCC 特征 | 拾荒志 #75

Open
murphypei opened this issue Nov 24, 2021 · 0 comments
Open

语音识别 FBank 和 MFCC 特征 | 拾荒志 #75

murphypei opened this issue Nov 24, 2021 · 0 comments

Comments

@murphypei
Copy link
Owner

https://murphypei.github.io/blog/2021/10/asr-fbank-mfcc.html

ASR 流程中,音频特征提取是第一步。和 CV 不同,图片本身的 RGB 数值就是一种特征,但是音频本身无法被用于分析,常常是将一段音频提取 FBank 和 MFCC 特征然后作为模型的输入。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant