Speech to text is a speech recognition software that enables the recognition and translation of spoken language into text through computational linguistics
- Sampling
- Quantization
- data collection
- data preprocessing : FFT or DFT, STFT
- feature extraction : MFCC, Decibel
- modelling : Sequence Machine Learning : HMM, Seq2seq, Batch padding
https://m.blog.naver.com/PostView.naver?isHttpsRedirect=true&blogId=sooftware&logNo=221661644808
https://haythamfayek.com/2016/04/21/speech-processing-for-machine-learning.html