Skip to content

A-big-fish-in-a-small-pond/STT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Machine Learning - Speech to Text (Speech recognition) in nodejs


Related pages

https://towardsdatascience.com/audio-deep-learning-made-simple-automatic-speech-recognition-asr-how-it-works-716cfce4c706


Description

Speech to text is a speech recognition software that enables the recognition and translation of spoken language into text through computational linguistics


development process


  1. Sampling

  2. Quantization

  3. data collection

  4. data preprocessing : FFT or DFT, STFT

  5. feature extraction : MFCC, Decibel

  6. modelling : Sequence Machine Learning : HMM, Seq2seq, Batch padding





Refernce

https://everyday-tech.tistory.com/entry/2%ED%83%84-%EB%94%A5%EB%9F%AC%EB%8B%9D-%EC%9D%8C%EC%84%B1-%EC%9D%B8%EC%8B%9D

https://www.nti-audio.com/ko/%EC%A7%80-%EC%9B%90/know-how/%EB%B9%A0%EB%A5%B8-fourier-%EB%B3%80%ED%99%98-fft

https://m.blog.naver.com/PostView.naver?isHttpsRedirect=true&blogId=sooftware&logNo=221661644808

https://haythamfayek.com/2016/04/21/speech-processing-for-machine-learning.html

About

Speech to Text in fft algorithm from nodejs

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published