S2T-Speech-To-Text

An end to end full stack application to transform speech to text and perform further downstream tasks like Text Similarity, Text Summarization, Named Entity Recognition

Things to set up: Install Docker

Install ElasticSearch

    Link for Tutorial: https://dylancastillo.co/elasticsearch-python/#what%E2%80%99s-elasticsearch

Install FastAPI

    Link for Tutorial: https://fastapi.tiangolo.com/tutorial/first-steps/

Install Whisper AI

    Tutorial Link: 

        https://medium.com/the-research-nest/how-to-setup-openais-whisper-model-on-windows-10-11-df001d5a350b

    Install ffmpeg

        Download the zip file from https://github.com/BtbN/FFmpeg-Builds/releases

        Extract and put the link to bin folder into System varibales path variable
   
   Install whisper-timestamped

        https://github.com/linto-ai/whisper-timestamped

   Install PyAnnote

        https://github.com/pyannote/pyannote-audio

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
Backend		Backend
FrontEnd		FrontEnd
audioData		audioData
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

S2T-Speech-To-Text

About

Releases

Packages

Contributors 4

Languages

UMass-Rescue/S2T-Speech-To-Text

Folders and files

Latest commit

History

Repository files navigation

S2T-Speech-To-Text

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages