-
-
Notifications
You must be signed in to change notification settings - Fork 139
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] Real-Time Speech-to-Text with Whisper Model 🎙️ #14
Comments
Is there any progress on this? |
I don't have enough time to develop this. That's why this feature is not currently being developed. I will add it later. |
I'm trying to use tiny models for on-device (mobile) near-real-time speech-to-text. |
You can use distiller-whisper models. |
Thanks for the reply, from what I have read, I understood that the idea should be
Is there any better approach. It seems very inefficient to run inference on 30s chunks for a real-time streaming transcription. Am I missing something? |
I don't understand If you want to use the whisper model in real-time, you can look at this library. |
Hi, |
Can you add Real-Time feature? |
If we are doing on a streaming service then it take buffer time of 6 sec. |
I will research this issue. |
Hi @Nishant-Kumar-2002 , can you review this code? This feature adds subtitles to the video. |
Ok will check that. |
Code looks good to me. |
May I ask if the main idea is to implement real-time whisper to transcribe speech through the microphone or transcribe audio files in real-time to a file, so that we do not have to wait until the end of the audio? |
I want to do the first thing you said. |
Any update on this? Would love real-time transcription of speech through a mic |
I would like to add this new feature. |
@Nishant-Kumar-2002 Wonderful news 👍🏻 I'm waiting for the pull request. |
I started coding. I will add this support over the weekend. |
Awesome looking forward to this! Thanks for your amazing work! |
thanks for the awesome work @kadirnar! any eta on this? |
There are a few problems with real-time. It may take a while to figure it out. I'm developing for Autopipeline. |
Implement real-time functionality for the Whisper model, enabling it to transcribe speech into text as the user speaks🎤
The text was updated successfully, but these errors were encountered: