Welcome to RealtimeTTS Discussions! #1
Replies: 5 comments 19 replies
-
Thank you for the library. Any possibility of supporting the OpenAI TTS engine? |
Beta Was this translation helpful? Give feedback.
-
What will be the best approach to stream the audio that is being generated through an API endpoint to consume in a web application? Any example code to share as you did for RealtimeSTT? |
Beta Was this translation helpful? Give feedback.
-
Tried this but it is hard to do. Audio chunks come in different formats. Handling of this in RealtimeTTS is already a pain. For example, OpenAI delivers clean MP3 chunks, which we can play with ffmpeg or convert to wav easy. Elevenlabs yields Mp3 chunks which depend on the last. RealtimeTTS plays them with MPV, but conversion to Wav is really hard for this stream. |
Beta Was this translation helpful? Give feedback.
-
Hello, In my solution I commented out the below lines in the _on_audio_chunk function, because audio2face accepts the audio data directly as a numpy array, and not as bytes. Could you please confirm if this is the right location to do that? if format == pyaudio.paFloat32: |
Beta Was this translation helpful? Give feedback.
-
Hello, |
Beta Was this translation helpful? Give feedback.
-
Welcome!
This GitHub Discussion space is designated for users and contributors of the project.
Purpose:
Discuss applications and use-cases.
Collaborate on feature development and improvements.
Offer and seek technical support.
Discuss industry best practices related to low-latency voice technology.
Participation Guidelines:
Ask questions related to the project's application and troubleshooting.
Share ideas for new features or improvements.
Upvote helpful discussions and answers.
Maintain a respectful and open-minded tone.
Beta Was this translation helpful? Give feedback.
All reactions