Add support for local TTS #98

FoxCunning · 2024-11-09T15:03:32Z

This change implements support for local/system text-to-speech using the Web Speech API.

It reads up to 500 characters of the generated text. If the prompt has been modified, it also reads the portion that was changed.
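
For readers unfamiliar with the Web Speech API, here is a minimal sketch of the idea. It is not the code in this PR: `MAX_TTS_CHARS`, `clampForSpeech` and `speak` are hypothetical names, and only `speechSynthesis.speak()` and `SpeechSynthesisUtterance` come from the API itself.

```javascript
// Hypothetical constant mirroring the 500-character cap described above.
const MAX_TTS_CHARS = 500;

// Cut the text to at most `limit` characters (UTF-16 code units).
function clampForSpeech(text, limit = MAX_TTS_CHARS) {
  return text.length > limit ? text.slice(0, limit) : text;
}

// Speak `text` through a SpeechSynthesis-like object.
// `synth` and `Utterance` are parameters so the sketch also runs outside a
// browser; in a page they default to the real Web Speech API objects.
function speak(
  text,
  synth = globalThis.speechSynthesis,
  Utterance = globalThis.SpeechSynthesisUtterance
) {
  const clipped = clampForSpeech(text).trim();
  // Nothing to do without the API or without any text.
  if (!synth || !Utterance || clipped === "") return null;
  const utterance = new Utterance(clipped);
  synth.speak(utterance); // queues the utterance for narration
  return utterance;
}
```

In a browser that supports the API, `speak("Hello there")` would queue the sentence on the system voice.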

lmg-anon · 2024-11-30T18:16:11Z

This is interesting, but I don't think it should be a new collapsible group in the sidebar. Maybe it could open a new modal when clicking on a button here instead:

Or, even better, the configurations could be added to the Editor Preferences modal.

FoxCunning · 2024-12-06T20:17:52Z

The Editor Preferences modal sounds good and I'll see if I can move all the settings there.
There wouldn't be a button to stop the active playback on the main UI, though.

FoxCunning · 2025-01-01T19:32:19Z

Happy new year! 🎆🎉

I've now fully rewritten the TTS code.

The main change is in the App.predict method. Using "useEffect" hooks proved unreliable. On the bright side, I only had to add a few lines to that method.
Almost all of the TTS functionality is in separate methods, which I kept together starting at line 7350.
The settings UI is now in the Editor Preferences modal. Disabling TTS via checkbox makes all the other TTS-related elements disappear.
I've also added an option to not narrate user inputs.
window.TTS has been removed. The code now only uses React variables stored with useRef / usePersistentState inside the App component, so TTS settings should be saved in local storage.
The SVG I added is a simple "stop" button used to, well, stop the TTS while it's speaking. Since it's in the editor preferences, I also added a "hotkey" (CTRL+E) so it's possible to stop the narration without having to open the menu.
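
A rough sketch of how such a stop hotkey can be wired up, assuming a plain keydown listener rather than whatever the PR actually does. `isStopHotkey` and `makeStopHandler` are hypothetical names; `SpeechSynthesis.cancel()` is the real API call that removes all utterances from the queue.

```javascript
// True when the event matches the stop shortcut (Ctrl+E) described above.
function isStopHotkey(event) {
  return Boolean(event.ctrlKey) && String(event.key).toLowerCase() === "e";
}

// Builds a keydown handler that stops any queued or active narration.
// `synth` is a parameter so the sketch can be exercised outside a browser.
function makeStopHandler(synth = globalThis.speechSynthesis) {
  return function onKeyDown(event) {
    if (!synth || !isStopHotkey(event)) return false;
    event.preventDefault(); // some browsers bind Ctrl+E themselves
    synth.cancel();         // drops the current and all pending utterances
    return true;
  };
}

// In a page (assumption: a document-level listener is acceptable):
// document.addEventListener("keydown", makeStopHandler());
```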

Note: I chose to process text chunks as soon as they arrive so that the speech synthesis can start narrating as soon as a sentence is complete (e.g. when the AI generates a newline or another "stopping" token). That way, if a lot of text is being generated, and especially if the AI is slow, the user does not have to wait for the whole generation to finish before the narration starts.
Unterminated user inputs will be narrated (if the option is selected) together with the next generation, to form a complete sentence.
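
The buffering described above could look roughly like this. It is a simplified sketch, not the PR's implementation: `SentenceBuffer` is a hypothetical name, and the sentence boundary rule (a newline, or `.`, `!`, `?` followed by whitespace) is an assumption.

```javascript
// Accumulates streamed text and releases only complete sentences.
class SentenceBuffer {
  constructor() {
    this.pending = ""; // text received so far that is not yet narrated
  }

  // Append a chunk; return the text that is now ready to narrate ("" if none).
  push(chunk) {
    this.pending += chunk;
    // Assumed boundary: a newline, or . ! ? followed by whitespace.
    const terminator = /\n|[.!?]+(?=\s)/g;
    let cut = 0;
    let match;
    // Remember the end of the last boundary found in the buffer.
    while ((match = terminator.exec(this.pending)) !== null) {
      cut = match.index + match[0].length;
    }
    if (cut === 0) return "";
    const ready = this.pending.slice(0, cut);
    this.pending = this.pending.slice(cut);
    return ready.trim();
  }

  // Call when generation ends to narrate whatever is left.
  flush() {
    const rest = this.pending.trim();
    this.pending = "";
    return rest;
  }
}
```

With this shape, an unterminated user input pushed first simply stays in the buffer and comes out together with the generated continuation, once the combined text reaches a boundary.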

I've tested it quite a bit with llama.cpp and koboldcpp.
If there's anything else you think should be changed, let me know.

FoxCunning

Reviewed - See previous comment in pull request.

Add support for local TTS

868d348

FoxCunning marked this pull request as ready for review November 9, 2024 15:03

Merge branch 'lmg-anon:main' into feature/tts

6fb6f29

FoxCunning marked this pull request as draft December 14, 2024 14:41

Moved TTS options to Editor Preference modal

79d1464

lmg-anon requested changes Dec 15, 2024

mikupad.html: four review comments (outdated, resolved)

Full rewrite of TTS system

8b6a27a

FoxCunning closed this Jan 1, 2025

FoxCunning force-pushed the feature/tts branch from 8b6a27a to 7d9b7e3 Compare January 1, 2025 18:50

Fox Cunning added 3 commits January 1, 2025 18:54

Merge branch 'feature/tts' of https://github.com/FoxCunning/mikupad i…

ac65df2

…nto feature/tts

Manual merge

7e597b7

Add missing variables

45a0a17

FoxCunning reopened this Jan 1, 2025

FoxCunning marked this pull request as ready for review January 1, 2025 19:32

FoxCunning requested a review from lmg-anon January 1, 2025 19:32

FoxCunning commented Jan 4, 2025

FoxCunning and others added 2 commits January 6, 2025 14:09

Small UI changes, User Input Length option

a56ee25

Merge branch 'lmg-anon:main' into feature/tts

efbdb4c
