Speech Note 4.3.0
Linux Desktop
Changes:
- Accessibility
- Global keyboard shortcuts (X11 only)
- Support for Actions
- User Interface
- Desktop notifications
- Speech speed control in the main app window
- Opening files with Drag and Drop gesture
- Fix: Application did not use native widgets on some platforms
- Translator
- New model: English to Hungarian
- Speech to Text
- New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
- New engine: Faster Whisper
- New engine: April-ASR. Models for: English, French and Polish.
- Inserting text to any active window (X11 only)
- Copy decoded text directly to the clipboard
- Stop listening button
- Support for Opus audio codec in Transcribe a file
- More effective GPU acceleration for Whisper models (NVIDIA CUDA only)
- New smaller and quicker Whisper models for English: Distil-Whisper
- New version of Whisper Large model: Whisper Large-v3
- Fix: CUDA acceleration for Whisper models did not work on NVIDIA video cards with Maxwell architecture
- Text to Speech
- New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
- New engine: Mimic 3
- Reading text from the clipboard
- New Piper voices: Arabic, English, Hungarian, Polish, Czech, German, Ukrainian, Vietnamese, Serbian, French, Spanish, Nepali
- More steps in Speech speed option
- Diacritical marks restoration before speech synthesis for Arabic and Hebrew
- Support for GPU acceleration for Coqui models (NVIDIA CUDA only)
- Fix: Coqui Chinese MMS Hakka and MinNan voices were broken
- Fix: Exporting to audio file was not possible when text was very long
- Other
- Setting option to disable support for certain graphic cards
- Setting option Clear cache on close
- Cache compression (Opus format instead of raw audio)
- Detecting the availability of the optional features
Sailfish OS
Changes:
- Translator
- New model: English to Hungarian
- Speech to Text
- New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
- New engine: April-ASR. Models for: English, French and Polish.
- Stop listening button
- Support for Opus audio codec in Transcribe a file
- Text to Speech
- New Piper voices: Arabic, English, Hungarian, Polish, Czech, German, Ukrainian, Vietnamese, Serbian, French, Spanish, Nepali
- More steps in Speech speed option
- Diacritical marks restoration before speech synthesis for Arabic
- Fix: Exporting to audio file was not possible when text was very long
- Other
- Setting option Clear cache on close
- Cache compression (Opus format instead of raw audio)