Release Speech Note 4.3.0 · mkiol/dsnote

Linux Desktop

Changes:

Accessibility
- Global keyboard shortcuts (X11 only)
- Support for Actions
User Interface
- Desktop notifications
- Speech speed control in the main app window
- Opening files with Drag and Drop gesture
- Fix: Application did not use native widgets on some platforms
Translator
- New model: English to Hungarian
Speech to Text
- New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
- New engine: Faster Whisper
- New engine: April-ASR. Models for: English, French and Polish.
- Inserting text to any active window (X11 only)
- Copy decoded text directly to the clipboard
- Stop listening button
- Support for Opus audio codec in Transcribe a file
- More effective GPU acceleration for Whisper models (NVIDIA CUDA only)
- New smaller and quicker Whisper models for English: Distil-Whisper
- New version of Whisper Large model: Whisper Large-v3
- Fix: CUDA acceleration for Whisper models did not work on NVIDIA video cards with Maxwell architecture
Text to Speech
- New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
- New engine: Mimic 3
- Reading text from the clipboard
- New Piper voices: Arabic, English, Hungarian, Polish, Czech, German, Ukrainian, Vietnamese, Serbian, French, Spanish, Nepali
- More steps in Speech speed option
- Diacritical marks restoration before speech synthesis for Arabic and Hebrew
- Support for GPU acceleration for Coqui models (NVIDIA CUDA only)
- Fix: Coqui Chinese MMS Hakka and MinNan voices were broken
- Fix: Exporting to audio file was not possible when text was very long
Other
- Setting option to disable support for certain graphic cards
- Setting option Clear cache on close
- Cache compression (Opus format instead of raw audio)
- Detecting the availability of the optional features

Sailfish OS

Changes:

Translator
- New model: English to Hungarian
Speech to Text
- New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
- New engine: April-ASR. Models for: English, French and Polish.
- Stop listening button
- Support for Opus audio codec in Transcribe a file
Text to Speech
- New Piper voices: Arabic, English, Hungarian, Polish, Czech, German, Ukrainian, Vietnamese, Serbian, French, Spanish, Nepali
- More steps in Speech speed option
- Diacritical marks restoration before speech synthesis for Arabic
- Fix: Exporting to audio file was not possible when text was very long
Other
- Setting option Clear cache on close
- Cache compression (Opus format instead of raw audio)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speech Note 4.3.0

Linux Desktop

Sailfish OS