Skip to content

Speech Note 4.3.0

Compare
Choose a tag to compare
@mkiol mkiol released this 13 Nov 08:53
· 600 commits to main since this release

Linux Desktop

Changes:

  • Accessibility
    • Global keyboard shortcuts (X11 only)
    • Support for Actions
  • User Interface
    • Desktop notifications
    • Speech speed control in the main app window
    • Opening files with Drag and Drop gesture
    • Fix: Application did not use native widgets on some platforms
  • Translator
    • New model: English to Hungarian
  • Speech to Text
    • New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
    • New engine: Faster Whisper
    • New engine: April-ASR. Models for: English, French and Polish.
    • Inserting text to any active window (X11 only)
    • Copy decoded text directly to the clipboard
    • Stop listening button
    • Support for Opus audio codec in Transcribe a file
    • More effective GPU acceleration for Whisper models (NVIDIA CUDA only)
    • New smaller and quicker Whisper models for English: Distil-Whisper
    • New version of Whisper Large model: Whisper Large-v3
    • Fix: CUDA acceleration for Whisper models did not work on NVIDIA video cards with Maxwell architecture
  • Text to Speech
    • New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
    • New engine: Mimic 3
    • Reading text from the clipboard
    • New Piper voices: Arabic, English, Hungarian, Polish, Czech, German, Ukrainian, Vietnamese, Serbian, French, Spanish, Nepali
    • More steps in Speech speed option
    • Diacritical marks restoration before speech synthesis for Arabic and Hebrew
    • Support for GPU acceleration for Coqui models (NVIDIA CUDA only)
    • Fix: Coqui Chinese MMS Hakka and MinNan voices were broken
    • Fix: Exporting to audio file was not possible when text was very long
  • Other
    • Setting option to disable support for certain graphic cards
    • Setting option Clear cache on close
    • Cache compression (Opus format instead of raw audio)
    • Detecting the availability of the optional features

Sailfish OS

Changes:

  • Translator
    • New model: English to Hungarian
  • Speech to Text
    • New languages: Afrikaans, Gujarati, Hausa, Telugu, Tswana, Javanese, Hebrew
    • New engine: April-ASR. Models for: English, French and Polish.
    • Stop listening button
    • Support for Opus audio codec in Transcribe a file
  • Text to Speech
    • New Piper voices: Arabic, English, Hungarian, Polish, Czech, German, Ukrainian, Vietnamese, Serbian, French, Spanish, Nepali
    • More steps in Speech speed option
    • Diacritical marks restoration before speech synthesis for Arabic
    • Fix: Exporting to audio file was not possible when text was very long
  • Other
    • Setting option Clear cache on close
    • Cache compression (Opus format instead of raw audio)