Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment

warmshao/ChatTTSPlus

ChatTTSPlus: Extension of ChatTTS

Chinese | English

ChatTTSPlus is an extension of ChatTTS, adding features such as TensorRT acceleration, voice cloning, and mobile model deployment.

If you find this project useful, please give it a star! ✨✨

Some fun demos based on ChatTTSPlus

  • NotebookLM podcast: Open In Colab. Use ChatTTSPlus to turn the AnimateAnyone paper into a podcast.

New Features

  • Refactored the ChatTTS code into a structure I'm familiar with.
  • Achieved over 3x acceleration with TensorRT: throughput on a 3060 GPU under Windows rises from 28 tokens/s to 110 tokens/s.
  • Windows integration package for one-click extraction and use.
  • Implemented voice cloning using techniques such as LoRA. See voice_clone for details.
  • Model compression and acceleration using techniques like pruning and knowledge distillation, targeting mobile deployment.
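
The TensorRT figures quoted in the list above work out to a speedup of roughly 3.9x. As a quick check of that arithmetic, the sketch below computes the ratio and the time needed to generate a fixed number of tokens at each rate. The function names and the 1,000-token workload are illustrative assumptions and are not part of ChatTTSPlus; the calculation also assumes a constant generation rate.

```python
def speedup(baseline_tps: float, accelerated_tps: float) -> float:
    """Return how many times faster the accelerated rate is than the baseline."""
    if baseline_tps <= 0 or accelerated_tps <= 0:
        raise ValueError("throughput must be positive")
    return accelerated_tps / baseline_tps


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Return the time needed to generate num_tokens at a constant rate."""
    if tokens_per_second <= 0:
        raise ValueError("throughput must be positive")
    return num_tokens / tokens_per_second


# Figures quoted in the feature list: 28 tokens/s before and 110 tokens/s with TensorRT.
BASELINE_TPS = 28.0
TENSORRT_TPS = 110.0

if __name__ == "__main__":
    tokens = 1000  # hypothetical workload, chosen only for illustration
    print(f"speedup:  {speedup(BASELINE_TPS, TENSORRT_TPS):.2f}x")
    print(f"baseline: {generation_seconds(tokens, BASELINE_TPS):.1f} s")
    print(f"TensorRT: {generation_seconds(tokens, TENSORRT_TPS):.1f} s")
```

For the quoted rates this gives a ratio of about 3.93, with the hypothetical 1,000-token job dropping from about 35.7 s to about 9.1 s.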

Environment Setup

  • Install Python 3; Miniforge is recommended. Create and activate an environment: conda create -n chattts_plus python=3.10 && conda activate chattts_plus
  • Clone the source code and enter the project root directory: git clone https://github.com/warmshao/ChatTTSPlus && cd ChatTTSPlus
  • Install the required Python libraries: pip install -r requirements.txt
  • [Optional] To use TensorRT, install TensorRT 10.
  • [Recommended for Windows users] Download the integration package directly from the Google Drive link, extract it, and double-click webui.bat to start. To update the code, double-click update.bat. Note: this overwrites all local code modifications.
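
After a manual setup, it can help to confirm that the commands are being run from the project root. The following is a minimal sketch that checks for a few files this README mentions; the helper name `missing_files` and the choice of files are assumptions for illustration, not an official manifest of the repository.

```python
from pathlib import Path
from typing import List

# Files named in this README; an illustrative selection, not a complete manifest.
EXPECTED_FILES = (
    "requirements.txt",
    "webui.py",
    "configs/infer/chattts_plus.yaml",
)


def missing_files(project_root: str) -> List[str]:
    """Return the expected files that are absent under project_root."""
    root = Path(project_root)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_files(".")
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("Checkout looks complete.")
```

Run it from the ChatTTSPlus directory; an empty result only means these files exist, not that the dependencies are installed correctly.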

Demo

  • Web UI with TensorRT: python webui.py --cfg configs/infer/chattts_plus_trt.yaml
  • Web UI with PyTorch: python webui.py --cfg configs/infer/chattts_plus.yaml
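
The choice between the two commands above can be automated. The sketch below picks the TensorRT config when the `tensorrt` Python package is importable and falls back to the PyTorch config otherwise. The helper names `pick_config` and `build_webui_command` are hypothetical, and importability is only a rough proxy: it does not guarantee that TensorRT actually works on the machine.

```python
import importlib.util
import sys
from typing import List, Optional

# Config paths as given in the Demo commands.
TRT_CFG = "configs/infer/chattts_plus_trt.yaml"
TORCH_CFG = "configs/infer/chattts_plus.yaml"


def pick_config(trt_available: Optional[bool] = None) -> str:
    """Return the TensorRT config if TensorRT is available, else the PyTorch one."""
    if trt_available is None:
        # Assumption: an importable `tensorrt` module means TensorRT is installed.
        trt_available = importlib.util.find_spec("tensorrt") is not None
    return TRT_CFG if trt_available else TORCH_CFG


def build_webui_command(
    trt_available: Optional[bool] = None,
    python_exe: str = sys.executable,
) -> List[str]:
    """Build the argument list for launching the web UI."""
    return [python_exe, "webui.py", "--cfg", pick_config(trt_available)]


if __name__ == "__main__":
    # Only prints the command; nothing is launched here.
    print(" ".join(build_webui_command()))
```

To launch the web UI, pass the returned list to `subprocess.run(cmd, check=True)` from the project root.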

License

ChatTTSPlus inherits its license from ChatTTS; where the two differ, the ChatTTS license terms take precedence.

The code is published under the AGPLv3+ license.

The model is published under the CC BY-NC 4.0 license. It is intended for educational and research use and should not be used for any commercial or illegal purposes. The authors do not guarantee the accuracy, completeness, or reliability of the information. The information and data used in this repository are for academic and research purposes only. The data is obtained from publicly available sources, and the authors do not claim any ownership or copyright over the data.

Disclaimer

We accept no responsibility for any illegal use of the codebase. Please consult your local laws regarding the DMCA and related legislation.
