    Repositories list

    • bu

      Public
      JavaScript
      0000 Updated Jan 6, 2025
    • Run AI Agent in your browser.
      Python
      367000 Updated Jan 5, 2025
    • Text2midi

      Public
      Python
      MIT License
      4000 Updated Dec 28, 2024
    • Build your own StyleTTS 2 Voice!
      JavaScript
      2100 Updated Dec 26, 2024
    • MMAudio

      Public
      [arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
      Python
      MIT License
      101000 Updated Dec 21, 2024
    • theme

      Public
      EJS
      0000 Updated Dec 17, 2024
    • HunyuanVideo GP: Large Video Generation Model - GPU Poor version
      Python
      Other
      563000 Updated Dec 11, 2024
    • mcpc

      Public
      JavaScript
      0000 Updated Dec 1, 2024
    • JavaScript
      0000 Updated Dec 1, 2024
    • mcp

      Public
      JavaScript
      0000 Updated Nov 28, 2024
    • A minimal and universal controller for FLUX.1.
      Python
      68000 Updated Nov 27, 2024
    • Python
      MIT License
      27100 Updated Nov 26, 2024
    • EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
      Python
      Apache License 2.0
      264000 Updated Nov 24, 2024
    • local

      Public
      HTML
      0000 Updated Nov 24, 2024
    • JoyVASA

      Public
      Python
      MIT License
      51000 Updated Nov 20, 2024
    • Enhanced background remove and replace app built around BRIA-RMBG-2.0. Low VRAM/RAM | 6GB Install
      Python
      6000 Updated Nov 17, 2024
    • Creative Image Enhancer/Upscaler. Powered By Refiners. 8GB VRAM | 10GB Install
      Python
      Other
      3100 Updated Nov 16, 2024
    • Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
      Python
      MIT License
      266000 Updated Nov 16, 2024
    • F5-TTS

      Public
      Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
      Python
      MIT License
      1.2k000 Updated Nov 10, 2024
    • hertz-dev

      Public
      first base model for full-duplex conversational audio
      Python
      Apache License 2.0
      110000 Updated Nov 6, 2024
    • Brand new TTS solution
      Python
      Other
      1.4k100 Updated Nov 3, 2024
    • OmniGen

      Public
      Jupyter Notebook
      MIT License
      274000 Updated Oct 27, 2024
    • Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
      Python
      MIT License
      2.2k200 Updated Oct 9, 2024
    • FacePoke

      Public
      Select a portrait, click to move the head around (please use your own space / GPU!)
      JavaScript
      Other
      78101 Updated Oct 7, 2024
    • Python
      Apache License 2.0
      910000 Updated Sep 5, 2024
    • CogVideo

      Public
      Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
      Python
      Apache License 2.0
      9571600 Updated Sep 3, 2024
    • Various AI scripts. Mostly Stable Diffusion stuff.
      Python
      MIT License
      428000 Updated Sep 2, 2024
    • kohya_ss

      Public
      Python
      Apache License 2.0
      1.3k000 Updated Aug 31, 2024
    • OpenVoice

      Public
      Instant voice cloning by MIT and MyShell.
      Python
      MIT License
      3k100 Updated Aug 31, 2024
    • A parallel fork of node-pty providing ia32, amd64, arm, and aarch64 prebuilt packages for macOS, Windows and Linux (glibc and musl libc).
      TypeScript
      Other
      253000 Updated Aug 23, 2024