Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.
An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.
COA Tools is a 2D Animation Suite for blender. It offers a 2D cutout animation workflow similar to programs like spine or spriter.
A successor to booknlp, aiming to fix bugs and improve model performance
Object Detection Model for Scanned Documents
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
🦄 Unitxt: a python library for getting data fired up and set for training and evaluation
Character Animation (AnimateAnyone, Face Reenactment)
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Rembg is a tool to remove images background
SDXL API provides a seamless interface for image generation and retrieval using Stable Diffusion XL integrated with Cloudflare AI Workers. This API allows users to generate and manage images in a h…
automated video subtitles creation using whisperx, moviepy and optional chatGPT for translation
Character tropes, Forensic Interviews, and Character Attributes
transform entire books into vectors and embeddings to conduct advanced searches and retrieve relevant information information quickly
AI Storytelling is Natural Language Processing project that targets on conversion of short stories to audiobooks with features of characters, voice, musical background and sound of activities gener…
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
EPUB to audiobook converter, optimized for Audiobookshelf
Unsupervised domain adaptation for conversational speech enhancement using RemixIT
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
🔊 Text-Prompted Generative Audio Model
2022-2 SNU Computer Vision Project - Fortune On Your Hand: View-Invariant Machine Palmistry
[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection(ICCV 2021)
Its a implementation of DeepFont : Identify Your Font from An Image using Keras