From 232cb95fd05f8dda434f4e567f8c64a5e660c2a9 Mon Sep 17 00:00:00 2001 From: Rex Cheng Date: Sun, 15 Dec 2024 20:59:12 -0600 Subject: [PATCH] Update README.md --- README.md | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.md b/README.md index 400d649..2f5fb8f 100644 --- a/README.md +++ b/README.md @@ -85,11 +85,11 @@ The models are also available at https://huggingface.co/hkchengrex/MMAudio/tree/ | Model | Download link | File size | | -------- | ------- | ------- | -| Flow prediction network, small 16kHz | mmaudio_small_16k.pth | 601M | -| Flow prediction network, small 44.1kHz | mmaudio_small_44k.pth | 601M | -| Flow prediction network, medium 44.1kHz | mmaudio_medium_44k.pth | 2.4G | -| Flow prediction network, large 44.1kHz | mmaudio_large_44k.pth | 3.9G | -| Flow prediction network, large 44.1kHz, v2 **(recommended)** | mmaudio_large_44k_v2.pth | 3.9G | +| Flow prediction network, small 16kHz | mmaudio_small_16k.pth | 601M | +| Flow prediction network, small 44.1kHz | mmaudio_small_44k.pth | 601M | +| Flow prediction network, medium 44.1kHz | mmaudio_medium_44k.pth | 2.4G | +| Flow prediction network, large 44.1kHz | mmaudio_large_44k.pth | 3.9G | +| Flow prediction network, large 44.1kHz, v2 **(recommended)** | mmaudio_large_44k_v2.pth | 3.9G | | 16kHz VAE | v1-16.pth | 655M | | 16kHz BigVGAN vocoder (from Make-An-Audio 2) |best_netG.pt | 429M | | 44.1kHz VAE |v1-44.pth | 1.2G |