diff --git a/index.html b/index.html index ba14db3..ccc0a50 100755 --- a/index.html +++ b/index.html @@ -146,7 +146,8 @@

Speech Recognition Dataset

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, Jinpeng Li, Bo Yang, Yexing Du, Ziyang Ma, Xunying Liu, Ziyuan Wang, Ke Li, Shuai Fan, Kai Yu, Wei-Qiang Zhang, Guoguo Chen, Xie Chen

Preprint in arXiv, 2024

-

[Dataset | Code]

+

GigaSpeech2 powers Typhoon-Audio, a state-of-the-art open-source audio language model for Thai.

+

[Dataset] [Code]

  • LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

    @@ -157,7 +158,7 @@

    Speech Recognition Dataset

    Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

    Wei Kang, Xiaoyu Yang, Zengwei Yao, Fangjun Kuang, Yifan Yang, Liyong Guo, Long Lin, Daniel Povey

    Oral in Proc. ICASSP, 2024

    -

    [Dataset | Code]

    +

    [Dataset] [Code]

  • Speech Processing via Discrete Tokens

    @@ -202,7 +203,7 @@

    Awards

    Academic Service