Skip to content

Commit

Permalink
Update index.html
Browse files Browse the repository at this point in the history
  • Loading branch information
yfyeung authored Aug 25, 2024
1 parent f4086ad commit 8a9979d
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -146,7 +146,8 @@ <h4>Speech Recognition Dataset</h4>
<p><a href="https://arxiv.org/pdf/2406.11546">GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement</a></p>
<b>Yifan Yang</b>, Zheshu Song, Jianheng Zhuo, Mingyu Cui, Jinpeng Li, Bo Yang, Yexing Du, Ziyang Ma, Xunying Liu, Ziyuan Wang, Ke Li, Shuai Fan, Kai Yu, Wei-Qiang Zhang, Guoguo Chen, Xie Chen</p>
<p>Preprint in arXiv, 2024</p>
<p>[<a href="https://huggingface.co/datasets/speechcolab/gigaspeech2">Dataset</a> | <a href="https://github.com/SpeechColab/GigaSpeech2">Code</a>]</p>
<p>GigaSpeech2 powers <a href="https://blog.opentyphoon.ai/typhoon-audio-preview-release-6fbb3f938287">Typhoon-Audio</a>, a state-of-the-art open-source audio language model for Thai.</p>
<p>[<a href="https://huggingface.co/datasets/speechcolab/gigaspeech2">Dataset</a>] [<a href="https://github.com/SpeechColab/GigaSpeech2">Code</a>]</p>
</li>
<li>
<p>LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization</p>
Expand All @@ -157,7 +158,7 @@ <h4>Speech Recognition Dataset</h4>
<p><a href="https://arxiv.org/pdf/2309.08105.pdf">Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context</a></p>
<p>Wei Kang, Xiaoyu Yang, Zengwei Yao, Fangjun Kuang, <b>Yifan Yang</b>, Liyong Guo, Long Lin, Daniel Povey</p>
<p><span style="color:red; font-weight:bold;">Oral</span> in Proc. ICASSP, 2024</p>
<p>[<a href="https://huggingface.co/datasets/pkufool/libriheavy">Dataset</a> | <a href="https://github.com/k2-fsa/libriheavy">Code</a>]</p>
<p>[<a href="https://huggingface.co/datasets/pkufool/libriheavy">Dataset</a>] [<a href="https://github.com/k2-fsa/libriheavy">Code</a>]</p>
</li>
</ul>
<h4>Speech Processing via Discrete Tokens</h4>
Expand Down Expand Up @@ -202,7 +203,7 @@ <h3>Awards</h3>
<h3>Academic Service</h3>
<ul>
<li>
<p>[Journal Reviewer] IEEE Signal Processing Letters (SPL)</p>
<p>[Journal Reviewer] IEEE Signal Processing Letters (SPL)</p>
</li>
<li>
<p>[Conference Reviewer] The Thirteenth International Conference on Learning Representations (ICLR 2025)</p>
Expand Down

0 comments on commit 8a9979d

Please sign in to comment.