train-2mix split #2

Open
valentin7121 opened this issue Mar 19, 2024 · 2 comments

@valentin7121

Where can I find train-2mix (specifically train-2mix.jsonl)? The LibriSpeechMix repository contains only dev-clean-2mix and test-clean-2mix.

@AntoineBlanot

There is no train dataset in the LibriSpeechMix repository, but there are train-clean-100, train-clean-360 and train-other-500.
Maybe he is using one of them? Otherwise, the data must come from somewhere else.

@lucadellalib
Owner

Hello, as pointed out by @AntoineBlanot, unfortunately LibriSpeechMix provides only dev/test manifest files and none for training. You would have to write your own utterance-mixing script to generate manifest files with the same structure (for example, using utterances from the LibriSpeech training set).
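
For anyone landing here, a rough sketch of what such a mixing script could look like. It only builds the manifest entries; it does not read or write any audio. The field names (`id`, `mixed_wav`, `texts`, `wavs`, `delays`, `speakers`, `durations`) are my assumption of what the dev/test manifests contain, so check them against `dev-clean-2mix.jsonl` before relying on this. The helper names, the input record layout (`wav`, `speaker`, `duration`, `text`) and the delay rule (each utterance starts at least `min_gap` seconds after the previous one) are hypothetical choices for illustration, not the procedure used to build LibriSpeechMix.

```python
import json
import random


def make_mix_entries(utterances, num_mixes, num_speakers=2,
                     min_gap=0.5, prefix="train-2mix", seed=0):
    """Build manifest entries for overlapped mixtures (hypothetical helper).

    `utterances` is a list of dicts with the assumed keys
    "wav", "speaker", "duration" (seconds) and "text".
    """
    if len({u["speaker"] for u in utterances}) < num_speakers:
        raise ValueError("not enough distinct speakers to build a mixture")

    rng = random.Random(seed)
    entries = []
    for i in range(num_mixes):
        # Resample until every picked utterance comes from a different speaker.
        while True:
            picked = rng.sample(utterances, num_speakers)
            if len({u["speaker"] for u in picked}) == num_speakers:
                break

        # First utterance starts at 0; each later one starts at least
        # `min_gap` seconds after the previous start, and before the previous
        # utterance ends whenever that utterance is longer than `min_gap`.
        delays = [0.0]
        for prev in picked[:-1]:
            upper = max(min_gap, prev["duration"])
            delays.append(round(delays[-1] + rng.uniform(min_gap, upper), 2))

        mix_id = f"{prefix}/{prefix}-{i:04d}"
        entries.append({
            # Field names are assumptions; verify against the dev manifest.
            "id": mix_id,
            "mixed_wav": mix_id + ".wav",
            "texts": [u["text"] for u in picked],
            "wavs": [u["wav"] for u in picked],
            "delays": delays,
            "speakers": [u["speaker"] for u in picked],
            "durations": [u["duration"] for u in picked],
        })
    return entries


def write_jsonl(entries, path):
    """Write one JSON object per line, as in a .jsonl manifest."""
    with open(path, "w", encoding="utf-8") as f:
        for entry in entries:
            f.write(json.dumps(entry) + "\n")
```

You would fill `utterances` from the LibriSpeech transcripts and audio metadata, call `write_jsonl(make_mix_entries(utterances, num_mixes=...), "train-2mix.jsonl")`, and then create the mixed waveforms by summing the source signals shifted by the listed delays.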
