We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi,
can anyone share your resource usage and settings for downloading the dataset?
We have tried to download the 10M sub-dataset in 720p without audio. But it requires more than 15K CPU hours.
Is there anything wrong?
Thx.
The text was updated successfully, but these errors were encountered:
this is because you're re-encoding the downloaded videos when splitting. Try adding:
subsampling: ClippingSubsampler: args: precision: keyframe_adjusted
in the config and it would orders of magnitude faster
Sorry, something went wrong.
No branches or pull requests
Hi,
can anyone share your resource usage and settings for downloading the dataset?
We have tried to download the 10M sub-dataset in 720p without audio. But it requires more than 15K CPU hours.
Is there anything wrong?
Thx.
The text was updated successfully, but these errors were encountered: