It's currently a bit of a hassle to have to download a potentially large dataset from the Hugging Face Hub (HF) before being able to train on it. The `datasets` library has an option to return an `IterableDataset` or `IterableDatasetDict` and stream the data in, rather than blocking until the whole dataset has been loaded.
These classes are handled differently from the regular `Dataset` and `DatasetDict`, though, so we'll need to decide whether this should be supported.
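To make the difference concrete, here is a minimal sketch of what streaming support might involve. It assumes the `streaming=True` option of `load_dataset`, which returns an `IterableDataset` when a split is given. The helper names `iter_batches` and `load_streaming_split` are hypothetical and not part of this project or of `datasets`.

```python
from itertools import islice
from typing import Any, Dict, Iterable, Iterator, List


def iter_batches(
    examples: Iterable[Dict[str, Any]], batch_size: int
) -> Iterator[List[Dict[str, Any]]]:
    """Yield lists of up to `batch_size` examples from any iterable.

    Hypothetical helper: it only iterates, so it never needs len() or
    integer indexing, which a streamed dataset may not provide.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(examples)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def load_streaming_split(name: str, split: str = "train"):
    """Open one split of a Hub dataset in streaming mode.

    Hypothetical wrapper. The import is done lazily so that iter_batches
    can be used without the `datasets` library installed.
    """
    from datasets import load_dataset

    # With streaming=True the examples are fetched lazily during iteration
    # instead of being downloaded in full before this call returns.
    return load_dataset(name, split=split, streaming=True)
```

Because `iter_batches` relies only on iteration, the same training loop could consume either a regular `Dataset` or a streamed one, e.g. `for batch in iter_batches(load_streaming_split("some/dataset"), 32): ...`, where the dataset name is a placeholder. Anything that depends on the dataset length or on random access would still need separate handling.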