OSError: [Errno 22] Invalid argument forbidden character #7388

Closed
langflogit opened this issue Feb 10, 2025 · 2 comments
Comments

@langflogit

Describe the bug

I'm on Windows and I'm trying to load a dataset, but I get the error in the title because some files in the repository have names containing characters such as < and >, which are not allowed in Windows file names. Would it be possible to load this dataset with those characters removed?

Steps to reproduce the bug

load_dataset("CATMuS/medieval") on Windows

Expected behavior

The function should strip the forbidden characters so that datasets whose file names contain them can still be loaded.

Environment info

  • datasets version: 3.2.0
  • Platform: Windows-10-10.0.19045-SP0
  • Python version: 3.12.2
  • huggingface_hub version: 0.28.1
  • PyArrow version: 19.0.0
  • Pandas version: 2.2.3
  • fsspec version: 2024.9.0
@lhoestq
Member

lhoestq commented Feb 11, 2025

You can probably copy the dataset to your HF account and rename the files there (without having to download them to your disk). Alternatively, feel free to open a Pull Request on this dataset with the renamed files.
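
To decide which files would need renaming, something like the following could be used. It is only a rough sketch, not part of `datasets` or `huggingface_hub`: the helper names `sanitize_repo_path` and `plan_renames` are made up for illustration, the choice of `_` as a replacement is arbitrary, and the example file names are hypothetical rather than the actual contents of CATMuS/medieval. It only computes the old-to-new mapping; applying the renames on the Hub is a separate step not shown here.

```python
import re

# Characters Windows does not allow in file names.
# "/" is left out on purpose because it separates folders in a repo path.
_FORBIDDEN = re.compile(r'[<>:"\\|?*]')


def sanitize_repo_path(path: str, replacement: str = "_") -> str:
    """Replace Windows-forbidden characters in a repo path (hypothetical helper)."""
    return _FORBIDDEN.sub(replacement, path)


def plan_renames(paths: list[str]) -> dict[str, str]:
    """Map each path that needs renaming to its sanitized name.

    Raises ValueError if two files would end up with the same name.
    """
    renames: dict[str, str] = {}
    taken: set[str] = set()
    for path in paths:
        new_path = sanitize_repo_path(path)
        if new_path in taken:
            raise ValueError(f"name collision on {new_path!r}")
        taken.add(new_path)
        if new_path != path:
            renames[path] = new_path
    return renames


# Hypothetical file names, used only to illustrate the mapping.
example = ["data/train-<part1>.parquet", "data/test.parquet"]
print(plan_renames(example))
```

The collision check matters because two distinct names such as `a<.txt` and `a>.txt` would both become `a_.txt` after replacement.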

@langflogit
Author

Thank you, that will help me work around this problem
