Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset name missing from Y columns in CSV header #324

Open
Entropy512 opened this issue Apr 30, 2024 · 0 comments
Open

Dataset name missing from Y columns in CSV header #324

Entropy512 opened this issue Apr 30, 2024 · 0 comments

Comments

@Entropy512
Copy link

Currently, CSVs are exported in a manner that would make sense for a format that supports merged cells, but CSV does not support this. Instead, the dataset name should be included for every column, not just the X columns. The current format breaks the ability to load a dataset with Python Pandas' load_csv() function into a MultiIndex dataframe, for example.

Currently, the CSV header looks like this:

Red,,Green,,Blue,
X,Y,X,Y,X,Y

To make the headers easier to parse, it should look like this:

Red,Red,Green,Green,Blue,Blue
X,Y,X,Y,X,Y

The latter format loads into a Pandas MultiIndex dataframe easily

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant