Dataset name missing from Y columns in CSV header #324

Entropy512 · 2024-04-30T00:18:59Z

Currently, CSVs are exported with a header layout that would make sense in a format that supports merged cells, but CSV has no such concept. Instead, the dataset name should be repeated for every column, not just the X columns. The current format breaks the ability to load a dataset into a MultiIndex DataFrame with pandas' read_csv() function, for example.

Currently, the CSV header looks like this:

Red,,Green,,Blue,
X,Y,X,Y,X,Y

To make the headers easier to parse, it should look like this:

Red,Red,Green,Green,Blue,Blue
X,Y,X,Y,X,Y

The latter format loads easily into a pandas MultiIndex DataFrame.
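For illustration, here is a minimal sketch of how the proposed header could be loaded. The two header rows are taken from this issue; the data rows and the variable names (`csv_text`, `df`, `red_y`) are made up for the example, and an in-memory string stands in for an exported file.

```python
import io

import pandas as pd

# Header rows as proposed above; the numeric rows are hypothetical sample data.
csv_text = """Red,Red,Green,Green,Blue,Blue
X,Y,X,Y,X,Y
0.0,0.1,0.0,0.2,0.0,0.3
1.0,0.4,1.0,0.5,1.0,0.6
"""

# header=[0, 1] asks read_csv to use the first two rows as a two-level
# column index, so each column is keyed by (dataset name, axis).
df = pd.read_csv(io.StringIO(csv_text), header=[0, 1])

# Select one column by its (dataset, axis) pair.
red_y = df[("Red", "Y")]
```

With the dataset name present on every column, each column gets a complete (dataset, axis) key and no post-processing of the header is needed.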
