Improved delimiter sniffing for .csv files #210
Comments
This reminded me of a similar discussion about auto-detection of encoding for csv files at #198. Although, now that I have read the above a bit more carefully, I see that the suggested changes are quite limited. Do you have an idea why the changes lead to a better outcome for the example csv? Does the existing code fail to read far enough, or does it choose an incorrect delimiter too early?
Yes, I followed that one closely as well, but fortunately the problem here seems less complicated. I assume that we will encounter .csv files in the future with gaps in the data causing "variance", hence my proposed solution of dynamically sniffing a quarter, half, three quarters and ultimately the full length of the characters. I will try out a somewhat longer test.csv with more missing data values later on, just to see how it behaves and where the limits might be.
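For readers following along, the checkpoint idea described above can be sketched roughly as follows. This is not the code from the proposed change; it is a minimal illustration that assumes the sniffing is done with Python's standard `csv.Sniffer`. The function name `sniff_delimiter`, the candidate delimiter set and the `fractions` parameter are hypothetical.

```python
import csv


def sniff_delimiter(text, candidates=",;\t", fractions=(0.25, 0.5, 0.75, 1.0)):
    """Sniff the delimiter on progressively longer prefixes of `text`.

    Hypothetical helper: tries a quarter, half, three quarters and finally
    the full text, returning the first delimiter csv.Sniffer settles on.
    """
    sniffer = csv.Sniffer()
    for fraction in fractions:
        # Always pass at least one character to the sniffer.
        sample = text[: max(1, int(len(text) * fraction))]
        try:
            return sniffer.sniff(sample, delimiters=candidates).delimiter
        except csv.Error:
            # No consistent delimiter in this prefix; try a longer one.
            continue
    raise csv.Error("Could not determine delimiter at any checkpoint")


if __name__ == "__main__":
    print(sniff_delimiter("a;b;c\n1;2;3\n4;5;6\n"))
```

Stopping at the first checkpoint that succeeds keeps the sample small for well-formed files, while files with many empty fields fall through to longer prefixes.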
Okay, I played around quite a bit with differing amounts of missing values (represented by "<value>;;<value>;<value>;;" --> delimiters with nothing in between) and it seems like 50% is the sweet spot.
Should I provide an official PR for this solution?
Hi @JensWendt yes please - I think that would be helpful for discussion and testing, thanks.
Repository: ome/omero-scripts Excluded PRs: - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' (user: JensWendt) - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' (status: failure) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Already up-to-date. Merged PRs: - PR 187 jburel 'Rtd' Generated by OMERO-python-superbuild-push#71 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-python-superbuild-push/71/)
Solution for delimiter reading issue (#210) utilizing dynamic checkpoints
Repository: ome/omero-scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..8bf8b9f Previously merged: - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Merged PRs: - PR 212 jburel 'add entry for Jens' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-python-superbuild-push#9 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-python-superbuild-push/9/)
Repository: ome/scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..8bf8b9f Previously merged: - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Merged PRs: - PR 212 jburel 'add entry for Jens' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-plugins-push#6 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-plugins-push/6/)
Repository: ome/omero-scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..d3fb626 Previously merged: - PR 212 jburel 'add entry for Jens' - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-python-superbuild-push#312 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-python-superbuild-push/312/)
Repository: ome/scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..d3fb626 Previously merged: - PR 212 jburel 'add entry for Jens' - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-plugins-push#268 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-plugins-push/268/)
Repository: ome/omero-scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..d3fb626 Previously merged: - PR 212 jburel 'add entry for Jens' - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-python-superbuild-push#10 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-python-superbuild-push/10/)
Repository: ome/scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..d3fb626 Previously merged: - PR 212 jburel 'add entry for Jens' - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-plugins-push#7 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-plugins-push/7/)
Repository: ome/omero-scripts Excluded PRs: - PR 206 Rdornier 'Adding a script that saves data owner as key-value pair' (user: Rdornier) - PR 198 JulianHn 'Adding Support for different CSV Encodings in Import_Scripts/Populate_Metadata.py' (user: JulianHn) - PR 103 joshmoore 'Script for calculating min/max' (exclude comment) Updating 824d8a2..d3fb626 Previously merged: - PR 212 jburel 'add entry for Jens' - PR 211 JensWendt 'Solution for delimiter reading issue (ome#210) utilizing dynamic checkpoints' Conflicting PRs (not included): - PR 202 will-moore 'Add Plate support to KeyVal_to_csv.py' Generated by OMERO-python-superbuild-push#313 (https://snoopycrimecop-ci.openmicroscopy.org/jenkins/job/OMERO-python-superbuild-push/313/)
Hello,
Following up on our earlier discussion in #195, I would like to propose an improvement to the delimiter sniffing technique we use.
Why am I raising this now? Because @SchmChris found a .csv file that the current code cannot resolve properly.
The .csv in question: test.csv
The current code reads:
I propose ditching the fixed number of characters we feed into the sniffer and going for a dynamic approach instead, where we read the first quarter, the first half, the first three quarters, or the whole file, stopping as soon as the sniffer succeeds.
This may cost a bit more time, but it should lead more reliably to a successful "sniffing" of the delimiter.
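To make the idea concrete, here is a minimal sketch of the checkpoint approach, not the code from any existing script. The function name `sniff_delimiter`, the candidate delimiters `",;\t"` and the exact fractions are assumptions chosen for illustration; only `csv.Sniffer().sniff()` and `csv.Error` come from the Python standard library.

```python
import csv


def sniff_delimiter(text, candidates=",;\t",
                    fractions=(0.25, 0.5, 0.75, 1.0)):
    """Return the delimiter detected on the smallest prefix that works.

    Hypothetical helper: tries csv.Sniffer on the first quarter, half,
    three quarters and finally the whole text, in that order.
    """
    sniffer = csv.Sniffer()
    for fraction in fractions:
        # Take a growing prefix of the file content (at least 1 char).
        sample = text[:max(1, int(len(text) * fraction))]
        try:
            return sniffer.sniff(sample, delimiters=candidates).delimiter
        except csv.Error:
            # Sniffer could not decide on this sample; try a longer one.
            continue
    raise csv.Error("Could not determine delimiter")


if __name__ == "__main__":
    # Semicolon-separated rows with some missing values.
    content = "a;b;c\n1;;3\n;5;\n7;8;9\n"
    print(sniff_delimiter(content))
```

The sketch stops at the first checkpoint where the sniffer returns a result, so well-behaved files only pay for the quarter-length sample. Note that it accepts the first answer without checking it, so an early sample that yields the wrong delimiter would not be corrected by the later checkpoints.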
Does this sound like a good idea? Am I missing some issue, or is there a more elegant way to implement this?
Once this more robust method is implemented (in whatever fashion), I would also like to adapt the code for populate_metadata.py and OMERO.parade to include it, as the delimiter issue will come up with any German-localized Excel version and may prevent new OMERO users from using this part of OMERO's functionality.