290 Plazi datasets that are very likely to have classification issues ACC-ACC species (different authors) #380
Comments
@camiplata Is there a spreadsheet for this? |
Oops, I forgot to upload it |
Thanks! |
These 2 don't seem to have anything to fix, can you confirm? |
I'll check! |
https://www.checklistbank.org/dataset/83924/classification
This one https://www.checklistbank.org/dataset/276639/classification has no issues |
Thanks! |
@camiplata most of these have now been fixed. The following file contains datasets where we didn't find any clear errors, so we'll need further feedback from you as to what exactly needs to be fixed. Additionally, these 4 datasets are not done yet: https://www.checklistbank.org/dataset/300248/classification |
That is great Felipe, I'll have a look at the shared file and come back with comments, if any. |
Hi Felipe, there were some without problems and others where you had already done some fixing. The remaining ones I will list here; they are mainly issues due to name duplicates within the same dataset.
Thank you as always!!! |
Regarding
|
@camiplata |
Hi Felipe, thanks for your help. For 1 and 2, I think we can handle this kind of duplicate on the merge, as they are the same. The errors I'm getting with those probably need to be fixed more on our side than on yours. I just checked and there are other sources involved, so I'll have to make some editorial decisions. So I believe we are almost done with this issue package!!!!! |
@camiplata |
Hi Plazi team, this issue is similar to #362.
While reviewing duplicate data on the extended release, I found that Plazi was involved in at least 616 duplicate pairs. I did several checks and found that many are due to:
Here you can find those names, which are represented in 290 datasets. I recommend reviewing each dataset as a whole, since in most cases there are problems in multiple names, not only in the ones listed.
These datasets are very likely to have a problem, although a couple of them may be just fine.
Thank you as always for these "clean ups", these have a very positive impact on Catalogue of Life and all our users.
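For anyone who wants to reproduce this kind of check locally, here is a minimal sketch of how ACC-ACC duplicates with different authors could be flagged. It is not the query actually used for the extended release. The `NameUsage` record, its field names, and both helper functions are hypothetical, and the name matching is a simple case-insensitive comparison rather than real name parsing.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class NameUsage(NamedTuple):
    """Hypothetical flat record for one name usage in a dataset."""
    dataset_key: int
    name: str        # canonical name without authorship
    authorship: str
    status: str      # e.g. "accepted" or "synonym"


def acc_acc_duplicates(usages: Iterable[NameUsage]) -> Dict[str, List[NameUsage]]:
    """Return accepted usages that share a name but differ in authorship."""
    by_name: Dict[str, List[NameUsage]] = defaultdict(list)
    for usage in usages:
        # Only accepted names can form an ACC-ACC pair.
        if usage.status == "accepted":
            by_name[usage.name.strip().lower()].append(usage)

    duplicates: Dict[str, List[NameUsage]] = {}
    for name, group in by_name.items():
        # Naive normalisation: an assumption, not real authorship matching.
        authors = {u.authorship.strip().lower() for u in group}
        if len(authors) > 1:
            duplicates[name] = group
    return duplicates


def datasets_to_review(duplicates: Dict[str, List[NameUsage]]) -> List[int]:
    """Collect the dataset keys involved in any flagged duplicate."""
    return sorted({u.dataset_key for group in duplicates.values() for u in group})
```

Passing the result of `acc_acc_duplicates` to `datasets_to_review` gives the list of datasets worth reviewing as a whole, which matches the recommendation above to check each dataset as an entity rather than only the listed names.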