-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Misinterpretation of country - GBIF datasets #45
Comments
Hi @ManonGros, Thanks for this list. I'll check with @myrmoteras how he want to handle them, and go from there. But just so I can understand: I think gbif/portal-feedback#2420 and gbif/portal-feedback#2380 are related as the first cites the latter? All the best, |
yes I think so (but I am not sure) |
It seems like this new issue could also be related: gbif/portal-feedback#3030 |
I'm checking with @myrmoteras and Guido whether I'll act on these issues, or Guido will do it. Thanks for reporting them! |
It's a similar problem as (gbif/portal-feedback#2380), yes ... with "Wales" as in "New South Wales" being the culprit this time ... that together with an erroneous materials citation split between "N.S." and "Wales" (in what is supposed to read "N. S. Wales") elevates "Wales" to a standalone country level, which ISO normalizes to the UK. Geographical homonyms (if infix based ones in this instance) ... we're doing a lot more to sort these out than we did back in 2016, when this article was processed, but these legacy error remain. |
That said, maybe we should add a "problematic/ambiguous country name" check in the QC? |
Hi @gsautter , I think an additional QC rule to cover this might be very interesting. Better false positives than data issues, in my opinion. And let us cover this, then. I'll query TB Stats and divide the task among people. Sooner than later we will have this covered. What do you think? |
Let's get the stats and analyze this first ... maybe we can devise some automated filter to deal with the lion's share of the obvious cases ... no need to have the office do more tedious and repetitive work than necessary ... |
Found 175 articles that might contain the Wales/New South Wales issue... but you might be able to filter them a bit further using your own API. Please, tell me what you think. Thanks! |
Well, I don't really have an "own API" or anything ... just using the stats line anyone else ... but sure, there might be ways of filtering further, if somewhat downstream using the whole treatments, e.g. looking for other UK constituents like "Scotland" and "England", or for the presence or absence of Australia or other Australian territories ... we'll see, the data will tell. |
Hi, I am trying to sort a bit some of the issues on the GBIF feedback repository and I thought I would forward you the following issues:
Let me know if you need anything!
The text was updated successfully, but these errors were encountered: