Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adjust handling of tagged datasets? #5582

Open
gbif-portal opened this issue Nov 20, 2024 · 10 comments
Open

Adjust handling of tagged datasets? #5582

gbif-portal opened this issue Nov 20, 2024 · 10 comments
Assignees

Comments

@gbif-portal
Copy link
Collaborator

Adjust handling of tagged datasets?

Knowing that it could be done, I've tagged a set of five business-sector datasets (see "Base de datos programa de rescate ECA-Liquifaction correspondiente a México (FOO)". There's no way to tag the publisher, because they come invia CONABIO, but they are from an LNG facility in Baja California. (In fact, they may be the first datasets to come in at the recommendation of the Equator Principles, a consortium of private banks that the project financier recently left.)

These datasets belong here, but they appear as individual entries alongside all the publishers. I'm sure this works as designed, but I wanted to ask: is there any way for us to consolidate these? Perhaps by adding a "Company" value to the private sector tag?

Just wanted to ask, in case other examples like this come to light.


Github user: @kcopas
User: See in registry - Send email
System: Firefox 132.0.0 / Mac OS X 10.15.0
Referer: https://www.gbif.org/composition/1XtRfS0nTKs8HtRd18Q7ai/businesses-sharing-biodiversity-data-via-gbif
Window size: width 1688 - height 898
API log
Site log
System health at time of feedback: OPERATIONAL

@kcopas
Copy link
Member

kcopas commented Nov 20, 2024

btw, the version of the table on GBIF.org is amazing—near real-time!

@MortenHofft
Copy link
Member

If we added a new tag like privateSector.gbif.org:company:my-company-name then we could show that information as well.
But would it be a column (and empty when not filled?). Repeat the org name when not filled?
The datasets would still be repeated, just labeled as the same company?

Would it be a better and feasible option to create a publisher, and then have CONABIO host it instead?

@kcopas
Copy link
Member

kcopas commented Nov 21, 2024

Would it be a better and feasible option to create a publisher, and then have CONABIO host it instead?

That was actually what I was hoping would come to pass. The group responsible for the datasets actually is registered as a publisher.

CONABIO remains the listed publisher of nearly all data from Mexican institutions (1,032 / 1,086 datasets).

The intent of any tweak would be to aggregate the five standalone datasets that now show in the table as a single line listed under the company name (ECA Liquefaction S. de R.L. de C.V.).

I wanted to ask about feasibility here first. Let me revert to las Patricias at CONABIO to ask about their willingness to shift the datasets to the publisher since that is already in place.

@CecSve
Copy link

CecSve commented Nov 21, 2024

How is the activity sector defined, please?

@kcopas
Copy link
Member

kcopas commented Nov 21, 2024

It's based on GICS: the Global Industrial Standard Classification.

@CecSve
Copy link

CecSve commented Nov 21, 2024

It's based on GICS: the Global Industrial Standard Classification.

And is it self-declared by the publisher somehow or do we add it?

@kcopas
Copy link
Member

kcopas commented Nov 21, 2024

To date, we add it as a machine tag in the registry.

@ManonGros
Copy link

If we are going to have the data tagged at the dataset level, wouldn't a network be more appropriate? @ahahn-gbif

@ahahn-gbif
Copy link

I would favor the option of CONABIO acting as host for the (already registered) company publisher, if CONABIO can be convinced.

(...) data tagged at the dataset level, wouldn't a network (...)

So far we tag the business sector contributions at publisher level, since we understand that this is a property of a publisher, and would apply to all their datasets. Tagging at network level (individual datasets) could be automated to a degree, but is an unnecessary overhead if the other solution could work. If we do not have to go this route, I would not.

@kcopas kcopas self-assigned this Nov 25, 2024
@kcopas
Copy link
Member

kcopas commented Nov 30, 2024

Five datasets transferred to ECA Liquefaction as the publisher. Tags removed from datasets and added to publisher, and table has auto-updated (https://gbif.link/business-data-publishers).

Nearly ready to close this (thank you!), but one small detail remains, as the new publisher is not showing any of the data citations attributed to the datasets. There are only two and three thus far, but perhaps those need some manual updating to reallocate them from CONABIO? Just a thought, you'll know better than I…

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants