[Batch Label Data] Add more label data for the Database technical area, labeled on dbdb.io and the DB-Engines Ranking up to December 30, 2024. #1664

Closed
birdflyi opened this issue Dec 31, 2024 · 2 comments · Fixed by #1665
Labels
waiting for repliers need other's feedback

Comments

@birdflyi
Contributor

Description

I want to add some labeled data to OpenDigger to support our community analysis.
The data comes from a dataset that fuses records from dbdb.io and DB-Engines as of December 30, 2024. It is an incremental update to the labeled data submitted in #1653, which was based on data as of November 29, 2024.

Filter conditions: Collected by dbdb.io as of December 30, 2024 OR listed in the DB-Engines Ranking table on December 30, 2024; has an open source license; has a repository link on GitHub.
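For illustration, the filter conditions could be expressed as a predicate like the one below. This is only a sketch, not the code used in birdflyi/db_feature_data_fusion: the `DbmsRecord` class and all of its field names are hypothetical, and it assumes the conditions combine as (dbdb.io OR DB-Engines) AND license AND GitHub link.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DbmsRecord:
    """Hypothetical record shape; the field names are assumptions."""
    name: str
    in_dbdbio: bool                 # collected by dbdb.io on the snapshot date
    in_dbengines_ranking: bool      # present in the DB-Engines Ranking table
    has_open_source_license: bool
    github_repo: Optional[str]      # e.g. "owner/name", or None if not on GitHub


def passes_filter(rec: DbmsRecord) -> bool:
    # Assumed reading: listed in at least one source, AND open source,
    # AND has a GitHub repository link.
    listed = rec.in_dbdbio or rec.in_dbengines_ranking
    return listed and rec.has_open_source_license and bool(rec.github_repo)
```

A record that appears in only one of the two sources still passes, provided it has an open source license and a GitHub repository.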

Notes: The DBMS labeled dataset is updated incrementally each month at birdflyi/db_feature_data_fusion. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Document

Type: Tech-1

Repos:

  • Lona-Development/Server
  • PoloDB/PoloDB
  • capjamesg/jamesql

Label: Relational

Type: Tech-1

Repos:

  • darshan117/BroDB
  • greenplum-db/gpdb-archive

Label: Wide column

Type: Tech-1

Repos:

  • tidesdb/tidesdb
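The Label / Type / Repos layout above is regular enough to parse mechanically. The following is a minimal sketch of such a parser, not OpenDigger's actual implementation; the function name `parse_label_blocks` and the returned dictionary shape are assumptions made for this example.

```python
from typing import Dict, List


def parse_label_blocks(body: str) -> List[Dict]:
    """Split an issue body into blocks of {label, type, repos}."""
    blocks: List[Dict] = []
    current = None
    in_repos = False
    for raw in body.splitlines():
        line = raw.strip()
        if line.startswith("Label:"):
            # A new "Label:" line starts a new block.
            current = {"label": line[len("Label:"):].strip(),
                       "type": None, "repos": []}
            blocks.append(current)
            in_repos = False
        elif current is None:
            continue  # ignore free text before the first label
        elif line.startswith("Type:"):
            current["type"] = line[len("Type:"):].strip()
        elif line == "Repos:":
            in_repos = True
        elif in_repos and line[:1] in ("-", "*", "•"):
            # Accept the common bullet markers and keep the repo name.
            current["repos"].append(line[1:].strip())
    return blocks
```

Applied to the list above, this would yield three blocks (Document, Relational, Wide column), each with type `Tech-1` and its repository names.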
@github-actions github-actions bot added the waiting for repliers need other's feedback label Dec 31, 2024
@birdflyi
Contributor Author

/parse-github-id


Get repo and org/user ids done.

"### Description\n\nI want to add some labeled data to OpenDigger to support our community analysis.
The data comes from a dataset that fuses records from dbdb.io and DB-Engines as of December 30, 2024. It is an incremental update to the labeled data submitted in #1653, which was based on data as of November 29, 2024.

Filter conditions: Collected by dbdb.io as of December 30, 2024 OR listed in the DB-Engines Ranking table on December 30, 2024; has an open source license; has a repository link on GitHub.

Notes: The DBMS labeled dataset is updated incrementally each month at birdflyi/db_feature_data_fusion. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Document

Type: Tech-1

Repos:

- 750864017 # repo:Lona-Development/Server
- 284225188 # repo:PoloDB/PoloDB
- 844591313 # repo:capjamesg/jamesql

Label: Relational

Type: Tech-1

Repos:

- 824462634 # repo:darshan117/BroDB
- 805041952 # repo:greenplum-db/gpdb-archive

Label: Wide column

Type: Tech-1

Repos:

- 872645739 # repo:tidesdb/tidesdb

"
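Each resolved entry in the reply above has the form `- <numeric id> # repo:<owner>/<name>`. A small sketch of how such lines could be read back into structured values follows; the `ResolvedId` type and `parse_id_line` function are hypothetical names, and the assumption that org and user entries use the same `kind:name` comment format is not confirmed by this thread.

```python
import re
from typing import NamedTuple, Optional

# Matches lines such as "- 750864017 # repo:Lona-Development/Server".
ID_LINE = re.compile(r"^-\s*(\d+)\s*#\s*(\w+):(\S+)\s*$")


class ResolvedId(NamedTuple):
    github_id: int
    kind: str   # "repo" in this thread; other kinds are an assumption
    name: str


def parse_id_line(line: str) -> Optional[ResolvedId]:
    """Return the parsed entry, or None if the line is not an ID line."""
    m = ID_LINE.match(line.strip())
    if m is None:
        return None
    return ResolvedId(int(m.group(1)), m.group(2), m.group(3))
```

Lines that are not ID entries, such as `Label: Document`, return `None`, so the function can be mapped over the whole reply and the results filtered.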
