[Batch Label Data] Add more label data for Database technical area labeled on dbdb.io and DB-Engines Ranking by Sep 26, 2023. #1393

Closed
birdflyi opened this issue Sep 26, 2023 · 3 comments · Fixed by #1394
Assignees
Labels
waiting for repliers need other's feedback

Comments

@birdflyi
Contributor

Description

I want to add some labeled data to OpenDigger to support our community analysis.
The data comes from a dataset that fuses data from dbdb.io and DB-Engines as of Sep 26, 2023. It is an incremental update to the labeled data submitted in #1376, which was based on data as of Aug 26, 2023.

Filter conditions: collected by dbdb.io on Sep 26, 2023 OR ranked in the DB-Engines Ranking table on Sep 26, 2023; has an open source license; has a repository link on GitHub.
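
The filter conditions above can be read as a boolean predicate. The following is a minimal sketch, assuming the semicolons mean AND and assuming a hypothetical record layout: the class `DbmsRecord` and its field names are illustrative only and are not taken from the actual fused dataset.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DbmsRecord:
    # Hypothetical fields; the real fused dataset may use different names.
    name: str
    in_dbdb: bool                        # collected by dbdb.io on the snapshot date
    in_db_engines: bool                  # present in the DB-Engines Ranking table
    open_source_license: Optional[str]   # license name, or None if not open source
    github_repo: Optional[str]           # "owner/name", or None if no GitHub link


def passes_filter(rec: DbmsRecord) -> bool:
    """(dbdb.io OR DB-Engines) AND open source license AND GitHub repository."""
    listed = rec.in_dbdb or rec.in_db_engines
    return listed and bool(rec.open_source_license) and bool(rec.github_repo)


# Made-up records, only to exercise the predicate.
records = [
    DbmsRecord("ExampleA", True, False, "Apache-2.0", "example/a"),
    DbmsRecord("ExampleB", False, True, None, "example/b"),
    DbmsRecord("ExampleC", False, False, "MIT", "example/c"),
]
selected = [r.name for r in records if passes_filter(r)]
```

Only `ExampleA` satisfies all three conditions here: `ExampleB` has no license and `ExampleC` is listed in neither source.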

Notes: The DBMS labeled dataset is updated incrementally each month at birdflyi/db_feature_data_fusion. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.
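
For readers who want to produce a body in the same shape, here is a rough sketch of rendering the `Label` / `Type` / `Repos` blocks from a mapping. It is not the actual auto-generation widget mentioned above; the function name `render_issue_body` and its parameters are hypothetical.

```python
from typing import Dict, List


def render_issue_body(labels: Dict[str, List[str]], label_type: str = "Tech-1") -> str:
    """Render one 'Label / Type / Repos' block per label, separated by blank lines."""
    blocks = []
    for label, repos in labels.items():
        lines = [f"Label: {label}", "", f"Type: {label_type}", "", "Repos:", ""]
        # One bullet per repository, in sorted order for a stable output.
        lines += [f"- {repo}" for repo in sorted(repos)]
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks)


body = render_issue_body({"Relational": ["proullon/ramsql", "endatabas/endb"]})
print(body)
```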

Label: Array

Type: Tech-1

Repos:

  • epsilla-cloud/vectordb

Label: Document

Type: Tech-1

Repos:

  • awa-ai/awadb
  • endatabas/endb
  • jdagdelen/hyperDB
  • jina-ai/vectordb
  • kagisearch/vectordb
  • marqo-ai/marqo
  • resilientdb/resilientdb
  • vearch/vearch
  • vector5ai/vector5db

Label: Object Oriented

Type: Tech-1

Repos:

  • ADBSQL/AntDB
  • Bobris/BTDB
  • CUBRID/cubrid
  • CondensationDB/Condensation-java
  • DevrexLabs/OrigoDB
  • HydrasDB/hydra
  • ModeShape/modeshape
  • SapphireDb/SapphireDb
  • The-Alchemist/perst
  • VeloxDB/VeloxDB
  • apache/jackrabbit
  • atoti/atoti
  • authzed/spicedb
  • devrexlabs/memstate
  • edgedb/edgedb
  • etoile/CoreObject
  • fern4lvarez/piladb
  • gaia-platform/GaiaPlatform
  • iboxdb/db4o-gpl
  • jankotek/mapdb
  • kimchy/compass
  • markmeeus/MarcelloDB
  • morecraf/Siaqodb
  • neondatabase/neon
  • objectbox/objectbox-java
  • orientechnologies/orientdb
  • orioledb/orioledb
  • pilgr/Paper
  • pipelinedb/pipelinedb
  • postgres/postgres
  • realm/realm-core
  • tzaeschke/zoodb
  • zhihu/Matisse
  • zopefoundation/ZODB

Label: Relational

Type: Tech-1

Repos:

  • endatabas/endb
  • erikgrinaker/toydb
  • proullon/ramsql

Label: Search Engine

Type: Tech-1

Repos:

  • apache/solr
  • elastic/elasticsearch
  • manticoresoftware/manticoresearch
  • marqo-ai/marqo
  • meilisearch/meilisearch
  • opensearch-project/OpenSearch
  • sphinxsearch/sphinx
  • typesense/typesense
  • vespa-engine/vespa
  • xapian/xapian

Label: Vector

Type: Tech-1

Repos:

  • awa-ai/awadb
  • featureform/embeddinghub
  • jdagdelen/hyperDB
  • jina-ai/vectordb
  • kagisearch/vectordb
  • marekgalovic/anndb
  • marqo-ai/marqo
  • nuclia/nucliadb
  • pilosa/pilosa
  • vearch/vearch
  • vector5ai/vector5db
@birdflyi
Contributor Author

/parse-github-id

@github-actions github-actions bot added the waiting for repliers need other's feedback label Sep 26, 2023
@github-actions

Get repo and org/user ids done.

"### Description\n\nI want to add some labeled data into OpenDigger to help us for our community analysis.
The data is based on a dataset fused by data from dbdb.io and DB-Engines by Sep 26, 2023. It is an incremental version of labeled data submited in #1376, which is based on data by Aug 26, 2023.

Filter conditions: Collected by dbdb.io on Sep 26, 2023 OR Rankings in the DB-Engines Rankings table on Sep 26, 2023; Has open source license; Has repository link on GitHub.

Notes: The DBMS labeled dataset will keep updating incrementally at birdflyi/db_feature_data_fusion each month. The list below is auto-generated by wiget_autogen_issue_body_for_opendigger_submiting_labeled_data_issue.

Label: Array

Type: Tech-1

Repos:

- 664133375 # repo:epsilla-cloud/vectordb

Label: Document

Type: Tech-1

Repos:

- 642912355 # repo:awa-ai/awadb
- 587246478 # repo:endatabas/endb
- 628503457 # repo:jdagdelen/hyperDB
- 635374751 # repo:jina-ai/vectordb
- 632269609 # repo:kagisearch/vectordb
- 520096046 # repo:marqo-ai/marqo
- 223462217 # repo:resilientdb/resilientdb
- 186332888 # repo:vearch/vearch
- 634656527 # repo:vector5ai/vector5db

Label: Object Oriented

Type: Tech-1

Repos:

- ADBSQL/AntDB # not found
- 681142 # repo:Bobris/BTDB
- 52080367 # repo:CUBRID/cubrid
- 329729926 # repo:CondensationDB/Condensation-java
- 8799170 # repo:DevrexLabs/OrigoDB
- 516821813 # repo:HydrasDB/hydra
- 1244027 # repo:ModeShape/modeshape
- 150752008 # repo:SapphireDb/SapphireDb
- 124423054 # repo:The-Alchemist/perst
- 569871553 # repo:VeloxDB/VeloxDB
- 206403 # repo:apache/jackrabbit
- 246343828 # repo:atoti/atoti
- 396856161 # repo:authzed/spicedb
- 95070401 # repo:devrexlabs/memstate
- 95817032 # repo:edgedb/edgedb
- 15001136 # repo:etoile/CoreObject
- 42143916 # repo:fern4lvarez/piladb
- 240387847 # repo:gaia-platform/GaiaPlatform
- 204261394 # repo:iboxdb/db4o-gpl
- 5453989 # repo:jankotek/mapdb
- 1776883 # repo:kimchy/compass
- 25225465 # repo:markmeeus/MarcelloDB
- 273564373 # repo:morecraf/Siaqodb
- 351806852 # repo:neondatabase/neon
- 79901405 # repo:objectbox/objectbox-java
- 7083240 # repo:orientechnologies/orientdb
- 432844875 # repo:orioledb/orioledb
- 37285717 # repo:pilgr/Paper
- 14702444 # repo:pipelinedb/pipelinedb
- 927442 # repo:postgres/postgres
- 1917262 # repo:realm/realm-core
- 3893984 # repo:tzaeschke/zoodb
- 88111990 # repo:zhihu/Matisse
- 7357595 # repo:zopefoundation/ZODB

Label: Relational

Type: Tech-1

Repos:

- 587246478 # repo:endatabas/endb
- 183929744 # repo:erikgrinaker/toydb
- 26774602 # repo:proullon/ramsql

Label: Search Engine

Type: Tech-1

Repos:

- 341374920 # repo:apache/solr
- 507775 # repo:elastic/elasticsearch
- 95614931 # repo:manticoresoftware/manticoresearch
- 520096046 # repo:marqo-ai/marqo
- 130688011 # repo:meilisearch/meilisearch
- 334274271 # repo:opensearch-project/OpenSearch
- 36992044 # repo:sphinxsearch/sphinx
- 79317191 # repo:typesense/typesense
- 60377070 # repo:vespa-engine/vespa
- 735981 # repo:xapian/xapian

Label: Vector

Type: Tech-1

Repos:

- 642912355 # repo:awa-ai/awadb
- 304530333 # repo:featureform/embeddinghub
- 628503457 # repo:jdagdelen/hyperDB
- 635374751 # repo:jina-ai/vectordb
- 632269609 # repo:kagisearch/vectordb
- 242383787 # repo:marekgalovic/anndb
- 520096046 # repo:marqo-ai/marqo
- 478288303 # repo:nuclia/nucliadb
- 40127179 # repo:pilosa/pilosa
- 186332888 # repo:vearch/vearch
- 634656527 # repo:vector5ai/vector5db

"
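
The bot output above uses two line shapes: `- <id> # repo:<owner>/<name>` for resolved repositories and `- <owner>/<name> # not found` for unresolved ones. A minimal sketch for splitting such lines into the two groups follows; the helper `parse_repo_lines` is hypothetical and is not part of OpenDigger.

```python
import re

# Matches "- <head> # <note>", where <head> is either a numeric id or "owner/name".
LINE_RE = re.compile(r"^- (?P<head>\S+) # (?P<note>.+)$")


def parse_repo_lines(text):
    """Return ({repo_name: repo_id}, [unresolved_repo_names])."""
    resolved = {}
    unresolved = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip headings, blank lines, and prose
        head, note = match.group("head"), match.group("note")
        if head.isdigit() and note.startswith("repo:"):
            resolved[note[len("repo:"):]] = int(head)
        elif note == "not found":
            unresolved.append(head)
    return resolved, unresolved


sample = """\
- ADBSQL/AntDB # not found
- 681142 # repo:Bobris/BTDB
- 52080367 # repo:CUBRID/cubrid
"""
resolved, unresolved = parse_repo_lines(sample)
```

Collecting the unresolved names separately makes it easy to spot entries such as `ADBSQL/AntDB` that need a manual check before the data is merged.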

@birdflyi
Contributor Author

/self-assign
