RuTaBERT is a model for solving the Column Type Annotation problem using a pre-trained large language model (BERT), trained on a Russian corpus.
Updated Dec 24, 2024 - Python
The CoLeM framework is a table model based on contrastive learning techniques for solving the Column Type Annotation problem.
A dataset from the authors of RuTaBERT, based on Russian Web Tables. Only relational tables whose headers match the authors' 170 selected DBpedia semantic types were kept from Russian Web Tables.
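The header-matching selection described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the type set, the `normalize` rule, and the helper names `label_columns` and `filter_tables` are all hypothetical, and it assumes a table is kept when at least one header matches a known type.

```python
from typing import Dict, List, Set

# Hypothetical stand-in for the 170 DBpedia semantic types used by the dataset.
SEMANTIC_TYPES: Set[str] = {"name", "country", "year"}

Table = Dict[str, List[str]]  # header -> column cell values


def normalize(header: str) -> str:
    """Assumed normalization: trim whitespace and lowercase the header."""
    return header.strip().lower()


def label_columns(table: Table, types: Set[str] = SEMANTIC_TYPES) -> Dict[str, str]:
    """Map each header that matches a known semantic type to that type."""
    return {h: normalize(h) for h in table if normalize(h) in types}


def filter_tables(tables: List[Table], types: Set[str] = SEMANTIC_TYPES) -> List[Table]:
    """Keep only tables with at least one header matching a semantic type."""
    return [t for t in tables if label_columns(t, types)]
```

Here the matched header doubles as the column's label, which is one simple way to obtain annotations for Column Type Annotation without manual labeling; the real dataset may apply stricter rules.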