Skip to content

RiverBench/dataset-yago-annotated-facts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 

Repository files navigation

.github/workflows/release.yaml

Dataset: yago-annotated-facts (development version)

This is a subset of the YAGO 4 knowledge base (paper), based on Wikidata, version from February 24, 2020. This dataset includes only the fact annotations in RDF-star, that is facts about facts. Each stream element corresponds to one item in Wikidata.

This README is a snapshot of documentation for the latest development version of the dataset. Full documentation for all versions can be found on the website.

General information

Technical metadata

  • Has stream type usage:
    • RDF stream type usage (1)
    • RDF stream type usage (2)
      • Type: RDF stream type usage (stax:RdfStreamTypeUsage)
      • Comment: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. (en)
      • Has stream type: RDF subject graph stream (stax:subjectGraphStream)
  • Has stream element count: 617,768
  • Has stream element split:
    • Type: Stream elements split by topic (rb:TopicStreamElementSplit)
    • Comment: Every stream element corresponds to one Wikidata item. (en)
    • Has subject shape:
      • Comment: Custom target – subject of any quoted triple in the subject position. (en)
      • Target custom: YAGO annotated facts target (rb:yagoTarget)
  • Uses vocabulary: http://schema.org/
  • Conforms to W3C RDF 1.1 specification: no
  • Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
  • Uses generalized triples: no
  • Uses generalized RDF datasets: no
  • Uses RDF-star: yes

Distributions

Full stream distribution

Full Jelly distribution

Full flat distribution

100K elements stream distribution

100K elements Jelly distribution

100K elements flat distribution

10K elements stream distribution

10K elements Jelly distribution

10K elements flat distribution