diff --git a/README.md b/README.md index 05d1c73..d56a644 100644 --- a/README.md +++ b/README.md @@ -43,16 +43,27 @@ Full documentation for all versions can be found [on the website](https://w3id.o - [https://doi.org/10.1007%2F978-3-030-49461-2_34](https://doi.org/10.1007%2F978-3-030-49461-2_34) - [https://yago-knowledge.org/downloads/yago-4](https://yago-knowledge.org/downloads/yago-4) - **Date Issued**: 2023-04-30 -- **Date Modified**: 2023-09-21 +- **Date Modified**: 2023-10-30 - **Landing page**: [yago-annotated-facts (dev)](https://w3id.org/riverbench/datasets/yago-annotated-facts/dev) - **Conforms To**: Metadata ([https://w3id.org/riverbench/schema/metadata](https://w3id.org/riverbench/schema/metadata)) ## Technical metadata -- **Has stream element type**: Triples ([rb:triples](https://w3id.org/riverbench/schema/metadata#triples)) +- **Has stream type usage**: + - **RDF stream type usage (1)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) + - **RDF stream type usage (2)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) - **Has stream element count**: 617,768 - **Has stream element split**: - **Type**: Stream elements split by topic ([rb:TopicStreamElementSplit](https://w3id.org/riverbench/schema/metadata#TopicStreamElementSplit)) + - **Has subject shape**: + - **Comment**: Custom target – subject of any quoted triple in the subject position. + - **Target custom**: YAGO annotated facts target ([rb:yagoTarget](https://w3id.org/riverbench/schema/metadata#yagoTarget)) - **Comment**: Every stream element corresponds to one Wikidata item. - **Uses ontology**: [http://schema.org/](http://schema.org/) - **Conforms to W3C RDF 1.1 specification**: no @@ -63,30 +74,42 @@ Full documentation for all versions can be found [on the website](https://w3id.o ## Distributions -### Full triple stream distribution +### Full stream distribution -- **Title**: Full triple stream distribution +- **Title**: Full stream distribution - **Identifier**: `stream-full` - **Has file name**: `stream_full.tar.gz` +- **Has stream type usage**: + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) - **Has distribution type**: - Full distribution ([rb:fullDistribution](https://w3id.org/riverbench/schema/metadata#fullDistribution)) - - Triple stream distribution ([rb:tripleStreamDistribution](https://w3id.org/riverbench/schema/metadata#tripleStreamDistribution)) + - Stream distribution ([rb:streamDistribution](https://w3id.org/riverbench/schema/metadata#streamDistribution)) - **Has stream element count**: 617,768 -- **Byte size**: 36.15 MB +- **Byte size**: 36.16 MB - **Media type**: text/turtle - **Packaging format**: application/tar - **Compression format**: application/gzip - **Download URL**: [https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_full.tar.gz](https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_full.tar.gz) -### Full triple stream distribution (Jelly) +### Full Jelly distribution -- **Title**: Full triple stream distribution (Jelly) +- **Title**: Full Jelly distribution - **Identifier**: `jelly-full` - **Has file name**: `jelly_full.jelly.gz` +- **Has stream type usage**: + - **RDF stream type usage (1)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) + - **RDF stream type usage (2)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) - **Has distribution type**: - Full distribution ([rb:fullDistribution](https://w3id.org/riverbench/schema/metadata#fullDistribution)) - Jelly distribution ([rb:jellyDistribution](https://w3id.org/riverbench/schema/metadata#jellyDistribution)) - - Triple stream distribution ([rb:tripleStreamDistribution](https://w3id.org/riverbench/schema/metadata#tripleStreamDistribution)) - **Has stream element count**: 617,768 - **Byte size**: 29.91 MB - **Media type**: application/x-jelly-rdf @@ -98,6 +121,10 @@ Full documentation for all versions can be found [on the website](https://w3id.o - **Title**: Full flat distribution - **Identifier**: `flat-full` - **Has file name**: `flat_full.nt.gz` +- **Has stream type usage**: + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) - **Has distribution type**: - Flat distribution ([rb:flatDistribution](https://w3id.org/riverbench/schema/metadata#flatDistribution)) - Full distribution ([rb:fullDistribution](https://w3id.org/riverbench/schema/metadata#fullDistribution)) @@ -107,14 +134,18 @@ Full documentation for all versions can be found [on the website](https://w3id.o - **Compression format**: application/gzip - **Download URL**: [https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_full.nt.gz](https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_full.nt.gz) -### 100K elements triple stream distribution +### 100K elements stream distribution -- **Title**: 100K elements triple stream distribution +- **Title**: 100K elements stream distribution - **Identifier**: `stream-100k` - **Has file name**: `stream_100K.tar.gz` +- **Has stream type usage**: + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) - **Has distribution type**: - Partial distribution ([rb:partialDistribution](https://w3id.org/riverbench/schema/metadata#partialDistribution)) - - Triple stream distribution ([rb:tripleStreamDistribution](https://w3id.org/riverbench/schema/metadata#tripleStreamDistribution)) + - Stream distribution ([rb:streamDistribution](https://w3id.org/riverbench/schema/metadata#streamDistribution)) - **Has stream element count**: 100,000 - **Byte size**: 3.57 MB - **Media type**: text/turtle @@ -122,15 +153,23 @@ Full documentation for all versions can be found [on the website](https://w3id.o - **Compression format**: application/gzip - **Download URL**: [https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_100K.tar.gz](https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_100K.tar.gz) -### 100K elements triple stream distribution (Jelly) +### 100K elements Jelly distribution -- **Title**: 100K elements triple stream distribution (Jelly) +- **Title**: 100K elements Jelly distribution - **Identifier**: `jelly-100k` - **Has file name**: `jelly_100K.jelly.gz` +- **Has stream type usage**: + - **RDF stream type usage (1)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) + - **RDF stream type usage (2)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) - **Has distribution type**: - Jelly distribution ([rb:jellyDistribution](https://w3id.org/riverbench/schema/metadata#jellyDistribution)) - Partial distribution ([rb:partialDistribution](https://w3id.org/riverbench/schema/metadata#partialDistribution)) - - Triple stream distribution ([rb:tripleStreamDistribution](https://w3id.org/riverbench/schema/metadata#tripleStreamDistribution)) - **Has stream element count**: 100,000 - **Byte size**: 2.98 MB - **Media type**: application/x-jelly-rdf @@ -142,6 +181,10 @@ Full documentation for all versions can be found [on the website](https://w3id.o - **Title**: 100K elements flat distribution - **Identifier**: `flat-100k` - **Has file name**: `flat_100K.nt.gz` +- **Has stream type usage**: + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) - **Has distribution type**: - Flat distribution ([rb:flatDistribution](https://w3id.org/riverbench/schema/metadata#flatDistribution)) - Partial distribution ([rb:partialDistribution](https://w3id.org/riverbench/schema/metadata#partialDistribution)) @@ -151,30 +194,42 @@ Full documentation for all versions can be found [on the website](https://w3id.o - **Compression format**: application/gzip - **Download URL**: [https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_100K.nt.gz](https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/flat_100K.nt.gz) -### 10K elements triple stream distribution +### 10K elements stream distribution -- **Title**: 10K elements triple stream distribution +- **Title**: 10K elements stream distribution - **Identifier**: `stream-10k` - **Has file name**: `stream_10K.tar.gz` +- **Has stream type usage**: + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) - **Has distribution type**: - Partial distribution ([rb:partialDistribution](https://w3id.org/riverbench/schema/metadata#partialDistribution)) - - Triple stream distribution ([rb:tripleStreamDistribution](https://w3id.org/riverbench/schema/metadata#tripleStreamDistribution)) + - Stream distribution ([rb:streamDistribution](https://w3id.org/riverbench/schema/metadata#streamDistribution)) - **Has stream element count**: 10,000 -- **Byte size**: 376.48 KB +- **Byte size**: 376.50 KB - **Media type**: text/turtle - **Packaging format**: application/tar - **Compression format**: application/gzip - **Download URL**: [https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_10K.tar.gz](https://w3id.org/riverbench/datasets/yago-annotated-facts/dev/files/stream_10K.tar.gz) -### 10K elements triple stream distribution (Jelly) +### 10K elements Jelly distribution -- **Title**: 10K elements triple stream distribution (Jelly) +- **Title**: 10K elements Jelly distribution - **Identifier**: `jelly-10k` - **Has file name**: `jelly_10K.jelly.gz` +- **Has stream type usage**: + - **RDF stream type usage (1)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a stream of graphs. Each graph corresponds to the RDF-star annotations of one Wikidata item. + - **Has stream type**: RDF subject graph stream ([stax:subjectGraphStream](https://w3id.org/stax/ontology#subjectGraphStream)) + - **RDF stream type usage (2)** + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) - **Has distribution type**: - Jelly distribution ([rb:jellyDistribution](https://w3id.org/riverbench/schema/metadata#jellyDistribution)) - Partial distribution ([rb:partialDistribution](https://w3id.org/riverbench/schema/metadata#partialDistribution)) - - Triple stream distribution ([rb:tripleStreamDistribution](https://w3id.org/riverbench/schema/metadata#tripleStreamDistribution)) - **Has stream element count**: 10,000 - **Byte size**: 301.51 KB - **Media type**: application/x-jelly-rdf @@ -186,6 +241,10 @@ Full documentation for all versions can be found [on the website](https://w3id.o - **Title**: 10K elements flat distribution - **Identifier**: `flat-10k` - **Has file name**: `flat_10K.nt.gz` +- **Has stream type usage**: + - **Type**: RDF stream type usage ([stax:RdfStreamTypeUsage](https://w3id.org/stax/ontology#RdfStreamTypeUsage)) + - **Comment**: The dataset can be viewed as a flattened stream of triples. + - **Has stream type**: Flat RDF triple stream ([stax:flatTripleStream](https://w3id.org/stax/ontology#flatTripleStream)) - **Has distribution type**: - Flat distribution ([rb:flatDistribution](https://w3id.org/riverbench/schema/metadata#flatDistribution)) - Partial distribution ([rb:partialDistribution](https://w3id.org/riverbench/schema/metadata#partialDistribution))