Skip to content

Commit

Permalink
updated data/README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Matteo Romanello committed May 25, 2020
1 parent 2baec48 commit 5ecfc21
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions data/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
This folder contains the data releases relative to the [CLEF-HIPE shared task](https://impresso.github.io/CLEF-HIPE-2020/) on NERC and EL on historical newspapers. Please note that these datasets are not yet in their final versions but will evolve until end of spring 2020 approximately.


- **test-masked-v1.2** (released 25.05.2020): masked test dataset for evaluation of system runs for task bundles 1-4.
- **training-v1.2** (released 12.10.2020): fourth version of training and dev datasets for HIP. Main changes are: additional data for French and German.
- **training-v1.1** (released 7.04.2020): this release fixes a problem in the German validation set (see issue [#5](https://github.com/impresso/CLEF-HIPE-2020/issues/5)), as well as with escaping double quotes in the TSV exports.
- **training-v1.0** (released 26.03.2020): second version of training and dev datasets for German and French, and of dev dataset for English (there won't be training data for English). Main changes are quantitative (more documents and therefore more mentions and linked entities). Foreseen release v1.1 beginning of april with increased quality.
Expand Down

0 comments on commit 5ecfc21

Please sign in to comment.