Releases: impresso/CLEF-HIPE-2020
Releases · impresso/CLEF-HIPE-2020
Release of (masked) test data v1.3
Masked test data for task bundle 5
Release of (masked) test data v1.2
test-masked-v1.2 updated data/README.md
Release v1.2
Correction and additional data for French and German.
Release v1.1
Data release v1.1 fixes a problem in German validation set (issue #5), as well as with escaping of double quotes in the TSV export.
Release v1.0
The data consists of historical newspaper articles in French, German and
American English originating from Swiss, Luxembourgish and American digitized
newspaper archives and selected on a diachronic basis. The time span of the
whole corpus goes from 1798 until 2018.
Data release v1.0 contains training and dev sets for
French and German, and dev set for English (in total ca 24k mentions and linked entities).
Release of sample data (2020-01-10)
adding de sample data