Skip to content

Release v1.0

Compare
Choose a tag to compare
@mromanello mromanello released this 31 Mar 06:56
· 48 commits to master since this release

The data consists of historical newspaper articles in French, German and
American English originating from Swiss, Luxembourgish and American digitized
newspaper archives and selected on a diachronic basis. The time span of the
whole corpus goes from 1798 until 2018.

Data release v1.0 contains training and dev sets for
French and German, and dev set for English (in total ca 24k mentions and linked entities).