This dataset contains CF-labelled formulaic expressions (FEs) extracted from scholarly papers of four disciplines. The FEs were extracted with our proposed method.
The detail of this dataset was written in the following paper:
Iwatsuki, K., & Aizawa, A. (2021). Extraction of Formulaic Expressions from Scientific Papers. In Proceedings of the AAAI-21 Workshop on Scientific Document Understanding.
@inproceedings{Iwatsuki2021SDU,
author="Kenichi Iwatsuki and Akiko Aizawa",
title="Extraction of Formulaic Expressions from Scientific Papers",
booktitle="Proceedings of the AAAI-21 Workshop on Scientific Document Understanding",
year=2021,
}
The paper is available on CEUR-WS (http://ceur-ws.org/Vol-2831/paper6.pdf).
This dataset is licensed under CC BY-NC 4.0. It should be noted that the corpora we used were licensed under some licences that allowed us to use the whole texts of the papers.