TMU-GFM-Dataset

A dataset for GEC metrics with manual evaluations of grammaticality, fluency, and meaning preservation for system outputs.
More detail about the creation of the dataset can be found in Yoshimura et al. (2020).

File format

The are 9 columns in the tmu-gfm-dataset.

source: source sentence.
output: system output sentence.
grammer: Grammaticaliry annotations by 5 annotators.
fluency: Fluency annotations by 5 annotators.
meaning: Meaning Preservation annotations by 5 annotators.
system: Which system the output sentence is from.
ave_g: Average grammer score.
ave_f: Average fluency score.
ave_m: Average meaning score.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
tmu-gfm-dataset.csv		tmu-gfm-dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TMU-GFM-Dataset

File format

About

Releases

Packages

tmu-nlp/TMU-GFM-Dataset

Folders and files

Latest commit

History

Repository files navigation

TMU-GFM-Dataset

File format

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages