tmu-nlp/TMU-GFM-Dataset

TMU-GFM-Dataset

A dataset for GEC metrics with manual evaluations of grammaticality, fluency, and meaning preservation for system outputs.
More details about the creation of the dataset can be found in Yoshimura et al. (2020).

File format

There are 9 columns in the tmu-gfm-dataset.

  1. source: source sentence.
  2. output: system output sentence.
  3. grammer: Grammaticality annotations by 5 annotators.
  4. fluency: Fluency annotations by 5 annotators.
  5. meaning: Meaning Preservation annotations by 5 annotators.
  6. system: Which system the output sentence is from.
  7. ave_g: Average grammaticality score.
  8. ave_f: Average fluency score.
  9. ave_m: Average meaning score.
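
The column layout above could be read along the following lines. This is only a sketch under several assumptions that the README does not confirm: that the file is a CSV with a header row, and that each of the three annotation columns stores its five scores as a list-like string. The sample row and the system name `system_a` are made up for illustration and are not taken from the dataset. Note that the column name `grammer` is kept as spelled in the dataset.

```python
import csv
import io
import re
from statistics import mean

# Column names as listed above ("grammer" is the dataset's own spelling).
COLUMNS = ["source", "output", "grammer", "fluency", "meaning",
           "system", "ave_g", "ave_f", "ave_m"]
ANNOTATION_COLUMNS = ("grammer", "fluency", "meaning")
AVERAGE_COLUMNS = ("ave_g", "ave_f", "ave_m")

# Made-up sample for illustration only; the real serialization may differ.
SAMPLE = (
    "source,output,grammer,fluency,meaning,system,ave_g,ave_f,ave_m\n"
    "He go to school .,He goes to school .,"
    '"[4, 4, 3, 4, 4]","[4, 3, 3, 4, 4]","[4, 4, 4, 4, 4]",'
    "system_a,3.8,3.6,4.0\n"
)


def parse_scores(cell):
    """Pull every number out of a cell, regardless of brackets or delimiters."""
    return [float(x) for x in re.findall(r"\d+(?:\.\d+)?", cell)]


def read_rows(handle):
    """Read rows from a CSV file handle and convert score columns to numbers."""
    reader = csv.DictReader(handle)
    missing = set(COLUMNS) - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    rows = []
    for row in reader:
        for col in ANNOTATION_COLUMNS:
            row[col] = parse_scores(row[col])
        for col in AVERAGE_COLUMNS:
            row[col] = float(row[col])
        rows.append(row)
    return rows


rows = read_rows(io.StringIO(SAMPLE))
first = rows[0]
# Compare the stored average with the mean of the five annotator scores.
print(first["system"], mean(first["grammer"]), first["ave_g"])
```

To read a real file, pass an open file handle to `read_rows` in place of the in-memory sample, and adjust `parse_scores` if the annotation columns turn out to be serialized differently.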
