Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 635 Bytes

README.md

File metadata and controls

17 lines (13 loc) · 635 Bytes

A Dependency Treebank for Classical Arabic Poetry (ArPoT_v1.0)

البنك الشجري الاعتمادي للشعر العربي الفصيح

This repository introduces the first syntactically annotated corpus for Classical Arabic poetry (ArPoT_v1.0). The Treebank presented in this paper: https://aclanthology.org/2021.depling-1.1.pdf

ArPoT_V1.0 contains CONLLU files for:

Training portion: arpot_train_v1.0.conllu Developmen portion: arpot_dev_v1.0.conllu Testing portion: arpot_test_v1.0.conllu

Total number of tokens: 35459

  • training: 28506 tokens
  • development: 2771 tokens
  • testing: 4182 tokens