This is an Apertium monolingual language package for Dizhsa, or Western Tlacolula Valley Zapotec, with a focus on the variety of San Lucas Quiaviní.
What you can use this language package for:
- Morphological analysis of Western Tlacolula Valley Zapotec
- Morphological generation of Western Tlacolula Valley Zapotec
- Part-of-speech tagging of Western Tlacolula Valley Zapotec
We will be translating this page into Dizhsa and Spanish.
Guelaguetza clause: While this resource is licensed under a free/open-source license that permits commercial use, anything created using it is expected to be made available to the community of San Lucas Quiaviní free of charge. This is consistent with the community's practice of guelaguetza, a complex system of reciprocity and exchange of goods and labor.
To compile these resources, you will need the following software installed:
- lttoolbox (>= 3.3.0)
- apertium (>= 3.3.0)
- vislcg3 (>= 0.9.9.10297)
- hfst (>= 3.8.2)
- lexd (>= 1.0.0)
If any of this is unfamiliar, we recommend starting at apertium.org, and in particular the installation guide at https://wiki.apertium.org/wiki/Installation.
With the requirements installed, you should be able to just run:
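The minimum versions listed above can be compared against an installed version with a small helper. The following is a minimal sketch, not part of this package: the function name `version_ge` is hypothetical, and it assumes a `sort` that supports `-V` (version sort), as GNU coreutils does. How you obtain the installed version string depends on your system, so it is passed in as an argument here.

```shell
# Hypothetical helper: succeed if version $1 is at least version $2.
# Assumes `sort -V` (version sort) is available, as in GNU coreutils.
version_ge() {
  # The smaller of the two versions sorts first; if that is the
  # required minimum ($2), then $1 is equal to or newer than it.
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}
```

For example, `version_ge 3.8.2 3.8.2` succeeds, while `version_ge 3.2.0 3.3.0` fails.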
$ ./configure
$ make
You can use ./autogen.sh instead of ./configure if you're compiling from source.
If you're doing development, you don't have to install the data; you can use it directly from this directory.
If you are installing this language package as a prerequisite for an Apertium translation pair, then do (typically as root / with sudo):
$ make install
You can use a --prefix with ./configure to install as a non-root user, but make sure to use the same prefix when installing the translation pair and any other language packages.
If you are in the source directory after running make, the following commands should work for analysis and tagging:
$ echo "Yseidylayu naa Dizhsa." | apertium -d . zab-morph
^Yseidylayu/seidy<v><tv><pol><irre>+yu<prn><pers><p2><frm><sg>/seidy<v><dtv><pol><irre>+yu<prn><pers><p2><frm><sg>$ ^naa/naa<prn><pers><p1><sg>$ ^Dizhsa/Dizhsa<n>$^../..<sent>$
$ echo "Yseidylayu naa Dizhsa." | apertium -d . zab-tagger
^Yseidylayu/Seidy<v><dtv><pol><irre>+yu<prn><pers><p2><frm><sg>$ ^naa/naa<prn><pers><p1><sg>$ ^Dizhsa/Dizhsa<n>$^../..<sent>$
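The output above is in the Apertium stream format, where each lexical unit is written as `^surface/analysis1/analysis2$`. As a rough illustration of how to read it, the sketch below splits a copy of the zab-morph output shown above into one unit per line and lists the surface forms that still have more than one analysis. The function names `split_units` and `ambiguous_forms` are hypothetical, and the sketch assumes a `grep` that supports `-o` (GNU and BSD grep both do) and that no surface form or tag contains a literal `/` or `$`.

```shell
# Sample copied from the zab-morph example above.
sample='^Yseidylayu/seidy<v><tv><pol><irre>+yu<prn><pers><p2><frm><sg>/seidy<v><dtv><pol><irre>+yu<prn><pers><p2><frm><sg>$ ^naa/naa<prn><pers><p1><sg>$ ^Dizhsa/Dizhsa<n>$^../..<sent>$'

# Hypothetical helper: print each lexical unit (^...$) on its own line.
# Assumes `grep -o`, which is not in POSIX but is widely available.
split_units() {
  grep -o '\^[^$]*\$'
}

# Hypothetical helper: print surface forms with more than one analysis.
# Splitting on "/" gives the surface form plus one field per analysis,
# so more than two fields means the form is still ambiguous.
ambiguous_forms() {
  split_units | awk -F'/' 'NF > 2 { print substr($1, 2) }'
}

printf '%s\n' "$sample" | ambiguous_forms
```

Piping the zab-tagger output through the same function should print nothing for this sentence, since the tagger keeps a single analysis per form.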
There are also orthography-specific generation modes:
$ echo "^becw<n><px>+u<prn><pers><p2><infm><sg>$" | apertium -d . -f none zab-gener
xyecwu
$ echo "^becw<n><px>+u<prn><pers><p2><infm><sg>$" | apertium -d . -f none zab_Phon-gener
x꞉yèeʼcwùuʼ
Furthermore, there are orthography conversion modes:
$ echo "Yseidylayu naa Dizhsa." | apertium -d . zab_Simp-zab_Phon
Ysèiʼdylàaʼyuuʼ nàaʼ Dìiʼzhsah.
$ echo "Ysèiʼdylàaʼyuuʼ nàaʼ Dìiʼzhsah." | apertium -d . zab_Phon-zab_Simp
Yseidylayu naa Dizhsa.
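The two conversion modes can be chained to check whether a sentence survives a round trip between orthographies. This is a minimal sketch that uses only the mode names shown above; the function name `roundtrip_ok` is hypothetical, and it assumes you are in the source directory after running make. A mismatch is not necessarily a bug: the simple orthography leaves out distinctions that the phonemic one marks, so some words may not convert back identically.

```shell
# Hypothetical helper: convert simple orthography to phonemic and back,
# and succeed only if the result equals the original text.
roundtrip_ok() {
  text=$1
  phon=$(printf '%s\n' "$text" | apertium -d . zab_Simp-zab_Phon)
  back=$(printf '%s\n' "$phon" | apertium -d . zab_Phon-zab_Simp)
  if [ "$back" = "$text" ]; then
    return 0
  fi
  # Report the three stages on stderr so the difference can be inspected.
  printf 'mismatch: %s -> %s -> %s\n' "$text" "$phon" "$back" >&2
  return 1
}
```

It could be called as `roundtrip_ok "Yseidylayu naa Dizhsa."` and its exit status checked.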
- apertium-zab.zab.lexd: Morphotactic dictionary
- apertium-zab.zab.twol: Morphophonological rules
- apertium-zab.zab.rlx: Constraint Grammar disambiguation rules
- apertium-zab.zab.spellrelax: Rules for accepting orthographic variants
- apertium-zab.post-zab.dix: Post-generator
- zab.prob: Tagger model
- modes.xml: Translation modes
- https://wiki.apertium.org/wiki/Installation
- https://wiki.apertium.org/wiki/apertium-zab
- https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary
When referencing this work in an academic publication, we ask that you cite the following source:
- Washington, Jonathan N., Felipe H. Lopez, and Brook Danielle Lillehaugen (2021). “Towards a morphological transducer and orthography converter for Western Tlacolula Valley Zapotec”. In: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP), pp. 185–193. URL: https://www.aclweb.org/anthology/2021.americasnlp-1.21.
If you need help using this language package or data, you can contact:
- Mailing list: apertium-stuff@lists.sourceforge.net
- IRC: #apertium on irc.oftc.net (irc://irc.oftc.net/#apertium)
See also the file AUTHORS, included in this distribution.