-
Notifications
You must be signed in to change notification settings - Fork 9
/
README
105 lines (75 loc) · 3.5 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
Kazakh: `apertium-kaz`
===============================================================================
This is an Apertium monolingual language package for Kazakh. What
you can use this language package for:
* Morphological analysis of Kazakh
* Morphological generation of Kazakh
* Part-of-speech tagging of Kazakh
Requirements
===============================================================================
You will need the following software installed:
* lttoolbox (>= 3.3.0)
* apertium (>= 3.3.0)
* vislcg3 (>= 0.9.9.10297)
If this does not make any sense, we recommend you look at: apertium.org
Compiling
===============================================================================
Given the requirements being installed, you should be able to just run:
```console
$ ./configure
$ make
```
You can use ./autogen.sh instead of ./configure if you're compiling
from SVN.
If you're doing development, you don't have to install the data, you
can use it directly from this directory.
If you are installing this language package as a prerequisite for an
Apertium translation pair, then do (typically as root / with sudo):
```console
# make install
```
You can give a `--prefix` to `./configure` to install as a non-root user,
but make sure to use the same prefix when installing the translation
pair and any other language packages.
Testing
===============================================================================
If you are in the source directory after running make, the following
commands should work:
$ echo "Сәлем!" | apertium -d . kaz-morph
^Сәлем/сәлем<ij>/сәлем<n><nom>/сәлем<n><attr>/
сәлем<n><nom>+е<cop><aor><p3><pl>/сәлем<n><nom>+е<cop><aor><p3><sg>$
^!/!<sent>$^./.<sent>$
$ echo "Оқу инемен құдық қазғандай." | apertium -d . kaz-tagger
^Оқу/оқу<adj>$ ^инемен/ине<n><ins>$ ^құдық/құдық<n><nom>$
^қазғандай/қаз<v><tv><ger_past><sim>$^./.<sent>$^./.<sent>$
Files and data
===============================================================================
* apertium-kaz.kaz.lexc - Monolingual dictionary
* apertium-kaz.kaz.twol - Morphophonological rules
* apertium-kaz.kaz.err.twol -
* apertium-kaz.kaz.guesser.twol -
* kaz.prob - Tagger model
* apertium-kaz.kaz.rlx - Constraint Grammar disambiguation rules
* apertium-kaz.post-kaz.dix - Post-generator
* apertium-kaz.kaz.mtx -
* apertium-kaz.kaz.tsx -
* apertium-kaz.kaz.udx -
* modes.xml - Translation modes
For more information
===============================================================================
* https://wiki.apertium.org/wiki/Installation
* https://wiki.apertium.org/wiki/apertium-kaz
* https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary
Help and support
===============================================================================
If you need help using this language pair or data, you can contact:
* Mailing list: apertium-stuff@lists.sourceforge.net
* IRC: #apertium on irc.oftc.net
See also the file AUTHORS included in this distribution.
Acknowledgements
================
If you use this in your work, please cite:
* Washington, J. N., Salimzyanov, I., and Tyers, F. M. (2014) "Finite-state
morphological transducers for three Kypchak languages". Proceedings of the 9th
Conference on Language Resources and Evaluation, LREC2014.
http://www.lrec-conf.org/proceedings/lrec2014/summaries/1207.html