- Some lexemes have multiple paradigms. A paradigm is identified by the combination of "declination type" + "option_number" in the fmsynth output.
- Currently, the option number is ignored during import, so paradigms that share a declination type cannot be told apart. It has to be imported as well, with a caveat:
- For some lexemes, fmsynth reports the same paradigm twice (see "saks"). Duplicate paradigms must be skipped during import.
- The "initial forms" could be parsed from the article and compared with the fmsynth-provided paradigms; paradigms that do not match them must be ignored.
- This should yield much safer articleForm sets.
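
The import rules above could be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the `Paradigm` class and all function names are hypothetical, and it assumes that two paradigms count as duplicates when their declination type and set of forms are identical (even if the option number differs), and that a paradigm is "compliant" when every initial form parsed from the article occurs among its forms.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List


@dataclass(frozen=True)
class Paradigm:
    """Hypothetical record for one paradigm parsed from fmsynth output."""
    declination_type: str
    option_number: int
    forms: FrozenSet[str]


def dedupe_paradigms(paradigms: Iterable[Paradigm]) -> List[Paradigm]:
    """Keep the first occurrence of each paradigm, preserving order.

    Assumption: duplicates share the declination type and the full form set.
    """
    seen = set()
    unique = []
    for paradigm in paradigms:
        key = (paradigm.declination_type, paradigm.forms)
        if key in seen:
            continue  # same paradigm reported twice: skip it
        seen.add(key)
        unique.append(paradigm)
    return unique


def filter_by_initial_forms(
    paradigms: Iterable[Paradigm], initial_forms: Iterable[str]
) -> List[Paradigm]:
    """Drop paradigms that lack any of the article's initial forms."""
    wanted = set(initial_forms)
    return [p for p in paradigms if wanted <= p.forms]


def select_paradigms(
    paradigms: Iterable[Paradigm], initial_forms: Iterable[str]
) -> List[Paradigm]:
    """Deduplicate first, then keep only article-compliant paradigms."""
    return filter_by_initial_forms(dedupe_paradigms(paradigms), initial_forms)
```

Deduplicating before filtering keeps the two concerns separate, so either step can be tightened later (for example, a stricter notion of compliance) without touching the other.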
62mkv/estonian-forms
Simple one-off script to publish Estonian language lexemes at Wikidata