-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MW missing headword puzzle #221
Comments
A specific meta information example, root aṁśThe codings use the meta-iast forms under development. Coding with
Coding with
Note:
Note: |
Problems with above solutionThe current coding of nimF is:
I view this as a problem, but I don't have a clear conception of a solution. I don't know how many cases are 'like' this. We have all the update logs from back in 2012 on the Assuming we have identified the similar cases, should we resuscitate the original headwords and make I'll flag this comment as a 'bug' . Maybe sometime we'll get more data on the scope of the problem and |
Jim, you know I'm thrilled with verbs. I was not a aware of a genuine dhatu list, so you opened my eyes. I will want to reuse it my dhatu research as well.
Sure we want to. That is the only big excuse for using the digital version and not the original paper book - that it can fix what the Motilal reprinters will never do. |
I think this should be treated as an edge case, and does not require further analysis. Closing. |
Background
One of the goals of the MW meta/iast conversion is to improve the MW markup.
In the course of this, an odd aspect of the coding of the root
ni- √mṝ
came to the fore.Both the context leading to this oddity, and the somewhat unrelated nature of the oddity illuminate
interesting aspects of the construction of the MW digitization, past and current.
Context
Yesterday, I was working with the
<vlex>
tag. This tag was introduced in 2011. The objectivewas to identify with markup the parts of root records, so as to (a) know which headwords are roots and (b) to distinguish certain other aspects of root records (class-pada usage information, important for inflected forms; is root a prefixed root, or not; is root classified as a Denominative).
So, we wanted to mine MW for information about roots, and used the addition of a
<vlex>
tag asa means to narrow the problem. The
<root>
tag also played a part here.Later work then parsed this information and simplified it; for instance a listing of roots with class-pada information was developed.
Meta information
It now seems to me that it is best to think of this markup (using vlex and root tags) as an intermediate
step in generating meta-information about the MW dictionary. For instance it seems better to add the summarized class-pada information as explicitly meta information, rather to leave it in the dictionary
in implicit form involving the vlex and root tags.
The text was updated successfully, but these errors were encountered: