-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AE corrections by page #340
Comments
Note: this was developed on local computer, and then uploaded to Cologne. I think it works on Cologne as locally, but have not checked. Thanks in advance for any feedback. |
There is a lot of space around. Why not show the scanned page on the same
page? This will empower reading of scanned and digitized things side by
side.
|
Wasted space, agree.
Agree, that's similar to https://en.wikisource.org/wiki/Page:Sanskrit_Grammar_by_Whitney_p1.djvu/103 but even more advanced because of the editing in WYSIWYG mode. And before hand cleaning I would propose to do some regex cleanup.
The What about extracting all devanagari words and comparing to MW, never done before, right? Should we not work with this good looking UI after the big dirt is out? I think it will give more fruit, if we use the old batch methods to weed out the hundreds of spelling mistakes out there. |
This WAS done (and in fact comparisons made to all dictionary headwords) in the prior step of correction (see #318). That step got around 1500 corrections, as I recall. |
Good observation. This is always 'causal'. Will put it on todo list. |
The wasted space is not a big deal here. Since we are dealing with just one scan page at a time, the user can click to open that page in a separate window, do the scan enlargement, and be set for all the cases in the batch. |
Sampada's begun working on the pages, and I've done a few. It seems to take about 30-45 min per page. On the first 9 pages, there were 67 corrections -- that would If The UI should already support Devanagari, but only limited testing has been done with Devanagari. Although the correction UI is WYSIWIG, there are a couple of |
That's a lot both ways. Thanks to Sampada.
Understood. |
Meant to say @SergeA in comment above. |
चनुर |
catura with a print smudge.
|
@drdhaval2785 Thanks. That fits with vidagDa. |
Did not expect to find so many errors !!! 7 per page is too many!
Seems to be a good decision to do the detailed checking.
…On Tue, Mar 28, 2017 at 5:48 AM, Mārcis Gasūns ***@***.***> wrote:
It seems to take about 30-45 min per page. On the first 9 pages, there
were 67 corrections -- that would
be 7 per page.
That's a lot both ways. Thanks to Sampada.
I'm planning to do installation about every 10 page
Understood.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#340 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKak33yNnlKrKtjfyroaqTNa6Oe02SjQks5rqIMMgaJpZM4MaUuv>
.
--
*"Faith,more faith!Faith in your possibilities,faith in the Power that is
at work behind the veil,and the offered guidance." - Sri
Aurobindo*
|
I did. I only hope that it's the best UI possible, so you spend as little amount of time as possible in doing so. Thanks, Sampada! |
In AE, the author typically writes a root as 'masj' (to sink, to bathe) and gives forms as 'majjati', etc. In MW, PW , WIL the root shows as 'majj' . So far, I haven't found an author who mentions both forms. Two questions:
|
In Zaliznyak, the authority above all in Russian on dhatus, it's called majj. And it was just yesterday I was exploring it. But I've never seen 'masj' and 'majj' in one source, it's allomorphs of the same root. I've never seen an explanation. MW, PW are on the right track. |
`wumasjo` is the root in dhAtupATha.
`wu` and `o` are anubandhas and are elided.
`masj` is what remains.
Converted to `majj` from rule स्तोः श्चुना श्चुः.
|
So it's not a final product, but a mid-stage byproduct, we could say so, Dhaval? |
Really good explanation as to why both forms are acceptable as 'the' root. |
question on rakzaRasAnaUnder headword bulwark in AE, we see
Is rakzaRasAna possible (if so how to understand 'sAna') -- the print shows this. Or, should it be rakzaRasTAna which makes sense (a bulwark is a 'place of protection' ) ? |
missing headword 'Invincible'Sampada discovered a missing headword in the digitization for page 239 of AE.
To add this without changing subsequent L-numbers is a problem at the moment. Do this when the meta-line form of AE has been accomplished. This would be best done AFTER Sampada has finished the rest of |
Let's change. We do not care yet for others as MW. |
Question re kOwwinyaIn AE, under headword 'pander' we find spelling This spelling is also shown in another edition of AE. However, only Should the 'inyam' spelling be considered a print error in AE ? |
कुट्टनी=कुट्टिनी >> कौट्टन्यम्=कौट्टिन्यम् |
Concerning the previous question for supposed ज्रिम्
another edition by the last link gives perfect reading विरम् |
also by the link - रक्षणसाधनं |
@SergeA Thanks for explanation and research on these three. kOwwinyamMy understanding now is that kOwwinyam is derived from kuwwinI by some 'ya' taddhita suffix formation rule which introduces the vriddhi O of u; and kOwwanyam is similarly derived from kuwwanI. jrim and rakzaRasAnaHave changed to viram and rakzaRasADanaM, per the newer print edition, classifying as print error in print edition used for Cologne digitization. |
Long live the Jim. |
DONE!Sampada informed me today that all 501 pages of AE have now been examined and corrections provided. All in all, there are about 7400 corrections as fruit of this year-long endeavor, approximately 15 corrections per page. Let's all doff our hat to Sampada for her persistence in seeing this project through. The Cologne Thanks, Sampada! |
I extend my gratitude to Sampada. |
7400 corrections - what patience did it take. |
invincible addedThe correction to add 'invincible' as headword (see comment above) has now been made to the meta-line |
Thank you so much ! Your praise is generous!
Everybody in this group is doing their bit to contribute to important
foundational research in the field of Sanskrit ... this is my bit! :-)
Regards
Sampada
…On Tue, Mar 13, 2018 at 10:12 AM Mārcis Gasūns ***@***.***> wrote:
7400 corrections - what patience did it take.
Nothing of this spirit is seen in Pune. It would
take 10 Sampadas to finish that dictionary, but
we have only 1, so I bow to her lotus feet.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#340 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKak3zNj8snJTUMokEDe_F7iochD9hZvks5td4zcgaJpZM4MaUuv>
.
--
*"Faith,more faith!Faith in your possibilities,faith in the Power that is
at work behind the veil,and the offered guidance." - Sri
Aurobindo*
|
@sanskritisampada do not try for this elephant to look like a puppy, it's huge! What's next? |
From the experience of AE corrections in #318, this dictionary is very dirty. There are many errors still remaining, despite the large number of corrections from #318.
Thus, I think that to get a reasonably clean dictionary, we need to systematically review the words on each page.
To that end, a UI has been developed with the aim of making this process of correction of words on a page as efficient as possible.
The main difference from the other correction UIs is that all the lines of a word entry are editable in a
WYSIWIG way (using the tinyMCE javascript library in the background).
I'd like others to try out the current sample of this UI, and make suggestion comments,
This sample deals just with the words on page 1 of AE.
The text was updated successfully, but these errors were encountered: