ap.xml issues #113
Comments
Seems to be related to the recent discussion (can't locate) when the lines crossing line breaks have been given separate |
what is the ap headword where this occurs? |
afRin |
Problem solved as follows.
|
It will be easier to change ap.xml, since this is done by a Python program which can handle the Probably will add markup so The question then will be how to render the new markup in the displays. Suggestions? |
Make it close to book visually? Bolded numbers.
Those were the days when everything was printed. Cost saving by reducing
spaces was a concern. Now times have changed. We can properly indent them
for easier readability now. No need to retrogress.
|
Bolded numbers are not retro. They catch the eye. |
For bolded numbers, I am with you.
But the overall display should have appropriate indentations.
|
improved version ready for review
A version of ap.xml, with related changes to the display, is now ready for your viewing enjoyment. The changes are evident in the basic, list, etc. displays [Not giving links because of the restricted status However, for the sake of allowing comparisons to the previous version, I haven't yet installed the Take a look, and give me feedback. When there is general agreement on the changes, I'll finish installation steps, and describe some details of the process. |
Jim, have these "glued" words always existed, or is this something new? In
Otherwise it's much better. Still, I have a long-pending question. I do not like what I see on the left, the way the numbers are presented. Nobody I asked understood what L or p means. I would suggest marking them in different colours and removing the L and p tags, and keeping p on one line with no break, as in http://stackoverflow.com/questions/7219007/html-no-line-break-at-hyphens |
I agree, Google's spacing is well thought out. |
The 'indent' question is one I struggled with. I first tried the css I think that the If you can show me how to implement the indentation style your image shows, I'll be glad to |
Regarding the 'L=', etc. comments. I also think the current format is awkward. Currently, the whole part of the basic display is a table with 2 columns; the 'key1, L=,p=' part is in What about making it just one column, and changing the labeling. [Idea implemented experimentally -- take a look.] |
This is a bug in the revised make_xml.py program. Bug now corrected. Good catch! 👍 |
First few lines in ap.txt
Whereas it is rendered in ap.xml as
Have a look at |
The Euro character is not removed. It identifies verbs, but we should identify verb numbers and tag them in XML rather than keep the Euro character.
See |
Here is the first part of headword 'a` in the revised xml:
I think this 'div' structure takes care of your concerns there. @drdhaval2785 Agree? |
There are 3068 lines matching As usual, there are multiple forms that need to be identified, and some likely errors also. We could employ an xml markup similar to that of MW; here's a sample of MW under hw
In AP case, we could render While most cases are simple like this, it will take some work to completely handle all relevant cases.
I have so many things on my todo list that I am reluctant to volunteer to do the needed work to
|
Since no additional comments regarding the revised ap displays (in particular regarding the adjustment to the handling of |
documentation of adding
|
Yes |
There may be some hierarchy uncaptured. But at least we made a start. Slowly we will inch there too. |
Is sufficient to properly indent the display. Good leap in readability and user friendliness. |
documentation of the html rendering of the divs
The html rendering of the
step 1. make the first 'word' of the div bold
This is done by rewriting the xml at the start of each div; here the
Note that this applies to the two superscript cases of ap.txt. For the other div case, we assume the
step 2. indentation
For the 'n=2' superscript, we indent by
This occurs in the context of a SAX xml parser, using the xml_parser_create and related functions of PHP; this is a php version of the expat parser for xml. It is likely that there is an expat parser that could be used in a browser's Javascript code to do all this rendering in the browser. Anyway, the php parser essentially does a tree-walk of the xml structure, and when it encounters a Then when the end of the
That's it. Not too hard, once all the context is understood. |
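The two steps described above (bold first 'word', then indentation while walking the XML with an expat-style parser) can be sketched in Python, whose `xml.parsers.expat` module wraps the same expat library. This is only an illustration, not the PHP code actually installed: the `div` element with an `n` attribute is inferred from the 'n=2' mention, and the indentation amounts in `INDENT_EM` are hypothetical placeholders because the real values are not shown in this thread. Tags other than `div` are simply dropped and their text is kept.

```python
import xml.parsers.expat
from html import escape

# Hypothetical indentation (in em) per div type; the real values are not given in the thread.
INDENT_EM = {"2": 1.5, "3": 3.0}

def render_entry(xml_text):
    """Walk the XML with expat callbacks and return an HTML string."""
    out = []
    state = {"bold_pending": False}  # True until the first word of a div is emitted

    def start(name, attrs):
        if name == "div":
            indent = INDENT_EM.get(attrs.get("n", ""), 0.0)
            # Relative positioning is the indentation technique mentioned in the thread.
            out.append('<div style="position:relative; left:%.1fem;">' % indent)
            state["bold_pending"] = True

    def end(name):
        if name == "div":
            out.append("</div>")
            state["bold_pending"] = False

    def chars(data):
        if state["bold_pending"] and data.strip():
            stripped = data.lstrip()  # leading whitespace before the first word is dropped
            first = stripped.split(None, 1)[0]
            rest = stripped[len(first):]
            out.append("<b>%s</b>%s" % (escape(first), escape(rest)))
            state["bold_pending"] = False
        else:
            out.append(escape(data))

    parser = xml.parsers.expat.ParserCreate()
    parser.buffer_text = True  # coalesce adjacent text into one callback where possible
    parser.StartElementHandler = start
    parser.EndElementHandler = end
    parser.CharacterDataHandler = chars
    parser.Parse(xml_text, True)
    return "".join(out)
```

For example, `render_entry('<body><div n="2">1 first sense</div></body>')` yields a relatively positioned `div` whose leading `1` is wrapped in `<b>`. Keeping the indentation table in one place makes it easy to experiment with different amounts.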
Agree.
Should not. For causative forms, would even some general verbal tag not do? Sure, it's better than nothing, but since Caus. occurs as an abbreviation in many dictionaries, it could be used for regexing them out and giving them what they deserve.
Yeah, indeed, plenty of issues. |
adjustment to css of list-0.2.html display
There was an annoying side-effect of the indentation, when viewed in the list-0.2.html display. This is no doubt due to the relative positioning technique used for indentation. To improve this, a 'padding-right:15px' css rule was added to the list-0.2 display
This improves the situation for ap, and simply adds a little space at the right for other displays. |
Everything now installed. |
p and L mean nothing to anybody. I know them, but none of my students could grasp them without me explaining what they are.
Yeah.
I would add a that is not n=2. I would add a CSS. And in the CSS file I could change and play around and see if 2 is an appropriate choice.
Makes sense, indeed. Let me write my proposals in a new thread. |
Do not think so.
In my view, if explained in a FAQ and in the tooltip for each link, it's ok to have: |
Sorry to hear that. Will add your suggestion to todo list (currently 7 deep). |
It's not about me. I understand, sure. But it's based on how I saw @Shalu411 use it initially, and on people whose mother tongue is not English. The abbreviations are not obvious and need explanation. Even if they are longer, a commentary and intro are wanted. |
<s>
tag. See</s><lb/>.<s>
<s>divasasyAzwame BAge SAkaM pacati yo naraH </s><lb/>.<s> afRI cApravAsI ca sa vAricara modate ..</s>
This gives rise to the period being seen in the web display / stardict conversion too.
Verified from Web display to have period.
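To see how widespread the reported pattern is, a minimal Python sketch like the following could list the lines of ap.xml where a period sits between `<lb/>` and the next `<s>`. The function name `find_stray_periods` is hypothetical, and the sketch only reports matches; it does not decide what should happen to the period and is not the fix that was applied in make_xml.py.

```python
import re

# A period sitting directly between '<lb/>' and the following '<s>' tag.
STRAY_PERIOD = re.compile(r"<lb/>\.(?=<s>)")

def find_stray_periods(lines):
    """Return the 1-based numbers of the lines that contain the pattern."""
    return [i for i, line in enumerate(lines, 1) if STRAY_PERIOD.search(line)]
```

Run over the lines of the file, this gives a list of line numbers to inspect by hand before any change is made to the conversion program.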