-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Botanical names #11
Comments
|
This is the extracted data. Someone needs to look into it. |
A general guide can be
|
I can ask a student of mine. Want to weed out all non-flora? |
We need to weed out non-fauna. |
mw_bot.txt lists all the 'bot' tags in mw. It can be compared to wil_botany.txt. |
We have done some preliminary work. The proble is that MW uses outdated terminology and so does WIL. |
See:
|
I think this completes what was requested by @drdhaval2785 in the first comment. |
As regards of |
Yes. Read wil_bot.txt and wil_bio.txt The output could identify the lines that need to be reviewed manually for corrections.
This could be done in an hour or two by a student, I think. |
Thanks, crystal clear and will be done. |
@Amygdalus |
WIL has many botanical names.
That is giving a lot of false positives in finding out English spelling errors.
We need to give it a separate tag, as in SNP.
The text was updated successfully, but these errors were encountered: