Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Review composite mapping of prefixes and namespaces #8

Open
4 tasks
jmcmurry opened this issue Nov 19, 2015 · 0 comments
Open
4 tasks

Review composite mapping of prefixes and namespaces #8

jmcmurry opened this issue Nov 19, 2015 · 0 comments
Assignees

Comments

@jmcmurry
Copy link
Member

  • Ferret out any cross-registrations that may not be an exact match on prefix, but on other fields.
    • Assess the text-edit distance or other easy to do similarity match across all of the title fields.
    • Based on the above, for each prefix in the list, provide two pieces of data: 1) score indicating how likely the prefix in one row is to be related to a prefix in another row. (eg. corresponding to the same dataset, or to portions of that dataset). (eg. KEGG-disease, vs KEGG-protein). 2) what that corresponding prefix is to investigate
    • Re-arrange the list so that the related ones are clustered together for easier curation (?)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants