Skip to content

vanangamudi/tharavukkanam

Repository files navigation

tharavukkanam

collection of datasets

TODO

Collect direct sources for corpora

  • setup download for books and other public works, use wikisource
  • setup for blogs and news, a perdiodical scraper (can use existing crawler)

Dictionaries

  • Scrape every dictionary
    • Winslow
    • Fabricius
  • develop quasi-schema to merge different dictionaries into one

About

collection of datasets

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published