Skip to content

Latest commit

 

History

History
22 lines (13 loc) · 794 Bytes

TODO.md

File metadata and controls

22 lines (13 loc) · 794 Bytes

urgent:

  • change panic() in inits() into common.Log().Warn()

  • TH2EN: test behavior of rod when BrowserAccessURL is invalid

  • func (tokens TknSliceWrapper) Tokens() []AnyToken

  • write tests


  • mesure perf with pprof (impact of asserting every token in customModule notably)

  • for dealing with arbitrary languages either github.com/pemistahl/lingua-go or aksharamukha script autodetection and redirect at correct language

later:

  • Aksharamukha: somehow implement bulk transliteration (maybe with splitter)
  • add robpike/nihongo to force romanization after ichiran process
  • Slice []AnyToken →→→ Sentences [][]AnyToken using uniseg.FirstSentenceInString
  • Ideally: write a TokenWriter upon which multiple providers may write to in a row to enrich the linguistic annotation