Skip to content

sirasagi62/tinysegmenter.nvim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

TinySegmenter.nvim

TinySegmenter.nvim is a plugin that provides a Lua port of TinySegmenter, an ultra-minimal Japanese tokenizer. Since the plugin depends only on the Lua standard library, it can be executed anywhere that supports Lua 5.1 or later.

Install

-- tani/vim-jetpack
Jetpack "sirasagi62/tinysegmenter.nvim"

Usage

local tinysegmenter = require("tinysegmenter")

-- return table like {"これ","は","文章","です","。"}
local parsed_text = tinysegmenter.segment("これは文章です。")

-- これ|は|文章|です|。
print(table.concat(parsed_text,"|"))

API

tinysegmenter.segment(string)

Returns a table of Japanese sentences split into words.

License

This program is provided under BSD-3-Clause.

Copyright information is here: ./lua/tinysegmenter.lua

Acknowledgments

This program is originally created by Taku Kudo in 2008, modified for ES module by Taisuke Fukuno in 2022. Also, utf8.lua is provided under CC0 by NAKAI Tsuyoshi.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages