Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KuromojiDefault extract noun prefix token #29

Merged
merged 3 commits into from
Sep 11, 2023

Conversation

smori1983
Copy link
Owner

@smori1983 smori1983 commented Sep 10, 2023

Example

node cli.js kuromoji 未分類

[
  {
    word_id: 71130,
    word_type: 'KNOWN',
    word_position: 1,
    surface_form: '未',
    pos: '接頭詞',
    pos_detail_1: '名詞接続',
    pos_detail_2: '*',
    pos_detail_3: '*',
    conjugated_type: '*',
    conjugated_form: '*',
    basic_form: '未',
    reading: 'ミ',
    pronunciation: 'ミ'
  },
  {
    word_id: 393330,
    word_type: 'KNOWN',
    word_position: 2,
    surface_form: '分類',
    pos: '名詞',
    pos_detail_1: 'サ変接続',
    pos_detail_2: '*',
    pos_detail_3: '*',
    conjugated_type: '*',
    conjugated_form: '*',
    basic_form: '分類',
    reading: 'ブンルイ',
    pronunciation: 'ブンルイ'
  }
]

How to resolve

Extract tokens which pos is 接頭詞 and pos_detail_1 is 名詞接続.

@smori1983 smori1983 merged commit a31f13c into master Sep 11, 2023
1 check passed
@smori1983 smori1983 deleted the kuromoji-default-extract-noun-prefix-token branch September 11, 2023 15:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant