Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pos-id.defで素性の容量を削減可能 #4

Open
Leko opened this issue Dec 30, 2021 · 0 comments
Open

pos-id.defで素性の容量を削減可能 #4

Leko opened this issue Dec 30, 2021 · 0 comments
Labels
enhancement New feature or request

Comments

@Leko
Copy link
Owner

Leko commented Dec 30, 2021

品詞IDの定義
概要
出力される素性(品詞)に任意の数値ID を付与することができます. 通常, 素性は文字列として表現されますが, 機械処理には向いていません. 数値ID に変換することで, 機械処理が容易になります.
素性にどの ID を割りあてるかは, ユーザが自由に定義することができます.

MeCab: 品詞 ID

もっと複雑な例

その他,間投,*,* 0
フィラー,*,*,* 1
感動詞,*,*,* 2
記号,アルファベット,*,* 3
記号,一般,*,* 4
記号,括弧開,*,* 5
記号,括弧閉,*,* 6
記号,句点,*,* 7
記号,空白,*,* 8
記号,読点,*,* 9
形容詞,自立,*,* 10
形容詞,接尾,*,* 11
形容詞,非自立,*,* 12
助詞,格助詞,一般,* 13
助詞,格助詞,引用,* 14
助詞,格助詞,連語,* 15
...

MeCab: 品詞 ID

@Leko Leko added the enhancement New feature or request label Dec 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant