Extract a "phrase token" as a single word by setting a "user dictionary" #13693
lamrongol started this conversation in New Features & Project Ideas
Replies: 0 comments
For example, "good morning!" is tokenized into two separate tokens, "good" and "morning". What I want instead is a single token with one part-of-speech tag, like this:

good morning INTJ

I think this feature doesn't exist yet: PhraseMatcher and rule-based matching only extract the specified phrases rather than treating each phrase as a single token. I want to define such phrases myself in a "dictionary file".
This feature exists in Kuromoji (a Japanese NLP library), as described on the following Japanese-language page: https://qiita.com/masato_ka/items/2bae7f1e8ce245443947
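To make the request concrete, here is a minimal pure-Python sketch of the behavior I have in mind. It is only an illustration, not spaCy or Kuromoji code: the `phrase,POS` dictionary format, the names `USER_DICT`, `load_user_dict`, and `merge_phrases`, and the input tags are all hypothetical. It merges dictionary phrases into one token using a greedy longest match over already tokenized text.

```python
import csv
import io

# Hypothetical dictionary format (an assumption, not a spaCy or Kuromoji format):
# one "phrase,POS" entry per line.
USER_DICT = """\
good morning,INTJ
new york,PROPN
"""


def load_user_dict(text):
    """Map a tuple of lowercased words to the POS tag given for that phrase."""
    entries = {}
    for row in csv.reader(io.StringIO(text)):
        if len(row) != 2:
            continue  # skip blank or malformed lines
        phrase, pos = row
        entries[tuple(phrase.lower().split())] = pos.strip()
    return entries


def merge_phrases(tokens, entries):
    """Greedy longest-match merge of dictionary phrases over (text, pos) tokens."""
    max_len = max((len(key) for key in entries), default=0)
    merged = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate window first, then shrink it.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(text.lower() for text, _ in tokens[i:i + n])
            if key in entries:
                surface = " ".join(text for text, _ in tokens[i:i + n])
                merged.append((surface, entries[key]))
                i += n
                break
        else:
            # No dictionary phrase starts here; keep the token unchanged.
            merged.append(tokens[i])
            i += 1
    return merged


# Illustrative input: the tags here are placeholders, not real tagger output.
tokens = [("good", "ADJ"), ("morning", "NOUN"), ("!", "PUNCT")]
result = merge_phrases(tokens, load_user_dict(USER_DICT))
for text, pos in result:
    print(text, pos)
```

With this input, "good" and "morning" come back as the single entry "good morning" tagged INTJ, and "!" is left unchanged. Matching is case-insensitive here, which is a design choice of the sketch rather than a requirement.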