What is the difference of norm and lower attributes in token #13283
-
Hi All, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi! In many cases, Example:
output:
Here you see that the token |
Beta Was this translation helpful? Give feedback.
Hi!
In many cases,
token.norm
andtoken.lower
will be the same. Some languages can have tokenizer exceptions though where thenorm
attribute gets assigned and holds more information beyond just the lowercasing of the token.Example:
output:
Here you see that the token
n't
is normalized tonot
.