Multi-word Entity #5139
Replies: 2 comments
-
Most statistical NER models tag each word individually, with tags that show whether a word is at the beginning, middle, or end of an entity, or not part of an entity at all (O). One common tagging scheme is IOB (inside-outside-beginning). The quality of this article is not great, but it has some useful examples and links: https://en.wikipedia.org/wiki/Inside%E2%80%93outside%E2%80%93beginning_(tagging) What spaCy stores internally in a
Output:
I hope that helps!
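To make the tagging idea concrete, here is a minimal sketch in plain Python of how per-word IOB tags can be grouped back into multi-word entities. It is not spaCy's actual implementation: `iob_to_spans` is a hypothetical helper, and the tags are written by hand for part of the example sentence from this thread rather than produced by a model.

```python
def iob_to_spans(tokens, tags):
    """Group tokens into (entity_text, label) pairs based on IOB tags."""
    spans = []
    current_tokens, current_label = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity; close any open one first.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_label))
            current_tokens, current_label = [token], tag[2:]
        elif tag.startswith("I-") and current_tokens and tag[2:] == current_label:
            # An I- tag with the same label continues the open entity.
            current_tokens.append(token)
        else:
            # "O" (or a stray I- tag with no open entity) ends any open entity.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_label))
            current_tokens, current_label = [], None

    # Close an entity that runs to the end of the sentence.
    if current_tokens:
        spans.append((" ".join(current_tokens), current_label))
    return spans


# Hand-written tags for illustration only (not model output).
tokens = ["Mr.", "Best", "flew", "to", "New", "York", "on", "Saturday"]
tags = ["O", "B-PERSON", "O", "O", "B-GPE", "I-GPE", "O", "B-DATE"]

entities = iob_to_spans(tokens, tags)
print(entities)
```

The point of the sketch is that "New York" never has to be looked up as a two-word unit: the model predicts one tag per word ("New" as the beginning of a GPE, "York" as inside it), and the span falls out of the tag sequence.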
-
Yeah, that was so helpful, thank you :) I want to understand how spaCy recognizes these entities.
-
Hello, I am new to spaCy, and I would like to know how the model can recognize entities that consist of two words.
For example, in this sentence ("Mr. Best flew to New York on Saturday morning from New delhi to buy a New clothes."), spaCy recognizes "New York" and "New Delhi" as GPE entities.
I want to know how spaCy works. Does it go through the sentence word by word and compare each word with the words in the training dataset?
If that is the case, how can it detect entities with two words, like "New York" and "New Delhi"?
Thank you