Issue Entity Ruler : detect Entities with several tokens #13688
Unanswered
CaSSog
asked this question in
Help: Coding & Implementations
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I have an issue in using Entity Ruler for a project in the specific domaine of french Agriculture.
I am trying to recognize specific french agricultural organization in different text over the web.
For this I have created a patterns.jsonl file with specific rules that I call before NER. It's working well except for entities with several tokens and where Spacy recognize other label in the entity name.
An example :
One of this organization is "Chambre d'Agriculture" (in english Chamber of Agriculture). In texts This entity is often followed by a localization. For example "Chambre d'Agriculture de Bourgogne" (in english Chamber of Agriculture of Burgundy) or "Chambre d'Agriculture France" (in english French Chamber of Agriculture).
In these case Spacy either recognize only the region as a localization (for example Burgundy) or recognize all the entity as a localization (for example Chamber of Agriculture of Burgundy as a localization).
Here is the rule I have created in my patterns file :
Here is my code :
Beta Was this translation helpful? Give feedback.
All reactions