You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The way the pelias-parser Classifiers works is, that they take all the configured tokens for the configured countries and apply them to given address string irrespective of which country the address belong to.
For example the CompoundStreetClassifier reads in the tokens for the following countries:
The problem here is that if we have define a street type for Denmark (de), which is not the a valid street type for Norway (nb), the classifier will still try to classify the Norwegian addresses with the Danish Street types and the other street types defined for other countries.
So for example we want to add land and lien as valid street types for Norway, but will then it will affect other countries addresses, like the following unit test for French address fails when adding 'lien' as street type for Norway.
address FRA: Rue de l'Empereur Julien Paris
expected: |-
[ { street: 'Rue de l\'Empereur Julien' }, { locality: 'Paris' } ]
actual: |-
[ { street: 'Rue de l\'Empereur' }, { locality: 'Julien' } ]
This works best when our scope is full earth, where we don't know in advance which country the address belongs to. But in our case for example we know that we have norwegian addresses, so it would be nice if the classifier can be optionally configured to use only the Norwegian street types and not from the other countries.
Solution
The solution is obviously be able to configure the countries we want the parser to work on.
We can use the options parameter in parser/Parser.js to send in the configuration.
And in case of pelias-api, we can add the configuration option in the pelias-config, which pelias-api can send further into the pelias-parser.
The text was updated successfully, but these errors were encountered:
The way the
pelias-parser
Classifiers works is, that they take all the configured tokens for the configured countries and apply them to given address string irrespective of which country the address belong to.For example the
CompoundStreetClassifier
reads in the tokens for the following countries:The problem here is that if we have define a street type for Denmark (de), which is not the a valid street type for Norway (nb), the classifier will still try to classify the Norwegian addresses with the Danish Street types and the other street types defined for other countries.
So for example we want to add
land
andlien
as valid street types for Norway, but will then it will affect other countries addresses, like the following unit test for French address fails when adding 'lien' as street type for Norway.This works best when our scope is full earth, where we don't know in advance which country the address belongs to. But in our case for example we know that we have norwegian addresses, so it would be nice if the classifier can be optionally configured to use only the Norwegian street types and not from the other countries.
Solution
The solution is obviously be able to configure the countries we want the parser to work on.
We can use the
options
parameter inparser/Parser.js
to send in the configuration.And in case of
pelias-api
, we can add the configuration option in thepelias-config
, whichpelias-api
can send further into thepelias-parser
.The text was updated successfully, but these errors were encountered: