Pterostylis sp. aff. boormanii (Sunset Country) being returned rather than Pterostylis sp. aff. boormanii #207

Open
Sherrin-ALA opened this issue Jun 9, 2023 · 2 comments


@Sherrin-ALA

In the name matching Unit Tests - AlaNameSearcherTest.testAffLookup1
Pterostylis sp. aff. boormanii (Sunset Country) is being returned rather than Pterostylis sp. aff. boormanii.

Pterostylis sp. aff. boormanii (Sunset Country) is an excluded name, but it is coming up with a higher score than Pterostylis sp. aff. boormanii.

@Sherrin-ALA
Author

Notes from Doug:

Worked it out. The taxon.txt entry is:
https://id.biodiversity.org.au/instance/apni/51441972 https://id.biodiversity.org.au/taxon/apni/51441976 ICBN Pterostylis sp. aff. boormanii heterotypicSynonym species dr5214 https://id.biodiversity.org.au/instance/apni/51441972 https://id.biodiversity.org.au/name/apni/194759 Plantae Orchidaceae Pterostylis sp. aff. boormanii Pterostylis sp. aff. boormanii https://id.biodiversity.org.au/reference/apni/51428473 CHAH (7 July 2021), Australian Plant Census Backhouse, G.N. & Jeanes, J.A. (1995), The Orchids of Victoria [273] 1995 https://id.biodiversity.org.au/instance/apni/51441972 Placeholder name has been treated as unique
The important bit here is the Plantae Orchidaceae bit. This will place this in family Orchidaceae and kingdom Plantae, so the base score is 6000, not 5000; then -1000 and -200 give 4800. The plain usage is a synonym and doesn’t get the 6000.
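
For readers following the arithmetic, here is a minimal sketch of the calculation described above. The class and method names are hypothetical and are not taken from the name matching code; the thread does not say what the two penalties represent, so they are treated as opaque numbers.

```java
public class ScoreSketch {
    /**
     * Hypothetical helper: start from a base score and subtract each penalty.
     * This mirrors the arithmetic in the note, not the library's real scoring code.
     */
    public static int applyPenalties(int baseScore, int... penalties) {
        int score = baseScore;
        for (int penalty : penalties) {
            score -= penalty;
        }
        return score;
    }
}
```

`ScoreSketch.applyPenalties(6000, 1000, 200)` gives the 4800 mentioned in the note. The same penalties applied to a base of 5000 would give 3800, but that figure is only an illustration of the gap a 1000-point difference in base score creates, not a claim about the score the plain synonym actually receives.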

The core problem here is that synonyms don’t have much higher taxonomic information and so don’t get the “owned=by” boost. The fix would be either to have a synonym inherit its defaultScore from the default score of the accepted taxon, as well as from the parent for accepted taxa (not a bad idea), or to add an extra rule to detect “Genus (sp.?)? (aff.?)? (cf.)? epithet (Geographic)” in the name (hard to do without complications with authors).
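
A rough sketch of what the second option, the extra detection rule, might look like as a regular expression. The class name, the method name and the exact pattern are assumptions made for illustration; nothing here comes from the existing code.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PlaceholderNameSketch {
    // Genus, optional "sp.", "aff.", "cf." markers, an epithet,
    // then a trailing parenthesised qualifier such as "(Sunset Country)".
    public static final Pattern GEOGRAPHIC_PLACEHOLDER = Pattern.compile(
        "^([A-Z][a-z]+)(?:\\s+sp\\.?)?(?:\\s+aff\\.?)?(?:\\s+cf\\.?)?"
        + "\\s+([a-z-]+)\\s+\\(([^)]+)\\)$");

    /** Returns the parenthesised qualifier if the name has the placeholder shape. */
    public static Optional<String> qualifier(String name) {
        Matcher m = GEOGRAPHIC_PLACEHOLDER.matcher(name.trim());
        return m.matches() ? Optional.of(m.group(3)) : Optional.empty();
    }
}
```

The pattern picks out "Sunset Country" from "Pterostylis sp. aff. boormanii (Sunset Country)" and rejects the plain "Pterostylis sp. aff. boormanii". It also accepts a made-up string such as "Genus epithet (Author)", which is the complication with authors: a parenthesised author looks the same as a geographic qualifier to a rule this simple.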
I've done some experiments with the synonym approach for baseScore and it's going into some infinite loops ... Thanks NZOR
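
One possible way to keep the inheritance idea from looping is sketched below. It assumes the infinite loops come from circular links in the data, which the thread does not confirm. All names (`Taxon`, `inheritsFrom`, `resolve`) are hypothetical and do not reflect the real index builder.

```java
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

public class InheritedScoreSketch {
    /** Hypothetical record: an optional own score and the id to inherit from. */
    public static final class Taxon {
        public final Integer ownScore;     // null when the taxon has no score of its own
        public final String inheritsFrom;  // accepted taxon for a synonym, parent for an accepted taxon

        public Taxon(Integer ownScore, String inheritsFrom) {
            this.ownScore = ownScore;
            this.inheritsFrom = inheritsFrom;
        }
    }

    /** Follows inheritance links until a score is found, a link is missing, or an id repeats. */
    public static int resolve(String id, Map<String, Taxon> taxa, int fallback) {
        Set<String> seen = new HashSet<>();
        String current = id;
        // seen.add returns false when the id was already visited, which ends a cycle.
        while (current != null && seen.add(current)) {
            Taxon taxon = taxa.get(current);
            if (taxon == null) {
                break;
            }
            if (taxon.ownScore != null) {
                return taxon.ownScore;
            }
            current = taxon.inheritsFrom;
        }
        return fallback;
    }
}
```

With this shape, a synonym pointing at an accepted taxon scored 6000 resolves to 6000, while two records that point at each other fall back to the supplied default instead of recursing forever.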

@Sherrin-ALA
Author

Sherrin-ALA commented Jun 9, 2023

An additional issue here may be that Pterostylis sp. aff. boormanii (Sunset Country) isn't being attached to the genus Pterostylis; it's being attached (as a species) to the family Orchidaceae, even though in the taxon.csv file from APC its parentNameUsageID is identical to that of Pterostylis boormanii, which is being placed properly under the genus.

Need to figure out why that's occurring - this may be causing excluded names to be matched to a family level rather than a genus level.
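
As a starting point for that investigation, a small diagnostic along these lines could confirm what rank each record's parent has in the source file, for comparison with where the built index attaches it. This is a sketch only: `Row`, `parentRank` and the in-memory map are hypothetical stand-ins for however the taxon file is actually loaded.

```java
import java.util.Map;

public class ParentRankSketch {
    /** Hypothetical, simplified view of one taxon file row. */
    public static final class Row {
        public final String parentId; // parent identifier from the taxon file
        public final String rank;     // rank of this row, e.g. "genus" or "species"

        public Row(String parentId, String rank) {
            this.parentId = parentId;
            this.rank = rank;
        }
    }

    /** Returns the rank of the direct parent, or null if it cannot be resolved. */
    public static String parentRank(String taxonId, Map<String, Row> rows) {
        Row row = rows.get(taxonId);
        if (row == null || row.parentId == null) {
            return null;
        }
        Row parent = rows.get(row.parentId);
        return parent == null ? null : parent.rank;
    }
}
```

If two rows share the same parent identifier in the file, `parentRank` returns the same value for both, so any difference in where they end up would have to be introduced later, during index building or matching.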
