Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hierarchy of disambiguation information #56

Open
tilltnet opened this issue Nov 7, 2018 · 0 comments
Open

Hierarchy of disambiguation information #56

tilltnet opened this issue Nov 7, 2018 · 0 comments

Comments

@tilltnet
Copy link

tilltnet commented Nov 7, 2018

Hi,
I've noticed that the pruning part of the authors_match function separates entries that were formerly matched by the same ORCID, Researcher ID or E-Mail address. In my case that would lead to "unnecessary" under-matching. My quick fix for that was to set the similarity for those entries that were matched by ORCID and RID to 1, which would then exclude them from the pruning. For entries matched by E-Mail addresses the pruning seemed to do a good job though!

I don't know if giving ORCID and RID a higher priority is a universally better solution to the problem, but the way I understand this, ORCID and RID are quite reliable and might also identify a person that had their name changed due to marriage etc. Therefore the pruning those matches by name initials might not be the best solution.

If there are good reasons to overrule ORCID/ RID matches by name initial differences, it I might be worthwhile to consider letting the user decide the hierarchy ORCID/ RID, Email, and names.

Best,
Till

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant