Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extraction of person names, places, etc. #321

Open
michaelnmmeyer opened this issue Jun 21, 2024 · 0 comments
Open

Extraction of person names, places, etc. #321

michaelnmmeyer opened this issue Jun 21, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@michaelnmmeyer
Copy link
Member

This is the kind of tool I had in mind for extracting keywords from documents. GPE means "Geopolitical entity"; NORP means "Nationality or religious or political group". I used one of Manu's inscriptions as example.

stanza

I ran the same thing against all our corpus. There are a lot of tagging errors. This can be improved by reworking the underlying AI model, or by using another programming library which provides better results but is extremely slow.

Even so, the data is already useful as is. We have around 4000 person names which could be used for faceted search or in an autocomplete interface, for instance.

@michaelnmmeyer michaelnmmeyer added the enhancement New feature or request label Jun 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant