Distinguish facility IDs from building IDs and start indexing both #5

Open
Abbe98 opened this issue Nov 28, 2016 · 0 comments

Abbe98 commented Nov 28, 2016

Probably happening in late December (required for Kyrkosok/web-client#23).

  • Figure out the exact value of the BBR ID change break point; see Template:BBR-länk for estimated values, then loop over HTTP requests to find the exact one. Once done, update the template too.

  • Run the kulturarvsdata-prefer-rdf.py bot.

  • Check for duplicate statements (there should be none or very few); I have seen something for this task over at Tool Labs.

  • Start indexing the WLM lists on sv.wikipedia.org to a CSV or SQLite file (index only WP articles and BBR URIs?).

  • Check this list against existing data in Wikidata. Look for conflicts and for data which exists only in Wikidata (which should not be the case).

  • Fix any data that needs fixing.

  • Add Wikipedia articles for all the WLM BBR items missing one (if Geonames can be a source for bot-created articles, anything can be a source).

  • Index a new CSV or SQLite file from the WLM tables.

  • Import all the missing data to Wikidata.

  • Start indexing both facility and building IDs (breaks the API). Use the "BBR ID change break point"; if it's a fuzzy one, create a buffer where all IDs get verified using HTTP requests (the way all IDs are currently validated).

  • Add all the Wikidata IDs to the WLM lists on sv.wikipedia.org and notify the folks over at Phabricator. Research how to parse and process wikitext tables <-- new to me
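
For the break point step in the list above, a bisection could get to the exact value with far fewer requests than a plain loop. This is only a rough sketch: `find_break_point` and the `is_after_break` callback are made-up names, and it assumes the two estimates from Template:BBR-länk bracket a single clean break point. The callback is where the HTTP request would go; the actual URL and response check are not covered here.

```python
from typing import Callable


def find_break_point(lo: int, hi: int, is_after_break: Callable[[int], bool]) -> int:
    """Return the smallest ID in [lo, hi] for which is_after_break(id) is True.

    Assumption: is_after_break(lo) is False, is_after_break(hi) is True,
    and the answer flips exactly once in between (a clean break point).
    """
    if is_after_break(lo) or not is_after_break(hi):
        raise ValueError("the estimates do not bracket the break point")
    # Invariant: is_after_break(lo) is False and is_after_break(hi) is True.
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_after_break(mid):
            hi = mid
        else:
            lo = mid
    return hi
```

In real use `is_after_break` would wrap one HTTP lookup per ID, so a range of a million IDs needs about twenty requests. If the break point turns out to be fuzzy the single-flip assumption no longer holds and the result is only one of several boundaries.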
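
The buffer idea from the indexing step could look roughly like the sketch below: IDs far from the break point are classified by range alone, and only IDs inside the buffer fall back to the HTTP check. `classify_id` and `verify` are hypothetical names, and since the list does not say which kind of ID lies below the break point, the sketch returns the neutral labels "below" and "above" and leaves the mapping to facility/building to the caller.

```python
from typing import Callable


def classify_id(bbr_id: int, break_point: int, buffer: int,
                verify: Callable[[int], str]) -> str:
    """Classify an ID by range, verifying only IDs close to the break point.

    verify(bbr_id) stands in for the HTTP-based validation and must
    return "below" or "above" itself (hypothetical callback).
    """
    if abs(bbr_id - break_point) <= buffer:
        # Inside the fuzzy zone: do not trust the range, ask the service.
        return verify(bbr_id)
    return "below" if bbr_id < break_point else "above"
```

With a buffer of 0 only the break point ID itself is verified; a larger buffer trades more HTTP requests for safety around a fuzzy boundary.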
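
The "check this list against existing data in Wikidata" step boils down to a comparison of two mappings. A minimal sketch, assuming both the WLM index and the Wikidata export have already been loaded into dicts keyed by BBR URI with the sv.wikipedia article title as value (the loading itself, and the name `compare_index`, are assumptions and not part of the plan above):

```python
from typing import Dict, List, Tuple


def compare_index(wlm: Dict[str, str],
                  wikidata: Dict[str, str]) -> Tuple[List[str], List[str], List[str]]:
    """Compare two BBR URI -> article title mappings.

    Returns (missing_in_wikidata, only_in_wikidata, conflicts),
    each as a sorted list of BBR URIs.
    """
    missing_in_wikidata = sorted(wlm.keys() - wikidata.keys())
    only_in_wikidata = sorted(wikidata.keys() - wlm.keys())
    # A conflict is a URI present on both sides with different articles.
    conflicts = sorted(uri for uri in wlm.keys() & wikidata.keys()
                       if wlm[uri] != wikidata[uri])
    return missing_in_wikidata, only_in_wikidata, conflicts
```

The first list feeds the import step, the second should be empty or close to it, and the third is the set that needs manual fixing.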
