You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
cedricdcc
changed the title
TravHarv does not dereference on first run
TravHarv subjects must be made when task is started , not when config builder is called
Apr 16, 2024
When testing in kgap , it was found that all subjects for all tasks are made when config_builder is called and not when tasks are started. This causes many tasks not to have any subjects to dereference.
- Added helper.py containing functions that allow for prefix support in sparql queries and traversal harvesting paths.
- deleted call functions and refactored code in config_builder and all subsequent files that used this __call__ method.
- replaced and refactored all files that worked with the GraphNameMapper, now the maper of py-rdf-store is being used.
- refactored the config builder propery subjects so that when they are called they will get the subjects from the graph if
that is required (when SPARQL query is given instead of list of subjects).
- edited the .yml files that are used as configs to now not contain the <> anymore in the prefixes since these will now cause issues for the helper functions resolve_uri()
Issues that were affecting by the changes in this commit are:
- #35
- #43
- #48
- #34
With the following config:
Travharv does not dereference the publications from a given dataset.
However on the next run it does.
The same issue has been detected for the LWUA where @laurianvm had to rerun the sembench container for the publications to be dereferenced.
The text was updated successfully, but these errors were encountered: