Skip to content
This repository has been archived by the owner on Aug 13, 2021. It is now read-only.

[266] Refactor and simplify ES configuration #275

Merged
merged 16 commits into from
Jun 9, 2020
Merged

[266] Refactor and simplify ES configuration #275

merged 16 commits into from
Jun 9, 2020

Conversation

jaklinger
Copy link
Contributor

@jaklinger jaklinger commented Jun 1, 2020

Closes #266

This will likely require spawning of sub-PRs to manage complexity of reviewing the rewiring of

a) the base config [this PR]
b) the ES mappings [PR #280]
c) schema transformation tidying and mapping tests [PR #281]
d) the pipeline codebase [need to spawn this PR by rebasing from this branch]

jaklinger and others added 3 commits June 9, 2020 12:39
* changed branch name

* mappings build

* updated docs

* updated docs

* updated docs

* added docstrings

* added dynamic strict to settings

* removed index.json in favour of a single defaults file

* using soft alias until a future PR to minimise changes

* cleaned and sorted json

* [267] Tidy & slim schema transformations (#281)

* pruned deprecated schema transformations

* updated fos fieldname on arxlive

* unified data set schema transformations

* restructured directory

* refactored references to schema_transformation

* refactored references to schema_transformation

* slimmed down transformations, and included entity_type

* pruned ontology

* tidied schemas

* consistency tests

* reverted unrelated json file

* harmonised name fieldsofstudy across arxiv

* added novelty back in

* sorted json

* sorted json

* sorted json

Co-authored-by: Joel Klinger <[email protected]>

Co-authored-by: Joel Klinger <[email protected]>
@jaklinger jaklinger marked this pull request as ready for review June 9, 2020 16:43
@jaklinger jaklinger merged commit f98955a into dev Jun 9, 2020
@jaklinger jaklinger deleted the 266_es_base branch June 9, 2020 16:44
jaklinger added a commit that referenced this pull request Sep 28, 2020
* make sure conf dir is empty

* simplified es config

* added orm es config reader

* modified setup_es to pick up new es config

* swapped es_mode for boolean

* aliases now consistent with config

* aliases now automatically located

* added endpoint field to estasks

* added endpoint field to sql2estasks

* [267] Pool ES mappings across datasets (#280)

* changed branch name

* mappings build

* updated docs

* updated docs

* updated docs

* added docstrings

* added dynamic strict to settings

* removed index.json in favour of a single defaults file

* using soft alias until a future PR to minimise changes

* cleaned and sorted json

* [267] Tidy & slim schema transformations (#281)

* pruned deprecated schema transformations

* updated fos fieldname on arxlive

* unified data set schema transformations

* restructured directory

* refactored references to schema_transformation

* refactored references to schema_transformation

* slimmed down transformations, and included entity_type

* pruned ontology

* tidied schemas

* consistency tests

* reverted unrelated json file

* harmonised name fieldsofstudy across arxiv

* added novelty back in

* sorted json

* sorted json

* sorted json

Co-authored-by: Joel Klinger <[email protected]>

Co-authored-by: Joel Klinger <[email protected]>

* patched out es config setup from tests

* removed redundant tests

* fixed json formatting

* none included for testing

* picked up bug in test

Co-authored-by: Joel Klinger <[email protected]>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[DAPS] Migrate to base ES config, to remove repetition from config
1 participant