Create service info endpoint #289
Comments
This issue is stale because it has been open 45 days with no activity. Please make a comment for triaging or closing the issue.
@bwalsh suggested also including URLs to the public S3 CDMs used.
I think this is a deceptively complicated question. If we're just talking about being able to fast-forward a local database to match a snapshot given on the cloud instance, I think the code is ~there already -- we should be thinking about how to automate data updates/dumps in the cloud, of course, but the basic CLI command exists. The complicating factor is that search outcomes are highly dependent on the software and data versions of each of the normalizers. Reproducing the result of a search isn't just a matter of matching your MetaKB graph to the remote instance; you also need to match a smattering of library versions, 3 DynamoDB tables, a SeqRepo snapshot, and a UTA database version (plus some coolseqtool stuff), depending on the kind of question that you ask. IMO, a good starting point would be to implement the GA4GH service info spec and include a bunch of extra fields that display as much data and software versioning as we can. Like, a totally excessive amount of information. In the long term, we could work towards a set of features along the lines of
^ That doesn't solve the issue of normalizer snapshots/reproducibility, but maybe we could tackle them the same way (publish regular snapshots for each and write methods to copy them to a local DB) and then incorporate that into the metakb update-from-snapshot procedure.
@jsstevenson I think automation/reproducibility should be a separate issue, maybe even an epic. IMO this issue is more about having an
Yeah, I think the spec is
@jsstevenson IMO any metadata we can expose is helpful. We already have a lot of it -- UTA, SeqRepo, dates we pulled CIViC/MOA, GKS schema versions. Is there anything else major I'm missing?
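To make the discussion above concrete, here is a rough sketch of what a service-info payload with extra versioning fields might look like. It is only an illustration under stated assumptions: the top-level keys (`id`, `name`, `type`, `organization`, `version`) are the required fields of the GA4GH service-info spec, while the `x_versions` block, the function names, and every placeholder value (`org.example`, `Example Org`, the example URL) are hypothetical and not taken from the MetaKB codebase.

```python
import json
from importlib.metadata import PackageNotFoundError, version


def package_version(name: str) -> str:
    """Return the installed version of a package, or "unknown" if it is absent."""
    try:
        return version(name)
    except PackageNotFoundError:
        return "unknown"


def build_service_info(
    service_version: str,
    uta_version: str,
    seqrepo_snapshot: str,
    source_pull_dates: dict[str, str],
    libraries: tuple[str, ...] = (),
) -> dict:
    """Assemble a service-info style dict (hypothetical helper).

    The caller supplies the data versions; nothing here queries UTA,
    SeqRepo, or any remote service.
    """
    return {
        # Required fields from the GA4GH service-info spec.
        # All values below are placeholders.
        "id": "org.example.metakb",
        "name": "MetaKB",
        "type": {
            "group": "org.example",
            "artifact": "metakb",
            "version": service_version,
        },
        "organization": {"name": "Example Org", "url": "https://example.org"},
        "version": service_version,
        # Hypothetical extension block: as much data and software
        # versioning as we can collect.
        "x_versions": {
            "uta": uta_version,
            "seqrepo": seqrepo_snapshot,
            "source_pull_dates": dict(source_pull_dates),
            "libraries": {name: package_version(name) for name in libraries},
        },
    }


if __name__ == "__main__":
    info = build_service_info(
        service_version="0.0.0",  # placeholder
        uta_version="<uta schema name>",  # placeholder
        seqrepo_snapshot="<seqrepo snapshot date>",  # placeholder
        source_pull_dates={"civic": "<date>", "moa": "<date>"},
        libraries=("pip",),
    )
    print(json.dumps(info, indent=2))
```

Keeping the extras under a single namespaced key means a generic service-info client can ignore them, while a reproducibility check can diff that one block between a local install and the remote instance.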
Conform to https://www.ga4gh.org/product/service-info/
Info to provide:
Other things