Organize outputs into structs where appropriate #44

chrisamiller · 2022-05-06T20:03:27Z

Pipelines which are considered "terminal" should get their outputs organized into structs (which coupled with our scripts for pulling results, makes for neatly nested directories). Right now immuno.cwl is organized in this way.

define the list of these (certainly somatic exome, rnaseq, immuno, etc)
figure out how to pass results from wdls that can be run either as subworkflows or stand-alone. (i.e. rnaseq, which feeds into immuno). Can the output structs just be returned and elements accessed from the higher level wdls? Or do we need to use a param/conditionals to say "do this struct stuff only if you're not being run as a subworkflow".

tmooney · 2022-08-31T19:34:11Z

It should be fine to organize the outputs into arbitrary data structures in either CWL or WDL and pass them along as outputs from tools or subworkflows. The trade-off is having a bunch of custom types defined that you need to track down to know what to expect (which maybe doesn't matter if you're only looking at the output JSON and reading through it).

The current implementation of this in immuno.wdl uses the deprecated-in-WDL-1.1 object keyword... however Cromwell doesn't support WDL 1.1 (just like it doesn't support CWL v1.2 🙂) so I guess it's fine for now?

chrisamiller · 2022-08-31T19:40:49Z

We are open to ideas here - it's definitely clunky. The end goal is to have a gather step at the end that can assign specific names to certain output files and into subdirectories as well. Since cromwell/WDL doesn't natively support this, we thought it ideal to encode as much of that as possible into the WDL, and have a simple script that parses the structs. The alternative would seem to be defining some kind of output mapping file alongside the wdl to be parsed by another script, but that seems more prone to being broken as wdls are changed.

Would love to hear your suggestions if you have other ways to accomplish this goal in a cleaner/better supported manner!

tmooney mentioned this issue Mar 6, 2023

Create QC Metrics output type. #86

Merged

malachig added the enhancement New feature or request label Mar 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Organize outputs into structs where appropriate #44

Organize outputs into structs where appropriate #44

chrisamiller commented May 6, 2022

tmooney commented Aug 31, 2022

chrisamiller commented Aug 31, 2022

Organize outputs into structs where appropriate #44

Organize outputs into structs where appropriate #44

Comments

chrisamiller commented May 6, 2022

tmooney commented Aug 31, 2022

chrisamiller commented Aug 31, 2022