In sash runs, the SNV processes involving PCGR (annotate, report) and SigRap/MUTPAT fail for hypermutated samples due to OOM or time-outs with the current process config. Regular samples are fine.
A quick fix is to add:
errorStrategy = 'retry'
maxRetries = x
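which restarts a failed process up to x times. As far as I understand, the retry only gets more memory and CPU if the resource directives are scaled by task.attempt; otherwise it reruns with the same resources.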
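
A rough sketch of what that could look like in the Nextflow config, assuming the affected processes can be matched by name. The selector pattern, base values and retry count are placeholders I made up, not the actual sash process names or current settings:

```groovy
// Sketch only: the selector pattern and base values below are assumptions,
// not the real sash process names or the current resource settings.
process {
    withName: '.*PCGR.*' {
        // Retry only on exit codes that usually indicate OOM kills or scheduler time-outs
        errorStrategy = { task.exitStatus in ((130..145) + 104) ? 'retry' : 'finish' }
        maxRetries    = 2

        // task.attempt is 1 on the first run, 2 on the first retry, and so on,
        // so each retry requests proportionally more resources
        memory = { 16.GB * task.attempt }
        cpus   = { 4 * task.attempt }
        time   = { 4.h * task.attempt }
    }
}
```

The downside is that hypermutated samples would always burn at least one failed attempt before getting enough resources.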
Another possibility is to select the config file adaptively based on DRAGEN results (e.g. variant load, contamination). That would require reading those metrics upstream in Step Functions and propagating them to the workflow execution.
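
Instead of a separate config file, this could possibly be a Nextflow profile that the launcher enables with `-profile hypermutated` when the DRAGEN variant load is above some threshold. A minimal sketch, where the profile name, selector pattern and resource values are all hypothetical:

```groovy
// Sketch only: profile name, selector pattern and values are hypothetical.
profiles {
    hypermutated {
        process {
            // Larger fixed allocation for the processes that fail on high variant load
            withName: '.*(PCGR|SIGRAP).*' {
                memory = 64.GB
                cpus   = 8
                time   = 24.h
            }
        }
    }
}
```

The threshold decision itself would still live upstream in Step Functions; the profile only keeps the resource overrides in one place.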
@alexiswl has this been considered?