Releases: umccr/cwl-ica
dragen-wgts-dna-pipeline/4.4.4__20250915021010
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250915021010 / Bundle Version v11_r5__20250915021010
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250915021010.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: d9b195be-1352-4e42-a7f3-aeb2217efa32
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: defdcf63-27c9-4e7b-9b5d-f621376c1f59
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250915021010
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250915021010 / Bundle Version v11_r5__20250915021010
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250915021010.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 5fc299de-4518-46f7-9f06-2179eef91c7e
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: defdcf63-27c9-4e7b-9b5d-f621376c1f59
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250915021010
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250915021010/dragen-wgts-dna-pipeline__4.4.4__20250915021010.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250902012720
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250902012720 / Bundle Version v11_r5__20250902012720
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250902012720.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: be50ca0c-1099-416a-8a65-d44d68188a55
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 0c7b15f1-f5d4-433a-9333-4618e5ca1731
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250902012720
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250902012720 / Bundle Version v11_r5__20250902012720
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250902012720.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 9728cd22-f150-4a38-8cf3-e16c41f56549
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 0c7b15f1-f5d4-433a-9333-4618e5ca1731
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250902012720
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250902012720/dragen-wgts-dna-pipeline__4.4.4__20250902012720.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250901220047
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250901220047 / Bundle Version v11_r5__20250901220047
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250901220047.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: c5df5fcc-49e7-4493-8bd7-a5f71ccf924a
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: d83b1056-57a9-499a-b0ef-834f6e2cb229
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250901220047
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250901220047 / Bundle Version v11_r5__20250901220047
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250901220047.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: e85e1d34-b695-4086-a42c-a761e70983bd
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: d83b1056-57a9-499a-b0ef-834f6e2cb229
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250901220047
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250901220047/dragen-wgts-dna-pipeline__4.4.4__20250901220047.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250901010327
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250901010327 / Bundle Version v11_r5__20250901010327
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250901010327.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: b5d32b66-83b9-45ef-8978-b222eb658f2e
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: e436c569-6022-4dfb-9692-3f265522a66f
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250901010327
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250901010327 / Bundle Version v11_r5__20250901010327
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250901010327.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: fa4bb886-5020-46c1-8e2c-8a3e511708ec
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: e436c569-6022-4dfb-9692-3f265522a66f
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250901010327
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250901010327/dragen-wgts-dna-pipeline__4.4.4__20250901010327.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250831041955
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250831041955 / Bundle Version v11_r5__20250831041955
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250831041955.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 4b54316c-9698-4107-bfcc-5d5aeb587417
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: c8a273d6-bd83-44de-b2b7-3df53d21b278
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250831041955
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250831041955 / Bundle Version v11_r5__20250831041955
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250831041955.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 00161db3-3123-4c6e-b620-843968b444d3
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: c8a273d6-bd83-44de-b2b7-3df53d21b278
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250831041955
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250831041955/dragen-wgts-dna-pipeline__4.4.4__20250831041955.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250827053004
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250827053004 / Bundle Version v11_r5__20250827053004
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250827053004.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 3f0c7127-8aef-4a1b-ba4f-b9c1a77a6fe3
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 6868123f-a845-444f-a32f-5cda8c56302b
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250827053004
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250827053004 / Bundle Version v11_r5__20250827053004
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250827053004.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 011e4bb7-04f3-47c3-ae8a-92a19379cb31
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 6868123f-a845-444f-a32f-5cda8c56302b
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250827053004
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250827053004/dragen-wgts-dna-pipeline__4.4.4__20250827053004.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-rna-pipeline/4.4.4__20250822040654
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-rna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file, reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - Run the dragen rna pipeline
- Run multiqc on the outputs of the dragen rna pipeline
Inputs:
This pipeline differs from our previous rna pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-rna pipelines
The outputs are:
* Dragen rna output directory (always)
* Multiqc output directory (always)
Dockstore
Visual Overview
Click to expand!
Subworkflows
dragen-wgts-rna-variant-calling-stage__4.4.4.cwl
dragen-create-aligner-options-object__4.4.4.cwl
dragen-create-wgts-rna-variant-calling-options-object__4.4.4.cwl
dragen-create-wgts-alignment-options-object__4.4.4.cwl
dragen-create-mapper-options-object__4.4.4.cwl
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-rna-pipeline%2F4.4.4__20250822040654/dragen-wgts-rna-pipeline__4.4.4__20250822040654.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# annotation file (Optional)
# Docs: The annotation file to be used for
# gene expression quantification, splice variant calling, and gene fusion calling
annotation_file:
class: File
location: icav2://project_id/path/to/file
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# snv variant calling options (Optional)
# Docs: The options to be used for rna variant calling.
# This is a JSON object that contains the options to be used for variant calling.
# This is passed to the Dragen variant calling tool.
snv_variant_caller_options:
enable_variant_caller: false
enable_methylation_calling: false
enable_variant_deduplication: false
enable_vcf_compression: false
enable_vcf_indexing: false
dbsnp:
class: File
location: icav2://project_id/path/to/file
vd_eh_vcf:
class: File
location: icav2://project_id/path/to/file
vd_output_match_log: false
vd_small_variant_vcf:
class: File
location: icav2://project_id/path/to/file
vd_sv_vcf:
class: File
location: icav2://project_id/path/to/file
dn_cnv_vcf:
class: File
location: icav2://project_id/path/to/file
dn_input_vcf:
class: File
location: icav2://project_id/path/to/file
dn_output_vcf:
class: File
location: icav2://project_id/path/to/file
dn_sv_vcf:
class: File
location: icav2://project_id/path/to/file
enable_joint_genotyping: false
enable_multi_sample_gvcf: false
enable_vlrd: false
pedigree_file:
class: File
location: icav2://project_id/path/to/file
qc_snp_denovo_quality_threshold: string
qc_indel_denovo_quality_threshold: string
variant:
class: File
location: icav2://project_id/path/to/file
variant_list:
class: File
location: icav2://project_id/path/to/file
vc_af_call_threshold: string
vc_af_filter_threshold: string
vc_af_call_threshold_mito: string
vc_af_filter_threshold_mito: string
vc_callability_normal_threshold: string
vc_callability_tumor_threshold: string
vc_clustered_event_penalty: string
vc_decoy_contigs:
vc_depth_annotation_threshold: string
vc_depth_filter_threshold: string
vc_emit_ref_confidence: string
rna_vc_enable_homozygous_genotype: false
rna_vc_homozygous_genotype_af_threshold:
vc_max_alternate_alleles: string
vc_emit_zero_coverage_intervals: false
vc_mnv_emit_component_calls: false
vc_combine_phased_variants_distance: string
vc_combine_phased_variants_max_vaf_delta:
vc_combine_phased_variants_distance_snvs_only: string
qc_detect_contamination: false
# gene fusion detection options (Optional)
# Docs: The Gene Fusion Detection options for the rna gene fusion detection
gene_fusion_detection_options:
enable_rna_gene_fusion: false
rna_gf_blast_pairs:
class: File
location: icav2://project_id/path/to/file
# gene expression quantification options (Optional)
# Docs: The Gene Expression Quantification options for the rna gene expression quantification
gene_expression_quantification_options:
enable_rna_quantification: false
rna_quantification_gc_bias: false
rna_quantification_fld_max: string
rna_quantification_fld_mean: string
rna_quantification_fld_sd: string
rna_quantification_tlen_min: string
# splice variant caller options (Optional)
# Docs: The Splice Variant Caller options for the rna spl...
dragen-wgts-dna-pipeline/4.4.4__20250822104303
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250822104303 / Bundle Version v11_r5__20250822104303
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250822104303.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 54473d95-84e8-400d-9a1c-86782aec21e1
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: f6a4c255-5d49-4379-91ec-dd89b76a213f
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250822104303
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250822104303 / Bundle Version v11_r5__20250822104303
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250822104303.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: a063caa0-5b7a-480f-85e0-1a39ff7ad98e
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: f6a4c255-5d49-4379-91ec-dd89b76a213f
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250822104303
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250822104303/dragen-wgts-dna-pipeline__4.4.4__20250822104303.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250822034548
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250822034548 / Bundle Version v11_r5__20250822034548
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250822034548.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 6abf113b-27dd-46ad-91c9-8e59364ca599
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 1ce7dab7-6f83-48c6-8a06-173d06048504
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250822034548
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250822034548 / Bundle Version v11_r5__20250822034548
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250822034548.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 416025c3-7cca-49c0-8be4-d42a0cd0abd3
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 1ce7dab7-6f83-48c6-8a06-173d06048504
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250822034548
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250822034548/dragen-wgts-dna-pipeline__4.4.4__20250822034548.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...
dragen-wgts-dna-pipeline/4.4.4__20250805055151
Overview
MD5Sum: None
Documentation
Dragen dragen-wgts-dna-pipeline v4.4.4
Performs the following steps:
- If input is a bam file (for tumor or normal), reheader the bam file such that the rgsm value matches
the output_prefix value. We can't do this inside the dragen container since it doesn't have the samtools binary installed.
Instead we run a seperate step to reheader the bam file. - If tumor sequence data is provided and normal sequence data are provided, run the dragen-somatic tool.
- If only the normal sequence data are provided, run the dragen-germline tool.
Inputs:
This pipeline differs from our previous dragen-germline and dragen-somatic pipeline by instead grouping inputs.
This makes overall workflow cwl file much smaller, and the workflow graphs much more readable.
It also means appending input options is much easier since it involves updating just one schema yaml file,
rather than the workflow inputs, workflow steps and the tool inputs.
Most of the grunt work is done in the dragen tools typescript file to convert input options into a configuration file.
Outputs:
The outputs are the same as the previous dragen-germline and dragen-somatic pipelines.
The outputs are:
* Dragen somatic output directory (if tumor sequence data are provided)
* Dragen germline output directory (always)
* Multiqc output directory (always)
Dockstore
ICAv2
Tenant: umccr-prod
Bundles Generated
Bundle Name: dragen_wgts_dna_pipeline_with_validation_data_v4_4_4__20250805055151 / Bundle Version v11_r5__20250805055151
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250805055151.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: 57c39c53-7c3f-4713-8f3e-b721adf9712a
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 3dff2d05-569a-4b5a-88b8-bdca48a121bd
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250805055151
Projects
- development
- staging
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- wgs_validation_fastq__cups_pair_8
- wgs_validation_fastq__2016_249_17_MH_P033
- wgs_validation_fastq__2016_249_18_WH_P025
- wgs_validation_fastq__B_ALL_Case_10
- wgs_validation_fastq_Diploid_Never_Responder
- wgs_validation_fastq_SBJ00303
- wgs_validation_fastq_SEQC50
- wgs_validation_fastq_SFRC01073
- ora_reference_v2
Bundle Name: dragen_wgts_dna_pipeline_production_v4_4_4__20250805055151 / Bundle Version v11_r5__20250805055151
Description
This bundle has been generated by the release of workflows/dragen-wgts-dna-pipeline/4.4.4/dragen-wgts-dna-pipeline__4.4.4.cwl. The pipeline can be found at https://github.com/umccr/cwl-ica/releases/tag/dragen-wgts-dna-pipeline/4.4.4__20250805055151.
Version Description
Bundle version description is currently redundant while we cannot append versions to bundles. Regardless - the bunch version is v11_r5
Bundle ID: ab287c5d-5222-40a9-bf61-64660c54585f
- Bundle Link
Pipeline Project ID: 5844391a-69db-4b52-86b5-6a0d55c2386f
Pipeline Project Name: pipelines
Pipeline ID: 3dff2d05-569a-4b5a-88b8-bdca48a121bd
Pipeline Code: dragen-wgts-dna-pipeline__4_4_4__20250805055151
Projects
- production
Datasets
- dragen_hash_table_chm13_v2_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_chm13_v2_v11_r5_graph_cnv_hla_methyl_cg_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_linear_cnv_hla_methyl_cg_methylated_combined_rna
- dragen_hash_table_hg38_alt_masked_v11_r5_graph_cnv_hla_methyl_cg_rna
- ora_reference_v2
Visual Overview
Click to expand!
Inputs Template
Yaml
Click to expand!
# yaml-language-server: $schema=https://github.com/umccr/cwl-ica/releases/download/dragen-wgts-dna-pipeline%2F4.4.4__20250805055151/dragen-wgts-dna-pipeline__4.4.4__20250805055151.schema.json
# sequence data (Required)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# tumor sequence data (Optional)
# Docs: The sequence data to be aligned and called variants on.
# This can either be a bam file, cram file or a list of fastq list row objects
# Only specify the tumor sequence data IF you want to run the somatic variant calling pipeline.
tumor_sequence_data:
fastq_list_rows:
rgid: string
rglb: string
rgsm: string
rgcn: string
rgds: string
rgdt: string
rgpl: string
lane: string
read_1:
class: File
location: icav2://project_id/path/to/file
read_2:
class: File
location: icav2://project_id/path/to/file
# reference (Required)
# Docs: The reference genome to be used for alignment and variant calling.
reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# somatic reference (Optional)
# Docs: The somatic reference genome to be used for alignment and variant calling.
# This is only used if the somatic reference is different from the normal reference.
somatic_reference:
name: string
structure:
tarball:
class: File
location: icav2://project_id/path/to/file
# reference (Optional)
# Docs: The ora reference used to decompress the input fastq files,
# required if any of the fastqs are ora-compressed
ora_reference:
class: File
location: icav2://project_id/path/to/file
# sample name (Required)
# Docs: The sample name to be used for alignment and variant calling.
# This is used to name the output files
sample_name: string
# tumor sample name (Optional)
# Docs: The tumor sample name to be used for alignment and variant calling.
# This is used to name the output files
tumor_sample_name: string
# alignment options (Optional)
# Docs: The options to be used for alignment.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: false
input_qname_suffix_delimiter: string
output_format: string
preserve_bqsr_tags: false
preserve_map_align_order: false
qc_coverage:
region:
class: File
location: icav2://project_id/path/to/file
report_type: string
thresholds:
- string
ref_sequence_filter: false
remove_duplicates: false
sample_size: string
strip_input_qname_suffixes: false
aligner:
aln_min_score: string
clip_pe_overhang: string
dedup_min_qual: string
en_alt_hap_aln: string
en_chimeric_aln: string
gap_ext_pen: string
gap_open_pen: string
global: string
hard_clip_tags: string
hard_clips: string
mapq_floor_1snp: string
map_orientations: string
mapq_max: string
mapq_strict_sjs: string
match_n_score: string
match_score: string
max_rescues: string
min_overhang: string
min_score_coeff:
mismatch_pen: string
no_noncan_motifs: string
no_unclip_score: string
no_unpaired: string
pe_max_penalty: string
pe_orientation: string
rescue_sigmas:
sec_aligns: string
sec_aligns_hard: string
sec_phred_delta: string
sec_score_delta:
supp_aligns: string
supp_as_sec: string
supp_min_score_adj: string
unclip_score: string
unpaired_pen: string
mapper:
ann_sj_max_indel: string
edit_chain_limit: string
edit_mode: string
edit_read_len: string
edit_seed_num: string
map_orientations: string
max_intron_bases: string
min_intron_bases: string
seed_density: string
rrna_filter_enable: false
rrna_filter_contig: string
rna_library_type: string
rna_mapq_unique: string
rna_ann_sj_min_len: string
# somatic alignment options (Optional)
# Docs: The options to be used for alignment of the tumor sample.
# This is a JSON object that contains the options to be used for alignment.
# This is passed to the Dragen alignment tool.
# For the somatic alignment, both alignment options and somatic alignment options are first merged before being provided to dragen
somatic_alignment_options:
enable_duplicate_marking: false
enable_deterministic_sort: false
enable_sampling: false
enable_down_sampler: false
append_read_index_to_name: false
fastq_offset: string
filter_flags_from_output: string
generate_md_tags: false
generate_sa_tags: false
generate_zs_tags: fals...