Norovirus Nextflow Pipeline

Overview

Norovirus is a highly contagious virus responsible for causing gastroenteritis, an infection of the stomach and intestine. British Columbia sees outbreaks of Norovirus every year, particularly associated with consumption of contaminated batches of shellfish. This Nextflow pipeline aims to generate detailed typing, variant calling, and phylogenetic information for clinical Norovirus isolates sequenced using amplicon-based methods.

Norovirus Typing

Norovirus has a linear, positive-sense RNA genome approximately 7.5 kb in length with three open reading frames. Two genes of particular interest are the polymerase (RDRP) and capsid (VP1) genes located in ORF1 and ORF2, respectively. As Norovirus displays a strong propensity to recombine around the ORF1/2 boundary, researchers have adopted a dual typing scheme that combines both the capsid type (genotype) and polymerase type (p-type) in a single notation (genotype[p-type]) . To adhere to this dual typing scheme, this Norovirus pipeline branches at appropriate points in the workflow to type the capsid and polymerase genes independently. Typing these two genes independently ensures that previously unseen Norovirus combinations can be detected effectively.

Pipeline Diagram

graph TD
A[FASTQ Input]  --> AA(Cutadapt)
AA --> D(Fastp)
D--> B(FastQC)
B --> C(MultiQC) 
D --> C
D --> G(Kraken2 - Filter)
G --> C
G --> I(PHAC Custom Dehoster)
I --> J(Spades - Assembly)
X[Genotype Database] --> K(Genotype Query - BlastN)
Y[Ptype Database] --> KA(Ptype Query - BlastN)
K --> KC(Call Best Genotype)
KA --> KD(Call Best Ptype)
K --> KE(Pick Best Reference/Contig)
KA --> KE
J --> K
J --> KA
J --> KB(Quast - QC)
KE --> M(BWA - Map Reads)
M --> N(Samtools - Sort & Filter)
N --> OA(Freebayes VCF)
N --> OB(Mpileup VCF)
OA --> P(Get Common SNPs)
OB --> P
P --> Q(Bcftools - Make Consensus)
Q --> QA(Add Background Seqs)
QA --> R(Mafft - Alignment)
Q --> R
R --> T(IQTree - Phylogenetic Tree)
T --> U(Summary Report)

Name		Name	Last commit message	Last commit date
Latest commit History 237 Commits
assets		assets
bin		bin
conf		conf
environments		environments
images		images
modules		modules
plot		plot
testing		testing
workflows		workflows
.gitignore		.gitignore
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Norovirus Nextflow Pipeline

Overview

Norovirus Typing

Pipeline Diagram

About

Releases 3

Packages

Languages

BCCDC-PHL/noro-typing-nf

Folders and files

Latest commit

History

Repository files navigation

Norovirus Nextflow Pipeline

Overview

Norovirus Typing

Pipeline Diagram

About

Resources

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages