To run this workflow, the following tools need to be available:
- Add all sample ids to
samples.tsv
in the columnsample
. - Add all sample data information to
units.tsv
. Each row represents afastq
file pair with corresponding forward and reverse reads. Also indicate the sample id, run id and lane number, adapter.
- You need a ...
Coming soon...
The workflow is designed for WGS data meaning huge datasets which require a lot of compute power. For HPC clusters, it is recommended to use a cluster profile and run something like:
snakemake -s /path/to/Snakefile --profile my-awesome-profile