str_ase

Allele-specific expression (ASE) for short tandem repeats (STRs)

download

Execute the following command.

git clone https://github.com/aryarm/str_ase

Example data for the pipeline is also available for download upon request.

setup

The pipeline is written as a Snakefile which can be executed via Snakemake. For reproduciblity, we recommend installing version 5.30.2:

conda create -n snakemake -c conda-forge --no-channel-priority 'bioconda::snakemake==5.30.2'

We highly recommend you install Snakemake via conda like this so that you can use the --use-conda flag when calling snakemake to let it automatically handle all dependencies of the pipeline. Otherwise, you must manually install the dependencies listed in the env files.

execution

Activate snakemake via conda:
```
conda activate snakemake
```
Execute the pipeline on the example data

Locally:
```
./run &
```

Log files describing the output of the pipeline will be created within the output directory. The log file contains a basic description of the progress of each rule, while the qlog file is more detailed.

Executing the pipeline on your own data

You must modify the config.yaml file to specify paths to your data before you perform step 2 above. Currently, the pipeline is configured to run on our example data.

If this is your first time using Snakemake

We recommend that you run snakemake --help to learn about Snakemake's options. For example, to check that the pipeline will be executed correctly before you run it, you can call Snakemake with the -n -p -r flags. This is also a good way to familiarize yourself with the steps of the pipeline and their inputs and outputs (the latter of which are inputs to the first rule in each workflow -- ie the all rule).

Note that Snakemake will not recreate output that it has already generated, unless you request it. If a job fails or is interrupted, subsequent executions of Snakemake will just pick up where it left off. This can also apply to files that you create and provide in place of the files it would have generated.

By default, the pipeline will automatically delete some files it deems unnecessary (ex: unsorted copies of a VCF). You can opt to keep these files instead by providing the --notemp flag to Snakemake when executing the pipeline.

files

Snakefile

A Snakemake pipeline defining rules for every step of the analysis.

config.yml

Defines options and input for the Snakemake pipeline.

run

An example bash script for executing the pipeline using snakemake and conda. Any arguments to this script are passed directly to snakemake.

envs/

Conda environment files for the dependencies of our pipeline.

data/

Example data for the pipeline, which is available upon request. You can also generate your own using the prepare pipeline.

rules/

Snakemake rules used by the pipeline. See the rules README for more information about each file.

scripts/

Various scripts that are utilized within the pipeline. See the scripts README for more information about each file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

str_ase

download

setup

execution

Executing the pipeline on your own data

If this is your first time using Snakemake

files

Snakefile

config.yml

run

envs/

data/

rules/

scripts/

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
envs		envs
rules		rules
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Snakefile		Snakefile
config.yml		config.yml
run		run

License

aryarm/str_ase

Folders and files

Latest commit

History

Repository files navigation

str_ase

download

setup

execution

Executing the pipeline on your own data

If this is your first time using Snakemake

files

Snakefile

config.yml

run

envs/

data/

rules/

scripts/

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages