Supporting tools for dnascore rare-variant analysis.

The scripts in this repository parse gnomAD VCF files to extract minor allele frequencies in the control population, and parse VEP-annotated VCF files to annotate a list of variants with gene symbols and predicted variant consequences.

gnomAD VCF file parser

To run the parser use the following command:

wget -qO- https://storage.googleapis.com/gcp-public-data--gnomad/release/2.1.1/vcf/exomes/gnomad.exomes.r2.1.1.sites.vcf.bgz | zcat | python3 gnomad.py list_of_variants.tsv > tbl.out
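As a rough illustration of the streaming step in the pipeline above, the sketch below reads decompressed VCF lines and pulls the allele frequency out of the INFO column. It is not the code in gnomad.py: the function name iter_allele_frequencies is hypothetical, and the sketch assumes one alternate allele per record and an AF key in INFO.

```python
def iter_allele_frequencies(lines):
    """Yield (chrom, pos, ref, alt, af) for each VCF data line that has an AF entry.

    Assumption: each record carries a single alternate allele, so AF is one number.
    """
    for line in lines:
        if line.startswith("#"):
            continue  # skip meta-information and column header lines
        # The first eight VCF columns are fixed: CHROM POS ID REF ALT QUAL FILTER INFO
        chrom, pos, _id, ref, alt, _qual, _filter, info = line.rstrip("\n").split("\t")[:8]
        for entry in info.split(";"):
            if entry.startswith("AF="):
                yield chrom, int(pos), ref, alt, float(entry[len("AF="):])
                break


# Invented records, for illustration only
example_lines = [
    "##fileformat=VCFv4.2\n",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n",
    "1\t55039768\t.\tC\tT\t.\tPASS\tAC=1;AF=0.5;AN=2\n",
]
for record in iter_allele_frequencies(example_lines):
    print(record)
```

Because the function accepts any iterable of lines, it could be fed sys.stdin directly when used at the end of a pipe like the one above.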

The list of variants should contain one variant per line with tab-separated fields (\t below stands for a tab character), for example:

chr1:55039768\tC\tT
chr1:55039805\tC\tT
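The format above can be read with a few lines of Python. This is a minimal sketch, not the parsing code used by gnomad.py; the function name read_variants is hypothetical, and the two trailing fields are simply returned in file order without assuming which allele is which.

```python
import io


def read_variants(handle):
    """Parse 'chrom:pos<TAB>allele<TAB>allele' lines into tuples."""
    variants = []
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue  # tolerate blank lines
        locus, first, second = line.split("\t")
        chrom, pos = locus.split(":")
        variants.append((chrom, int(pos), first, second))
    return variants


# The two example lines from above, with real tab characters
example = io.StringIO("chr1:55039768\tC\tT\nchr1:55039805\tC\tT\n")
for variant in read_variants(example):
    print(variant)
```

Storing the result in a set instead of a list would make membership checks fast when it is compared against a large VCF stream.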

VEP VCF file parser

To run the parser, first open the annotation file and find the VEP annotation format declared in the INFO section of the VCF header. The following command parses the file annotation.vcf, writing the result to output.tsv, using the corresponding VEP annotation structure:

python3 vep_parser.py annotation.vcf output.tsv "Allele|Consequence|IMPACT|SYMBOL|Gene|Feature_type|Feature|BIOTYPE|EXON|INTRON|HGVSc|HGVSp|cDNA_position|CDS_position|Protein_position|Amino_acids|Codons|Existing_variation|DISTANCE|STRAND|FLAGS|VARIANT_CLASS|SYMBOL_SOURCE|HGNC_ID|CANONICAL|REFSEQ_MATCH|SOURCE|GIVEN_REF|USED_REF|BAM_EDIT"
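To show why the format string is needed, the sketch below splits a VEP annotation from an INFO column into one dictionary per annotated transcript, using the pipe-separated field names as keys. It is an illustration under assumptions, not the logic of vep_parser.py: parse_csq is a hypothetical name, the INFO key is assumed to be CSQ (the VEP default, which may differ in a given file), and the example record and gene symbol are invented.

```python
def parse_csq(info, fmt, key="CSQ"):
    """Return one dict per comma-separated annotation found under `key` in INFO."""
    fields = fmt.split("|")
    prefix = key + "="
    for entry in info.split(";"):
        if entry.startswith(prefix):
            annotations = entry[len(prefix):].split(",")
            # Each annotation is pipe-separated, in the same order as the format string
            return [dict(zip(fields, a.split("|"))) for a in annotations]
    return []  # no VEP annotation on this record


# Shortened format string and invented INFO value, for illustration only
fmt = "Allele|Consequence|IMPACT|SYMBOL"
info = "AC=1;CSQ=T|missense_variant|MODERATE|GENE1,T|intron_variant|MODIFIER|GENE1"
for annotation in parse_csq(info, fmt):
    print(annotation["SYMBOL"], annotation["Consequence"])
```

If the format string passed on the command line does not match the one declared in the file, the field names end up paired with the wrong values, which is why it should be copied from the VCF header.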

