Skip to content

Latest commit

 

History

History
46 lines (33 loc) · 3.97 KB

README.md

File metadata and controls

46 lines (33 loc) · 3.97 KB

Fungal annotation resources to work with fungal genomics data in R

This repository has various annotation resources for different fungal genomes I have worked on. One can easily create these resources on their own using R packages like AnnotationForge, GenomicFeatures etc.

OrgDb packages: To prepare OrgDb package for an organism, gene information, GO assignment, KEGG pathway data was extracted from various public domain databases like AspGD, CGD, FunigDB, NCBI etc. This curated data was used with R function AnnotationForge::makeOrgPackage() to create OrgDb package. One can store different form of information for each gene in OrgDb package and once created, can easily extract the information using AnnotationDbi::select() function. This orgDb package can be used in GO enrichment analysis using clusterProfiler package.

TxDb packages: Most updated GFF or GTF file was obtained for organism of interest from public domain databases. All the TxDb packages were created using GenomicFeatures::makeTxDbFromGFF() function in R. Various Bioconductor R packages require a TxDb object of an organism for data analysis. See here for the details.

BSgenome packages: This package holds the FASTA sequence data in form of a R package object. BSgenome object allows easy manipulation of sequence data using R. One can imagine its function similar to samtools package. BioConductor version 3.12 onwards do not create BSgenome package from .2bit files if the genome is not NCBI or UCSC genome assembly.

topGO mapping files: OrgDb pacakge created for each organism in previous step was used to extract gene to GO assignment table. This table was saved in tabular file which is used by GO enrichment package topGO.

Aspergillus fumigatus Af293

Aspergillus fumigatus A1163

Aspergillus nidulans FGSCA4

Candida albicans SC5314 A21

Candida auris




License: GPL v2