Team2-ComparativeGenomics

Summary

This pipeline is designed by Team 2, Group 4. The purpose is to perform comparative analysis of genomes of various bacterial isolates.

** Written by Jiahan Zhan, Maggie Fisher, Jihyun Park, Vineeth Aljapur, Di Zhou, Bridget Neary **

USAGE

./comparative_genomics_scripts/COMPARATIVE_GENOMICS_PIPELINE.sh

REQUIREMENTS

please follow the respective installation instructions

install MentaLiST
install kSNP3.0_Linux_package
insall mummer

Input

The input files should be in fastq format

MLST

MLST Requirements

Must have MentaLiST installed along with its dependencies (ie. Julia: https://julialang.org/downloads/)
fastq raw data in the designated "$dataset" directory: dataset=/projects/team2/genome_assembly/dataset

MentaLiST will run in two steps:

build the kmer database

This will pull fasta files used to build the database and store them in a MLST_FASTA_FILES directory. The database will be outputted as MLST.db. Both are required for MLST calling.

MLST calling

The output from this step will be placed in the ./MLST_RESULTS directory. Several files will be created with the prefix "MLST_calls"; the file with the sequence types will be in the file called MLST_calls.txt

The output from the MLST analysis can be used to measure "closeness" of the isolates used and visulized in several ways, for instance minimum spanning trees.

kSNP3

kSNP3 Preparations

File name: a file ID and an extension. (e.g. Escherichia_coli_042.fasta)
Input list file: one file per genome. The genome can be a finished (closed) sequence, multiple chromosomes and plasmids, an assembly of multiple contigs, or raw, unassembled reads.
our input file list path: /projects/team2/comp_gen_temp/ksnp_run_folder/input_list.txt

ANI

requirements

Python 3: could be installed easily without sudo access using anaconda
mummer: please follow the instructions of the mummer package to install mummer

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
comparative_genomics_scripts		comparative_genomics_scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Team2-ComparativeGenomics

Summary

USAGE

REQUIREMENTS

Input

MLST

MLST Requirements

kSNP3

kSNP3 Preparations

ANI

requirements

About

Releases

Packages

Languages

compgenomics2019/Team2-ComparativeGenomics

Folders and files

Latest commit

History

Repository files navigation

Team2-ComparativeGenomics

Summary

USAGE

REQUIREMENTS

Input

MLST

MLST Requirements

kSNP3

kSNP3 Preparations

ANI

requirements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages