- Check the quality of the data using Reads_QC (Hafnia spp project in VIGASP)
- Check the species identification using Species Abundance pipeline
- Use the reference data (Hafnia_PostiveControl project in VIGASP) given by Girum to find a cut-off for species identification process
- Build Genome Assembly for the Hafnia data in Hafnia spp project in VIGASP
- Download the Gene sequences for the list of AMR genes given by Girum
- Download the Gene sequences for the list of Virulence genes given by Girum
- Run Abricate on the AMR and VirulenceGenes and tabulate the results.
- Use CoreGenomeSNP_Phylogeny pipeline to build the Tree
-
Check the quality of the data using Reads_QC - DONE 94 Hafnia samples from "Hafnia spp." project was analysed using "Reads_QC" pipeline Results:
- Coverage
- Phred Score
- QC%
- Predicted Species (using proportion of reads mapped) using Kraken2 and Bracken
-
Check the species identification using Species Abundance pipeline - DONE
- Predicted species using Kraken2 and Bracken (kind of redundant results from Reads_QC)
-
Use the reference data to find a cut-off for species identification process - DONE
- Girum sent us 3 Hafnia strains to use as reference to decide on Cut-off for species confirmation
- The 3 sample were analyzed using Reads_QC pipeline and "ProportionofReads" mapped used as cut-off for species identification prediction
-
Download the Gene sequences for the list of AMR genes.
-
Update: Many of the nucleotide_IDs are outdated and not able to download only the gene sequences
-
Suggestions from Håkon: Use NCBIAmrPlus for AMR genes
-
Use ALPPACA for the SNP analysis and phylogenetics.
- Total number of WGS data (isolates) 92 in VIGAS
- Quality control, species and contamination check : DONE (VIGAS)
- Assembly and annotation : DONE (VIGAS)
- AMR using NCBI-AMRFinderPlus : DONE (SAGA)
- Phylogenetics: Core genome SNPs using ALPPACA : DONE (SAGA)
- Plasmid Finder/assembler using MOBSUIT: DONE (SAGA)
- Use Abricate with VFDB for the 4 reference once
- Abrticate against AMRFinderPlus (NCBI) DB as well
- Abricate against CARD, MEGARES
#!/bin/bash
input="List_of_Fasta_Files.csv"
while IFS= read -r line
do
echo "$line"
IF="All_Genomes/$line"
OF_1="Hafnia_Hafnia_AMR_NCBI/"$line"_NCBI"
OF_2="Hafnia_Hafnia_AMR_MEGARES/"$line"_MEGARES"
OF_3="Hafnia_Hafnia_AMR_VFDB/"$line"_VFDB"
OF_4="Hafnia_Hafnia_AMR_CARD/"$line"_CARD"
abricate --db ncbi --quiet $IF --minid 50 --mincov 60 >$OF_1
abricate --db megares --quiet $IF --minid 50 --mincov 60 >$OF_2
abricate --db vfdb --quiet $IF --minid 50 --mincov 60 >$OF_3
abricate --db card --quiet $IF --minid 50 --mincov 60 >$OF_4
done < "$input"
#!/bin/bash
input="List_of_Fasta_Files.csv"
while IFS= read -r line
do
echo "$line"
IF="Assemblies/$line"
OF="Hafnia_Plamids/"$line"_Plasmid"
echo $OF
mob_recon --infile $IF --outdir $OF
done < "$input"
fastANI --q1 FastANI_input1.txt --q1 FastANI_input2.txt -o fastANI/All_vs_All.txt
#!/bin/bash
input="List_of_Fasta_Files.csv"
while IFS= read -r line
do
echo "$line"
IF="Assemblies/$line"
OF=$line"_output.csv"
echo $OF
amrfinder -n $IF --plus >$OF
done < "$input