Skip to content

Latest commit

 

History

History
15 lines (12 loc) · 997 Bytes

StarshipsOnMiniXs.md

File metadata and controls

15 lines (12 loc) · 997 Bytes

Identify contigs that potentially belong to mini-chromosomes

  1. BLAST raw assembly against repeat-masked B71 reference genome minus mini-chromosome (Chr8) and identify contigs lacking extended matches. Here, we require alignments to be about 20 kb in length or larger by setting the -min_raw_gapped_score flag to be > 40000.
blastn -query U269_minion.fasta -query B71v2sh_masked.fasta -outfmt 7 -task dc-megablast -min_raw_gapped_score 40000 | grep ' 0 hits' -B 3

This will return a list list this:

BLASToutput.png

Note that tig0000402 does not have an extended match to the B71 reference genome even though it is >160 kb in length. This makes it a good candidate for a mini-chromosome segment.

  1. If we only want a list of candidate contigs, we can modify the above code:
blastn -query U269_minion.fasta -query B71v2sh_masked.fasta -outfmt 7 -task dc-megablast -min_raw_gapped_score 40000 | grep ' 0 hits' -B 3 | awk '$0 ~ /tig/ {print $3}'