Skip to content
/ kmsk Public

Mask homologous sequences in query fasta(s) by kmers of subject fasta

License

Notifications You must be signed in to change notification settings

slw287r/kmsk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

kmsk

Mask homologous sequences in query fasta(s) by kmers (k≤31) of subject fasta

Build

git clone https://github.com/slw287r/kmsk.git
cd kmsk
make

Usage

  • Output masked query fastas
kmsk -s t2t.fna -p t2tmsk -o . -t 8 562.fna 1280.fna 5476.fna
# output
562.t2tmsk.fna
1280.t2tmsk.fna
5476.t2tmsk.fna
  • Output masked region bed file
kmsk -r -s t2t.fna -p t2tmsk -o . -t 8 562.fna 1280.fna 5476.fna
# output
562.t2tmsk.bed
1280.t2tmsk.bed
5476.t2tmsk.bed

* ~40GB memory is used for the Homo sapiens subject.

About

Mask homologous sequences in query fasta(s) by kmers of subject fasta

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published