Command: k-mer abundance histogram #1846

Open
taranglute opened this issue Mar 6, 2018 · 1 comment

Comments

@taranglute

We have downloaded khmer version 2.1.1 and now want to benchmark computing the full k-mer abundance histogram.

Is the following command correct for obtaining "the full k-mer abundance histogram" when run on a dataset such as human HS3 for the benchmark?

./abundance-dist-single.py -k 25 -T 12 input.fastq output_histo

@standage
Member

standage commented Mar 6, 2018

Hi @taranglute.

Your command looks mostly correct, although:

  • It's not typical to run commands from the scripts/ directory, so the ./ prefix may not make sense. Did you follow the latest installation instructions? If so, you should be able to execute abundance-dist-single.py from any directory.
  • The command uses a fixed amount of memory, and by default that amount is very small. For human whole-genome shotgun data, dozens of gigabytes of memory must be available if you want accurate k-mer abundances. For example, to allocate 32 gigabytes for computing the k-mer counts, add -M 32G to the command.
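
Putting both points together, the adjusted invocation might look like the sketch below. It assumes khmer is installed so that abundance-dist-single.py is on the PATH. input.fastq and output_histo are the placeholder file names from the question, and the KSIZE, THREADS, MAX_MEM, and CMD variables are introduced here only for illustration. The 32G figure is the example value from above, not a tuned recommendation.

```shell
# Settings from the thread; adjust MAX_MEM to what the machine can spare.
KSIZE=25
THREADS=12
MAX_MEM=32G   # example value from the thread; size this to the available RAM

# No ./ prefix: an installed khmer puts the script on PATH.
CMD="abundance-dist-single.py -k $KSIZE -T $THREADS -M $MAX_MEM input.fastq output_histo"

# Run only if the script and the input file are both present;
# otherwise just print the command that would be executed.
if command -v abundance-dist-single.py >/dev/null 2>&1 && [ -f input.fastq ]; then
    $CMD
else
    echo "Would run: $CMD"
fi
```

If the script is not found, that usually points back to the installation step rather than to the command itself.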
