assignTaxonomic() never ends with UNITE reference dataset #2069

danielsangarci · 2025-01-03T16:17:58Z

Hello,

I am analyzing 16S rRNA and ITS sequences from bacteria and fungi. While the assignTaxonomic() function works as expected with the 16S sequences and SILVA reference dataset (processing completes in a few hours), I encounter an issue when running it on ITS sequences with the UNITE reference dataset (the process never completes, even after more than 24 hours).

I have tried running it with a small subset of data (2 samples with 10 sequences each), but it still never ends.

seqtab.nochim <- seqtab.nochim[5:6,1:10]
taxa <- assignTaxonomy(seqtab.nochim, "sh_general_release_dynamic_04.04.2024.fasta", multithread=TRUE)
UNITE fungal taxonomic reference detected.

Do you know what could be the problem or how could i solve it?
Thank you so much in advance.

benjjneb · 2025-01-07T18:34:43Z

My guess is you are hitting the memory ceiling, which slows down assignTaxonomy extremely as it has to start swapping. How much memory is available in the compute environment you are using? Do you have access to something with more available memory?

danielsangarci · 2025-01-09T10:54:31Z

My laptop has 8Gb of RAM, so its possible thats the problem.

But... why did it work then with 16S and SILVA reference dataset? does 16S analysis requere less RAM memory?

benjjneb · 2025-01-09T15:19:47Z

Nothing specifically about 16S, it's the size of the database both in terms of number of sequences and in the number of unique terminal taxa that determines the memory requirements.

32GB is definitely enough to use UNITE (that's what I have on my machine). There's a good chance 16GB is enough as well, it used to be with older versions of UNITE.

danielsangarci changed the title ~~assignTaxonomic() never ends with UNITE references dataset~~ assignTaxonomic() never ends with UNITE reference dataset Jan 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assignTaxonomic() never ends with UNITE reference dataset #2069

assignTaxonomic() never ends with UNITE reference dataset #2069

danielsangarci commented Jan 3, 2025 •

edited

Loading

benjjneb commented Jan 7, 2025

danielsangarci commented Jan 9, 2025

benjjneb commented Jan 9, 2025

assignTaxonomic() never ends with UNITE reference dataset #2069

assignTaxonomic() never ends with UNITE reference dataset #2069

Comments

danielsangarci commented Jan 3, 2025 • edited Loading

benjjneb commented Jan 7, 2025

danielsangarci commented Jan 9, 2025

benjjneb commented Jan 9, 2025

danielsangarci commented Jan 3, 2025 •

edited

Loading