-
Notifications
You must be signed in to change notification settings - Fork 26
Description
Hey!!
I have an aligned bam from direct RNA sequencing, and it has multiple mods attached to it (m6A, 5mC, 5hmC, and pseudouridine).
Around 6 months ago we ran into this issue where when we would use modkit extract calls to extract multiple modification from RNA data, it would create an insanely large bam and/or crash out on memory and couldn't finish the process because it was using so much RAM (even after throttling with setting threads, queue-size, and interval-size). I solved this problem by using the --call-code parameter and doing one mod at a time, decreasing memory and size of our output files.
Now, I'm noticing --call-code is not even a parameter or option anymore in the latest versions of modkit. I can't seem to figure out 1) what older version I was using 6ish months ago and 2) what versions actually had that parameter as an option.
If I want to use modkit and extract one modification at a time, is there another way to do so rather than just doing a awk command after it is complete? Or, can you tell me what was the last version to use --call-code so I can pull a respective container with the right version?
Lastly, is this something you can add back in? Because at least for us it was extremely useful and our bams were so large we couldn't even get the process to work without doing one modification at a time. Thanks!
Hailey Z