feat: skip too short reads #32
Something about this PR doesn't sit right with me, beyond my inline comments.
E.g. I think this could lead to empty records in the output files, and lots of tools struggle with that. I know it would be a bigger change, but I wonder if it would be better to output such reads to a "rejected" file as opposed to an "unmatched" pile.
Similarly, what happens if the reads are supposed to have UMIs, and have enough bases to cover the sample barcodes, but not the UMIs? Those will demux just fine and end up in output files, but with blank UMIs?
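To make the "rejected" versus "unmatched" distinction concrete, here is a minimal sketch, not the code in this PR. The `Destination` enum, the `route` function, and the read layout (sample barcode followed directly by the UMI, exact barcode matching) are all assumptions made for illustration. A read that cannot cover both the barcode and the UMI is rejected up front, so it never reaches an output file with a blank UMI.

```rust
/// Hypothetical routing outcome for one read; names are illustrative only.
#[derive(Debug, PartialEq)]
enum Destination {
    /// Index of the matching sample barcode.
    Sample(usize),
    /// Long enough, but the barcode matched no sample.
    Unmatched,
    /// Too short to cover the barcode plus the UMI.
    Rejected,
}

/// Decide where a read goes. Assumes the read starts with the sample
/// barcode (`barcode_len` bases) followed by the UMI (`umi_len` bases),
/// and uses exact matching for simplicity.
fn route(read: &[u8], barcode_len: usize, umi_len: usize, barcodes: &[&[u8]]) -> Destination {
    // Check the full required length first, so a read that covers the
    // barcode but not the UMI is rejected instead of demultiplexed.
    if read.len() < barcode_len + umi_len {
        return Destination::Rejected;
    }
    let observed = &read[..barcode_len];
    match barcodes.iter().position(|b| *b == observed) {
        Some(index) => Destination::Sample(index),
        None => Destination::Unmatched,
    }
}
```

With a 4bp barcode and a 3bp UMI, a 6bp read is rejected even if its first four bases match a sample, which is the case raised above.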
src/bin/commands/demux.rs
Outdated
e
)
});
.unwrap_or((&EMPTY_BASES, &EMPTY_QUALS));
I don't like doing this here because it will also eat any other errors coming out of extract_bases_and_quals. I think this will also (unexpectedly) lead to situations where a segment gets returned with 0 bases if it is defined as 8bp long and there are only 7bp available?
Should we perhaps change .extract_bases_and_quals() to extract the sub-set of bases that are available, and update the documentation to match?
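One way to avoid swallowing unrelated errors is to match on the specific "too short" case and propagate everything else. The sketch below is an assumption-laden illustration: `ExtractError`, its variants, and this simplified `extract_bases_and_quals` signature are hypothetical stand-ins, not the project's actual types.

```rust
/// Hypothetical error type; the project's real error type may differ.
#[derive(Debug, PartialEq)]
enum ExtractError {
    TooFewBases { needed: usize, available: usize },
    Other(String),
}

/// Hypothetical stand-in for `extract_bases_and_quals`: returns the
/// bases and quals in `[start, start + len)`, or an error.
fn extract_bases_and_quals<'a>(
    bases: &'a [u8],
    quals: &'a [u8],
    start: usize,
    len: usize,
) -> Result<(&'a [u8], &'a [u8]), ExtractError> {
    if bases.len() != quals.len() {
        return Err(ExtractError::Other("bases and quals differ in length".to_string()));
    }
    let end = start + len;
    if end > bases.len() {
        return Err(ExtractError::TooFewBases { needed: end, available: bases.len() });
    }
    Ok((&bases[start..end], &quals[start..end]))
}

/// Returns `Ok(None)` only when the read is too short for the segment;
/// every other error is passed through to the caller unchanged.
fn extract_or_skip<'a>(
    bases: &'a [u8],
    quals: &'a [u8],
    start: usize,
    len: usize,
) -> Result<Option<(&'a [u8], &'a [u8])>, ExtractError> {
    match extract_bases_and_quals(bases, quals, start, len) {
        Ok(pair) => Ok(Some(pair)),
        Err(ExtractError::TooFewBases { .. }) => Ok(None),
        Err(other) => Err(other),
    }
}
```

Under these assumptions, an 8bp segment requested from a 7bp read yields `Ok(None)` and not a segment with 0 bases, so the caller has to decide explicitly what to do with the read.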
The --skip-reasons option specifies the reason(s) to skip reads, with only TooFewBases currently supported.
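As a rough illustration of how a skip-reason value could be modelled, the sketch below parses a reason from a string and checks it against the enabled set. The `SkipReason` enum shape, the accepted spellings, and the `should_skip` helper are assumptions; the actual command-line parsing in this PR may look quite different.

```rust
use std::str::FromStr;

/// Hypothetical model of the supported skip reasons.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum SkipReason {
    TooFewBases,
}

impl FromStr for SkipReason {
    type Err = String;

    /// Accepts "TooFewBases" or "too-few-bases" (case-insensitive);
    /// the accepted spellings here are an assumption.
    fn from_str(s: &str) -> Result<Self, Self::Err> {
        match s.to_lowercase().replace('-', "").as_str() {
            "toofewbases" => Ok(SkipReason::TooFewBases),
            _ => Err(format!("unknown skip reason: {s}")),
        }
    }
}

/// A read failing for `reason` is skipped only if that reason was enabled.
fn should_skip(reason: SkipReason, enabled: &[SkipReason]) -> bool {
    enabled.contains(&reason)
}
```

Keeping the reasons in an enum means a new reason must be added explicitly, so an unrecognised value fails at parse time and is not silently ignored.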
33dd5c4 to 6fe2ce9
Approved once comments addressed.
Fixes: #27