-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NlaIII digestion problem #39
Comments
I had the same issue, and I think it's because you need to provide the dangling sequence too on the otherside of the caret: https://en.wikipedia.org/wiki/NlaIII so I got something to print, when I used
Edit: ignore this comment, see updated issue below |
After more playing around I realise that "CATG^" should actually work, and that the sequences being looked for should just be "CATG" and not "CATGCATG" or "CATGGTAC" or any other. Currently the truncated file is completely improperly truncated: If I have a sequenceI'll call this file test1.fq
How CATG^GTAC is truncated
yields: "Truncating sequences at occurrence of sequences '[CATGGTAC]'"
Note that each read which matches "CATGGTAC" is cut, and the sequence ends with it How CATG^ is truncated:
yields: "Truncating sequences at occurrence of sequences '[]'"
Note how the read has basically just vanished. This is wrong What CATG^ should be producing
"Truncating sequences at occurrence of sequences '[CATG]'"
Note how each read is truncated by ends with the desired sequence. |
This fix is implemented in PR above |
We used NlaIII enzyme to digest in our Hi-C.
I specified --re1 CATG^,NlaIII when I run hicup_digester, the result file seems good, here shows the head of the file.
But when I run HICUP with it, it gives no results, from the log I found there is no sequence in [].
Truncating with HiCUP Truncater v0.7.4
Truncating sequences at occurrence of sequences '[]'
Truncating sequences
The text was updated successfully, but these errors were encountered: