Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bwa-Mem and tied pair-placements #42

Open
wants to merge 5 commits into
base: master
Choose a base branch
from
Open

Bwa-Mem and tied pair-placements #42

wants to merge 5 commits into from

Conversation

boukn
Copy link

@boukn boukn commented May 6, 2015

It looks like bwa-mem pairing can only ever pick one 'best' paired placement, and discards the rest, even if they are ties according to the scoring. Also, tie-breaking is essentially handled by the sort and the hash of align-id and read-id. This can lead to funky results between assemblies.

For example, on grch38, chr2:36341197 and chr3:53400868 are in regions that are 100% identical to each other, so read pairs that align to one actually align to both, equally well. This region also happens to be 100% identical in grch37. But I was getting results where on grch37 a test pair was placed on chr3, and on grch38 it was placed on chr2, with no difference in alignment quality to justify the difference. The sort-by-hash behavior was causing the change in results, but the only-return-one behavior was the real problem. The fix here is to simply return all placements that are tied with the top placement. It does mean more dup hits, but its better than inexplicable differences in what should be identical result.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant