Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Preventing self search? #175

Open
ghost opened this issue Jul 25, 2017 · 5 comments
Open

Preventing self search? #175

ghost opened this issue Jul 25, 2017 · 5 comments

Comments

@ghost
Copy link

ghost commented Jul 25, 2017

Hello,

Is there an option to prevent Sibelia to find synteny block inside one of the 2 input genomes?
For example, when I compare genome_1 and genome_2, Sibelia will not only find the syntenic regions of the 2 genomes but will also find the duplications inside each genome.

I think it would be interesting to have an option to avoid that, so that we could just get the synteny blocks involving the 2 genomes. The "sharedonly" option doesn't give me what I want, because if the genome is polyploid then it returns empty files.

@iminkin
Copy link
Member

iminkin commented Jul 26, 2017

What should the output be if genomes are polyploid? For example, suppose you have two genomes:

G1: +A +A +B +B
G2: +A +B +A +B

Where A and B ary synteny blocks. So what would be the desired output in this case?

@ghost
Copy link
Author

ghost commented Jul 26, 2017

It would be, if we just consider the first "A" of the G1 genome

G1 +A and G2 +A
G1 +A and G2 +B
etc ...
And never G1 + A and G1 +B

If I understood correctly your example.

@iminkin
Copy link
Member

iminkin commented Jul 26, 2017

This is tricky because it is not clear why we should choose the first appearance of "A" as the "main" one, and not the second. There is no ready-made option to impose this behaviour in Sibelia. You can write a simple postprocessing script that takes ouput of Sibelia and "resolves" duplications the way you like.

@ghost
Copy link
Author

ghost commented Jul 26, 2017

Yes indeed, this is the approach I have undertaken. I am working on a very unusual organism for which no genomics tool is really well adapted (the bdelloid rotifers, which are degenerate polyploid with no homologuous chromosomes, which means I have to work with all allelic pairs or paralogs present. There is also no way to distinguish an allelic pair from a paralog). Sibelia is nevertheless the best so far, I really like your program.

@iminkin
Copy link
Member

iminkin commented Jul 26, 2017

That is cool :) I am glad that you found Sibelia useful.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

1 participant