How to estimate the gene flow betweem sister species using gIMble? #148
-
Hello, thank you very much for developing this tool. I saw that the authors used gIMble to estimate gene flow across genome-wide windows in the paper “Chromosome Fissions and Fusions Act as Barriers to Gene Flow between Brenthis Fritillary Butterflies.” However, I didn’t find information related to gene flow analysis on the GitHub page. Should I expect gene flow estimates to be included in the output files after sequentially running parse, blocks, ......, gridsearch, and simulate? |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
Converting to a discussion, this question might be useful to others. |
Beta Was this translation helpful? Give feedback.
-
Hi, I have a pair of sister species, A and B. They were originally separated by geography, but that barrier has now disappeared. Species A has undergone several chromosomal rearrangements, and I want to know whether gene flow between the two sister species is reduced across those rearranged chromosomes. I’ve noticed that gIMble can only estimate gene flow in one direction at a time, so my plan is to calculate gene flow separately for each direction and then take the average. |
Beta Was this translation helpful? Give feedback.
-
How one should interpret unidirectional gene flow estimates is an interesting and tricky question and answering it would require checks on simulated data. Given that many of the signals in the data that inform us about m (e.g. topology info in the form of shared heterozygous sites) are not sensitive to the direction of gene flow (all else being equal), I would argue that ii) is more likely. So I think your idea of interpreting the average of unidirectional m estimates for each direction as our best estimate for the the rate of bidirectional gene flow does make sense. However, I would suggest dividing this average by 2! One thing worth trying may be to fit a model of bidirectional gene flow (using SFS based methods) as a sanity check. |
Beta Was this translation helpful? Give feedback.
If you fit a model with gene flow, then the gene flow estimates will indeed be included as part of the parameter combinations that are explored during gridsearch. By default processing results get stored in the data store that contains all the data of a single project. You'll have to run
query
to extract this information from the store.