Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NorwegianFrequencies and location of markers #3

Open
thoree opened this issue Jan 18, 2021 · 4 comments
Open

NorwegianFrequencies and location of markers #3

thoree opened this issue Jan 18, 2021 · 4 comments
Assignees

Comments

@thoree
Copy link

thoree commented Jan 18, 2021

Great app! Would it be possible to include the NorwegianFrequencies, the 35 markers available in R Familias and the R library forrel. It would generally be very nice if also chromosome location of markers could be available using the app, a suggestion for the the to do list!

@knifecake
Copy link
Owner

Including chromosome location of markers is definitely on the roadmap! As for the Norwegian frequencies available in R Familias, I'll look into it (I need to find a reference and then I can run the data through the same pipeline as the others so that the formats are consistent).

@knifecake knifecake self-assigned this Jan 18, 2021
@thoree
Copy link
Author

thoree commented Jan 19, 2021

Reassuring roadmap! The source for NorwegianFrequencies is (I don't know if it helps): Dupuy et al. (2013): Frequency data for 35 autosomal STR markers in a Norwegian, an East African, an East Asian and Middle Asian population and simulation of adequate database size. Forensic Science International: Genetics Supplement Series, Volume 4 (1).

@knifecake
Copy link
Owner

knifecake commented Feb 2, 2021

The study by Dupuy et al. is also now part of the pipeline and will be added in the next release. A preview is available here: https://next.leapdna.org/explore/study/dupuy2013_norway. I had some trouble finding coordinates for two markers: D11S554 and D17S906. I hope to find some soon. Also, the paper reports H_obs and H_exp are available online but I've been unable to extract them from the .fam file available in familias.no (perhaps it is because I'm using forrel::readFam and this is not supported). H_exp, I can calculate myself, but I'm following the convention that if a paper reports them, I use what it says.

Chromosome location of markers will be available in the next release of leapdna which can be previewed at https://next.leapdna.org. Coordinates wrt. GRCh38 will be included when known in frequency studies downloaded in the leapdna format. I think these can be especially useful for mutation models and for crossover simulations as I know some R libraries do. My intention is to make it easy to read leapdna files in R by writing and sharing a couple of functions.

@thoree
Copy link
Author

thoree commented Feb 3, 2021

Works nicely - thanks you so much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants