-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gene level threshold #42
Comments
I'm not related to the eQTL Catalogue, but regarding your first question, once @kauralasoo told me that the "*.all.tsv.gz" files contain full summary statistics, and "*.cc.tsv.gz" contain all genes that had FDR < 0.01 and at least one fine mapped credible set. |
Dear Vibha, Sorry for the long delay in responding to this. Regarding the difference between ".cc.tsv.gz" and ".all.tsv.gz", please see this section from the eQTL Catalogue 2023 paper:
The ".cc.tsv.gz" files are filtred based on the fine mapping results and the and ".all.tsv.gz" are the unfiltered summary statistics. There is no option right now to determine a pvalue threshold for each gene for all variants based on the permutation results. Your best option is to either use a fixed genome-wide threshold (e.g. p < 1e-6 or something similar) or use the fine mapped credible sets to identify multiple indepedent signals per gene. Best wishes, |
Thank you so much Dr. Alasoo and Vitor :) |
Dear eQTLCatalogue team,
We are using eQTL catalogue to discover eQTLs in our selected variants. I have several confusions regarding the datasets and was wondering if you could provide some insights into it.
What is the difference between the *.cc.tsv files versus the *.all.tsv.gz files in the repository,
Also, we were thinking to apply gene-based threshold similar to GTEx pipeline , where the genome-wide threshold is determined using FDR and beta parameters of the minimum P value from permutation is used to determine the nominal P value threshold for each gene.
Although the p_beta, p_perm are available in *.permutation files, the beta parameters (shape_1, shape_2 aka alpha, beta) are not available.
I wonder if there is a way we can determine the pvalue threshold for each gene for all variants,
Thank you so much,
Regards,
Vibha
The text was updated successfully, but these errors were encountered: