-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data sparsity is a critical problem for the inference stability (ex. 10x data) #32
Comments
Hi, Thank you for the question. The imputation results look great! Another possibility for the low power is that the background cells are also moderately relevant to the disease (PBMCs) and may kill the signal. We are working on a feature that could potentially address this issue (see #30 (comment)). Have you tried adjusting for cell type proportions using |
Thank you very much for your quick response. I also tried --adj-prop, but the following error occurred on the pbmc3k dataset...
Installation of scDRS
|
Hi, Could you check that |
Thank you, it contains the cell_type column.
Here is the notebook for the preparation https://github.com/yyoshiaki/scDRS_example/blob/master/notebook/prep_data.ipynb and h5ad https://github.com/yyoshiaki/scDRS_example/blob/master/scanpy/pbmc/pbmc.h5ad file. |
Hi @yyoshiaki , Thank you for the information. The error corresponds to a numerical issue due to the inclusion of many genes with 0 expression (16159/32738 of genes in the data). This is a boundary case we are not handling perfectly right now, so scDRS may or may not give errors when you try different things. This probably explains why the error didn't occur before. I will add it to my to-do list. I suggest turning on the |
Hi @yyoshiaki , Thank you again for reporting the issue. We just pushed a new version To use this version, please pull info from the latest git repo and update using |
Thank you @yyoshiaki for sharing the results w/ and w/o Best, |
Helpful to see this exchange - thank you. |
Try MAGIC van Dijk Cell 2018. Our preliminary results show that it improves power and provides conservative estimates. |
Super, thanks! |
Hi, Thank you for the wonderful tool.
I noticed that the detection power for sparse data such as 10x is very low.
Imputation improved the stability, but I want to avoid using imputation if possible because it can distort the data.
I'm attaching an example using the pbmc3k dataset.
Regarding attaching examples, autoimmune diseases are not still significant...
Is there any way to deal with these?
Disease scores for raw expression value notebook
Disease scores for imputed expression value notebook
codes for the results
The text was updated successfully, but these errors were encountered: