Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

scDRS analysis on a large single-cell RNA-seq dataset #111

Open
AI-10 opened this issue Nov 25, 2024 · 3 comments
Open

scDRS analysis on a large single-cell RNA-seq dataset #111

AI-10 opened this issue Nov 25, 2024 · 3 comments

Comments

@AI-10
Copy link

AI-10 commented Nov 25, 2024

hey,

I am trying to run scDRS on a single-cell data containing three million cells, but the analysis failed due to server memory limitations. Are there any alternative ways to proceed with the analysis?

I plan to perform 50 random samplings of 5% of the cells for analysis, but I’m not sure if this is reasonable.

@AI-10 AI-10 changed the title scDRS analysis on a vary large single-cell RNA-seq dataset scDRS analysis on a large single-cell RNA-seq dataset Nov 25, 2024
@martinjzhang
Copy link
Owner

Sounds like a good plan. scDRS memory requirement is roughly 3X the size of the data. You may consider randomly splitting your into into X parts, run scDRS on each part, and then aggregate the results.

@AI-10
Copy link
Author

AI-10 commented Nov 25, 2024

Thanks for your timely reply

I'm considering how to combine the results of multiple parts? How do I get a combined Z value and P value? Can I average the Z-scores for the different parts?

Best regards,
Gong

@martinjzhang
Copy link
Owner

You can directly concatenate the results for the individual-cell level analyses.
For the group-level analysis, you can use Fisher's method (for a given cell type across subsets of data) to combine p-values.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants