Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SCREADER: Prompting Large Language Models to Interpret scRNA-seq Data #70

Open
szalata opened this issue Jan 6, 2025 · 2 comments
Open
Labels
question Further information is requested

Comments

@szalata
Copy link
Collaborator

szalata commented Jan 6, 2025

https://arxiv.org/abs/2412.18156

@coco11563
Copy link

coco11563 commented Jan 15, 2025

scReader: Prompting Large Language Models to Interpret scRNA-seq Data

Method:
scReader leverages large language models (LLMs) to interpret single-cell RNA sequencing (scRNA-seq) data by generating gene embeddings based on NCBI functional descriptions (e.g., gene type, organism, and expression location). The pipeline begins by ranking and selecting the top 2048 highly variable genes (HVG). Each selected gene is assigned an embedding, derived by inputting its NCBI functional description into GPT-3.5. These gene embeddings are concatenated to form a cell embedding. This cell embedding, combined with an instruction embedding (generated from task-specific prompts), is fed into a transformer-based LLM. The model processes this input and directs the class token to downstream tasks such as cell type classification.

Findings:
Experimental results demonstrate that integrating LLMs into single-cell omics analysis pipelines significantly improves the interpretation and classification of cell types. Furthermore, this method shows potential for applications in multi-omics integration and rare cell type identification, offering valuable insights for precision medicine and developmental biology.

Dataset:
The experiments were conducted on two in-house datasets.

Reference:
@Article{li2024screader,
title={scReader: Prompting Large Language Models to Interpret scRNA-seq Data},
author={Li, Cong and Long, Qingqing and Zhou, Yuanchun and Xiao, Meng},
journal={arXiv preprint arXiv:2412.18156},
year={2024}
}

@Paulos2411
Copy link
Collaborator

Paulos2411 commented Jan 21, 2025

Hi,

thanks for your summary. Could you perhaps summarize the information in a way as it has been done on the website for your specific case.

https://theislab.github.io/single-cell-transformer-papers/

I will then add them or feel free to create a PR.

Thanks :)

@Paulos2411 Paulos2411 changed the title add screader SCREADER: Prompting Large Language Models to Interpret scRNA-seq Data Jan 28, 2025
@Paulos2411 Paulos2411 added the question Further information is requested label Jan 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants