Welcome to the first public release of scAgent, an AI-powered assistant for single-cell RNA-seq analysis. This version includes the core functionality for automatic preprocessing, QC visualization, and report generation — designed to be a starting point for researchers who want fast insights from their 10x or Anndata files.
- Upload support for raw 10x files or .h5ad objects
- Automatic preprocessing pipeline (no parameter tuning required)
- Best-practice defaults + intelligent data-based thresholds
- UMAP, PCA, and QC visualizations (Violin, Scatter, HVG)
- Clean UI with auto-pilot mode and side info panel
- Publication-ready PDF report generation (LaTeX-powered)
- GEO accession auto-detection with study metadata linking
- Docker + Streamlit app ready
- AI-powered cell type annotation chat interface using GPT-2
- Few-shot learning for accurate cell type predictions
- Interactive chat history and sample questions
- Enhanced LLM integration for natural language querying
- Chat-driven exploration of cell populations, marker genes, and more
- Real-time explanation of plots and QC steps
- RAG with GEO descriptions for study summaries
- Fine-tuning of language models for better cell type annotation
git clone https://github.com/vasighiz/scagentic.git
cd scagentic
docker build -t scagent .
docker run -p 8501:8501 scagent
git clone https://github.com/vasighiz/scagentic.git
cd scagentic
conda env create -f environment.yml
conda activate scagentic
streamlit run app.py
cd scagentic
streamlit run cell_type_chat.py
- Raw 10x folder: matrix.mtx.gz, features.tsv.gz, barcodes.tsv.gz
- Processed: .h5ad
MIT License
Akram Vasighi | LinkedIn | Website
If you find this tool helpful, give it a ⭐ on GitHub and share with other bioinformaticians!
Please open an issue or email me if you'd like to suggest a feature, report a bug, or collaborate.