This repository contains the BacSC pipeline for bacterial single-cell sequencing data analysis (Ostner et al., 2024). It also contains all code needed to reproduce the analyses in the publication (TODO).
For usage info, please refer to the tutorials in the respective directory.
Raw and intermediate data objects can be downloaded on zenodo.
Simply download the data directory from there and move it into the BacSC directory.
The tutorials directory contains two tutorial notebooks.
The easiest way to use BacSC on your data is to copy these notebooks and replace the respective paths at the top of the notebooks.
BacSC_tutorial_general_pipeline.ipynbshows how to perform Variance stabilization, dimension reduction and clustering.BacSC_cluster_analysis_tutorial.ipynbgives a template for differential expression testing with BacSC.
This repository is structured as follows:
- The
toolsdirectory contains all functions and code used throughout the BacSC pipeline. - The
tutorialsdirectory contains two tutorial notebooks. The easiest way to use BacSC on your data is to copy these notebooks and replace the respective paths at the top of the notebooks.BacSC_tutorial_general_pipeline.ipynbshows how to perform Variance stabilization, dimension reduction and clustering.BacSC_cluster_analysis_tutorial.ipynbgives a template for differential expression testing with BacSC.
- The
analysisdirectory provides notebooks for reproduction of the results in the publication (TODO). Grouped by dataset source, there are 3 notebooks for each dataset:<dataset>_final_pipeline(or<dataset>_final_pipeline_2) performs the general BacSC analsis up to cell type clustering.<dataset>_cluster_analysisperforms DA testing on the clusters determined before.<dataset>_summary_figuresgenerates the figures for each dataset in the manuscript.

