DeepCycle

This README outlines the steps required to reproduce the approach presented in the manuscript: Rappez et al., "DeepCycle reconstructs a cyclic cell cycle trajectory from unsegmented cell images using convolutional neural networks", Molecular Systems Biology, 2020. These instructions show how to run the model on the data used in the paper and on new, unseen data.

Requirements

keras, UMAP, cv2, albumentations, classification_models
Install the customized version of SOMPY
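
Before running anything, it can help to confirm that the listed packages are importable. The following is a minimal sketch and not part of the repository; the import names (`umap` for UMAP, `sompy` for SOMPY, and so on) are assumptions and may differ in your environment.

```python
from importlib.util import find_spec

# Import names assumed from the requirements list above (not taken from the repository).
REQUIRED_MODULES = ["keras", "umap", "cv2", "albumentations", "classification_models", "sompy"]


def missing_modules(names):
    """Return the subset of `names` that cannot be found on the import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    print("Missing modules:", ", ".join(missing) if missing else "none")
```

The check only looks the modules up without importing them, so it stays fast and does not initialize a deep learning backend.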

How to run

Prepare the DeepCycle paper data

Download the data from the EBI BioStudies repository and the deep learning models from our EMBL hosting.

Unzip into the data/Timelapse_2019 folder, preserving the directory structure. You will have:

root|
    |-data|
    |     |-Timelapse_2019|
    |                     |-BF/
    |                     |-Cy3/
    |                     |-DAPI/
    |                     |-GFP/
    |                     |-curated_tracks.csv
    |                     |- ...
    |-src/
    |...
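
After unzipping, the layout shown above can be verified with a short script. This is a sketch under the assumption that only the entries named in the tree need to be present; the helper name `missing_entries` is hypothetical and does not exist in the repository.

```python
from pathlib import Path

# Entries expected under data/Timelapse_2019, taken from the tree above.
EXPECTED_DIRS = ["BF", "Cy3", "DAPI", "GFP"]
EXPECTED_FILES = ["curated_tracks.csv"]


def missing_entries(data_dir):
    """List expected sub-folders and files that are absent from `data_dir`."""
    data_dir = Path(data_dir)
    missing = [d + "/" for d in EXPECTED_DIRS if not (data_dir / d).is_dir()]
    missing += [f for f in EXPECTED_FILES if not (data_dir / f).is_file()]
    return missing


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    print(missing_entries("data/Timelapse_2019") or "Layout looks complete.")
```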

Prepare new timelapse and tracking data

Alternatively, DeepCycle can be run on newly generated data. For that, prepare the data as follows:

Organize the live-imaging microscopy data. Each channel of every timepoint is stored independently as:

root|
    |-data|
    |     |-New_timelapse_data|
    |                         |-BF/
    |                         |-Cy3/
    |                         |-DAPI/
    |                         |-GFP/

Run the TrackMate FIJI plugin on the nuclear staining channel (DAPI in our case). The output should be called Spots in tracks statistics.csv.
Run src/TrackMate_filter.py with the correct paths (see comments in the file) to manually curate the tracks with one or two divisions.
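
Before curating, it may be useful to see how many frames each TrackMate track spans. The sketch below is not a substitute for src/TrackMate_filter.py and does not reproduce its logic; it assumes the export contains `TRACK_ID` and `FRAME` columns, which should be checked against your own file.

```python
import csv
from collections import defaultdict


def track_lengths(csv_path, track_col="TRACK_ID", frame_col="FRAME"):
    """Map each track ID to the number of distinct frames it spans.

    Column names are assumptions; adjust them to match your TrackMate export.
    """
    frames = defaultdict(set)
    with open(csv_path, newline="") as handle:
        for row in csv.DictReader(handle):
            # Frame values may be written as "12" or "12.0", so parse via float.
            frames[row[track_col]].add(int(float(row[frame_col])))
    return {track: len(seen) for track, seen in frames.items()}
```

Calling `track_lengths("Spots in tracks statistics.csv")` returns a dictionary keyed by track ID (as a string), which can be sorted by value to find the longest tracks.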

Run DeepCycle

cd src
Prepare the data:
python data_prepare.py
- Cleans the data and removes unnecessary columns. Stores the result as statistics_clean.csv in the data/Timelapse_2019 dir
- Aligns the curated tracks based on division events and calculates mean intensities per track and per frame. Stores the result as intensities.csv
- Calculates intensity statistics and adds a virtual class (1-4) to each tracked cell. The resulting data are stored in statistics_mean_std.csv
Train the model:
python model_train.py
Trains the model on the curated tracks, excluding the double-division tracks, which are used as the validation set. Saves the best models in the checkpoints dir
Generate cell descriptors with checkpoint.r34.sz48.03-0.73.hdf5 as default model:
- from validation set (double division tracks) only:
  python encode.py --mode encode_val
- from all available tracks:
  python encode.py --mode encode_all
  Descriptors are saved in descriptors.r34.sz48.pkl and descriptors_all.r34.sz48.pkl in the data/Timelapse_2019 dir.
Generate embeddings for the whole dataset. This step is compute-intensive; consider using the supplied embeddings_preds_all_batch<i>.npz files instead:
python all_cells_prediction.py
cd ..
Start Jupyter Notebook and open timelapse_projection2019.ipynb
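
The command-line steps above can also be chained from a single script. The following is a minimal sketch, not a tool shipped with the repository: the script names and flags are the ones listed above, while `build_commands`, `run_pipeline`, and the `skip` parameter are hypothetical helpers introduced for illustration.

```python
import subprocess
import sys

# Steps in the order given above; script names and flags are taken from this README.
STEPS = [
    ["data_prepare.py"],
    ["model_train.py"],
    ["encode.py", "--mode", "encode_val"],
    ["encode.py", "--mode", "encode_all"],
    ["all_cells_prediction.py"],
]


def build_commands(python=sys.executable, skip=()):
    """Return the command lines to run, leaving out any script named in `skip`."""
    return [[python, *step] for step in STEPS if step[0] not in skip]


def run_pipeline(src_dir="src", skip=()):
    """Run each step from `src_dir`, stopping at the first failure."""
    for command in build_commands(skip=skip):
        subprocess.run(command, cwd=src_dir, check=True)
```

To rely on the supplied embeddings rather than recomputing them, one could call `run_pipeline(skip=("all_cells_prediction.py",))` from the repository root.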
