How to cluster datasets with important levels of noise or dropouts?

Let's start by defining the difference between noise and dropout:

Dropout = dataset non 0 values appear as 0 (single cell RNA-seq data)
Noise = the actual measured values have a certain additional noise (due to sensor calibration, experimental setup, etc)

This repository attempts to:

explain the theoretical notions behing spectral clustering and self tuned spectral clustering
implement the affinity matrix computation for self tuned spectral clustering
implement the eigenvalue gap heuristic for finding the optimal number of clusters

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
DirichletMixture2D.ipynb		DirichletMixture2D.ipynb
README.md		README.md
Unsupervised feature selection.ipynb		Unsupervised feature selection.ipynb
graph-partitioning-louvain.ipynb		graph-partitioning-louvain.ipynb
robust_spectral_clustering.ipynb		robust_spectral_clustering.ipynb
spectral_clustering.ipynb		spectral_clustering.ipynb
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to cluster datasets with important levels of noise or dropouts?

About

Releases

Packages

Languages

pr-elhajji/high_noise_clustering

Folders and files

Latest commit

History

Repository files navigation

How to cluster datasets with important levels of noise or dropouts?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages