Skip to content
@single-cell-data

The SOMA project

A flexible and extensible API for annotated 2D matrix data stored in multiple underlying formats

SOMA_github_banner1

Who we are

TileDB and the Chan Zuckerberg Initiative are collaborating to address emerging needs in the single-cell ecosystem.

The software specification

SOMA is a flexible, extensible, and open-source API enabling access to data in a variety of formats. SOMA is designed to be general-purpose for data that can be modeled as one or more sets of 2D annotated matrices with measurements of features across observations. The driving use case of SOMA is for single-cell data in the form of annotated matrices where observations are frequently cells and features are genes, proteins, or genomic regions.

Read more about the spec here.

The software package

The TileDB-SOMA package is a C++ library with APIs in Python and R, using TileDB Embedded to implement the SOMA specification.

Read more about the software package here.

Join our community

Pinned Loading

  1. SOMA SOMA Public

    A flexible and extensible API for annotated 2D matrix data stored in multiple underlying formats.

    Python 74 10

  2. TileDB-SOMA TileDB-SOMA Public

    Python and R SOMA APIs using TileDB’s cloud-native format. Ideal for single-cell data at any scale.

    Python 105 29

Repositories

Showing 6 of 6 repositories

Top languages

Loading…

Most used topics

Loading…