A Python library for mechanistic interpretability of CEBRA models
CEBRA-Lens is a Python library for analyzing and interpreting neural representations learned by models trained with CEBRA. It provides tools for mechanistic interpretability, allowing users to probe, visualize, and understand the structure of learned embeddings. The library is designed to support in-depth analysis of representational geometry, feature selectivity, and latent space dynamics in neuroscience and beyond. We welcome contributions and will continue to expand the library in the coming years.
🚨 Make sure that the environment in which you trained the CEBRA models has the same torch version as the environment used for CEBRA-Lens.
```bash
conda create -n CEBRAlens python=3.12
conda activate CEBRAlens
conda install -c conda-forge pytables==3.8.0
# install PyTorch with your desired CUDA version - check their website: https://pytorch.org/get-started/locally/
# example: GPU version of pytorch for CUDA 11.3
conda install pytorch cudatoolkit=11.3 -c pytorch
# install CEBRA and CEBRA-lens
pip install --pre 'cebra[datasets,demos]'
pip install --pre cebra_lens
```
Implemented mechanistic interpretability methods for neural representation analysis are presented below.
- Model decoding metrics:
  - average $R^2$ score across labels
  - $R^2$ score per label
  - error score per label
  - average error score across labels
Additionally, the decoding performance of each layer's embeddings can be analyzed.
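The per-label decoding metrics above can be sketched in plain NumPy. This is a minimal illustration of the computation, not the CEBRA-Lens API; the function and variable names are ours:

```python
import numpy as np

def r2_per_label(y_true: np.ndarray, y_pred: np.ndarray) -> np.ndarray:
    """Compute the R^2 score independently for each label dimension.

    y_true, y_pred: arrays of shape (n_samples, n_labels).
    Returns an array of shape (n_labels,).
    """
    ss_res = np.sum((y_true - y_pred) ** 2, axis=0)                   # residual sum of squares
    ss_tot = np.sum((y_true - y_true.mean(axis=0)) ** 2, axis=0)      # total sum of squares
    return 1.0 - ss_res / ss_tot

# Example: two label dimensions decoded from an embedding
y_true = np.array([[0.0, 1.0], [1.0, 2.0], [2.0, 3.0], [3.0, 4.0]])
y_pred = y_true.copy()
scores = r2_per_label(y_true, y_pred)   # perfect predictions -> R^2 of 1.0 per label
average_r2 = scores.mean()              # the "average R^2 across labels" metric
```

Running the same function on the activations of each layer's decoder output gives the per-layer decoding comparison described above.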
- single-unit activation: plotting the activation value of each neural-network unit
- high-dimensional embedding of population activity: 3D scatter plot using the first three dimensions
- low-dimensional embedding of population activity: 3-component t-SNE (Cai & Ma, arXiv, 2022)
- Centered Kernel Alignment (CKA) (Kim et al., arXiv, 2022): allows comparing corresponding layers of different models.
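Linear CKA can be sketched in a few lines of NumPy. This is a generic illustration of the technique, not CEBRA-Lens code:

```python
import numpy as np

def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between two activation matrices of shape (n_samples, n_features).

    Columns are mean-centered before comparison; the score is invariant to
    isotropic scaling and to orthogonal transformations of the features.
    """
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    # CKA = ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F)
    hsic = np.linalg.norm(Y.T @ X, "fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, "fro")
    norm_y = np.linalg.norm(Y.T @ Y, "fro")
    return hsic / (norm_x * norm_y)

rng = np.random.default_rng(0)
acts = rng.standard_normal((100, 8))          # stand-in for one layer's activations
q, _ = np.linalg.qr(rng.standard_normal((8, 8)))
same = linear_cka(acts, acts)                 # identical layers -> 1.0
rotated = linear_cka(acts, acts @ q)          # orthogonal rotation -> still 1.0
```

The orthogonal-invariance property is what makes CKA suitable for comparing layers whose embedding axes are arbitrary.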
- Representational Dissimilarity Matrix (RDM) (Kriegeskorte et al., Frontiers in Systems Neuroscience, 2008): investigates population-level representations in competing models by computing the correlation or cosine distance between the embeddings of a given layer for each pair of stimuli. Possible plots for this analysis:
- plot model layer RDM
- plot correlation with Oracle RDM across layers
These analyses quantify how the distances change across the layers of a model. The distances implemented in this codebase are:
- intra-class distance
- inter-class distance
- inter-repetition distance (only relevant if the model was trained on a dataset with repeated stimuli)
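The intra- and inter-class distances can be sketched as average pairwise distances within versus between label groups. A minimal NumPy sketch with illustrative names, not the library's own code:

```python
import numpy as np

def class_distances(embeddings: np.ndarray, labels: np.ndarray) -> dict:
    """Average pairwise Euclidean distances within and between classes.

    embeddings: (n_samples, n_dims); labels: (n_samples,) class assignments.
    """
    diffs = embeddings[:, None, :] - embeddings[None, :, :]
    dist = np.linalg.norm(diffs, axis=-1)        # full pairwise distance matrix
    same = labels[:, None] == labels[None, :]
    off_diag = ~np.eye(len(labels), dtype=bool)
    intra = dist[same & off_diag].mean()         # within-class, excluding self-pairs
    inter = dist[~same].mean()                   # between-class
    return {"intra": float(intra), "inter": float(inter)}

# Two tight clusters far apart: inter-class distance should dominate
emb = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
labels = np.array([0, 0, 1, 1])
d = class_distances(emb, labels)   # d["inter"] > d["intra"]
```

Tracking these two numbers per layer quantifies how class structure sharpens or dissolves through the network.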
The current version of CEBRA-Lens supports dataset-specific analysis for the Allen Institute visual coding dataset (DeVries et al., Nature Neuro., 2020) and the hippocampus dataset (Grosmark & Buzsáki, Science, 2016), as well as general analysis for other datasets. See the example notebooks we provide.
The CEBRA-Lens package not only analyzes the embeddings of CEBRA models, but also compares embeddings and behavior across layers and between models. For this purpose the code logic is centered around "metric classes". Before every analysis, first initialize the corresponding metric class with the necessary arguments; then compute the metric by calling the overhead function `compute_metric(data, metric_class)`. Plotting works the same way via `plot_metric(data, metric_class)`.
For example:

```python
interbin_class = lens.Distance(
    data=train_data,
    label=train_label,
    dataset_label=dataset_label,
    metric=metric,
    distance_label="interbin",
)
interbin_dict = lens.compute_metric(
    activations_dict,
    interbin_class
)
fig = lens.plot_metric(
    interbin_dict,
    interbin_class,
    title="Inter-bin distance"
)
```
- UsageDemoVISUAL: analysis on the Allen visual dataset, here.
- UsageDemoGENERAL: analysis on the Hippocampus dataset, but without specific dataset functions, here.
These two notebooks showcase the different approaches when analyzing a pre-defined dataset versus a non-defined dataset.
- This repository contains the code for Eloise's semester project "Engineering software for neural representation analysis" (Spring 2025), building on Riccardo's semester project "Exploring nonlinear encoders for robust vision decoding" (Fall 2024).
- The work was supervised by CΓ©lia Benquet and Mackenzie Mathis at the Mathis Laboratory of Adaptive Intelligence.
- We thank the DeepDraw project for some source code and analysis methods.
The utils.py file contains an overarching get_data function which checks for a pre-defined dataset label and loads the data with the dataset-specific functions accordingly. If you want to load data from a non-defined dataset, first import the loading function inside the utils.py file, like so:

```python
from .utils_new import get_datasets as get_datasets_new
```

then add an `elif` clause for your new dataset:

```python
elif dataset_label == "new_dataset":
    return get_datasets_new(session_id=session_id)
```
This is briefly repeated in the usage demo notebooks.
- Fork the repository and create a new branch:
  ```bash
  git checkout -b your-feature-name
  ```
- Make your changes and ensure they are well-tested and well-documented.
- Format your code using `isort` and `yapf`:
  ```bash
  isort .
  yapf -i -p -r cebra_lens
  yapf -i -p -r tests
  ```
  or the make command:
  ```bash
  make format
  ```
- Open a Pull Request to the `main` branch with a clear description of your changes.

