Movie-trained transformer reveals novel response properties to dynamic stimuli in mouse visual cortex
Code for the ViV1T model and paper "Movie-trained transformer reveals novel response properties to dynamic stimuli in mouse visual cortex".
Authors: Bryan M. Li, Wolf De Wulf, Danai Katsanevaki, Arno Onken, Nathalie L. Rochefort.

    @article{li2025movie,
        author = {Li, Bryan M and De Wulf, Wolf and Katsanevaki, Danai and Onken, Arno and Rochefort, Nathalie L},
        title = {Movie-trained transformer reveals novel response properties to dynamic stimuli in mouse visual cortex},
        elocation-id = {2025.09.16.676524},
        year = {2025},
        doi = {10.1101/2025.09.16.676524},
        publisher = {Cold Spring Harbor Laboratory},
        URL = {https://www.biorxiv.org/content/early/2025/09/17/2025.09.16.676524},
        eprint = {https://www.biorxiv.org/content/early/2025/09/17/2025.09.16.676524.full.pdf},
        journal = {bioRxiv}
    }
[Animations: the most exciting centre stimulus, and the most exciting surrounds (grating video, natural video, generated video).]
most_exciting_stimulus/README.md shows more examples of model-selected and model-generated most exciting centre-surround stimuli (MEIs and MEVs), which we then verified in semi-closed-loop in vivo experiments.
We sincerely thank Turishcheva et al. for organising the Sensorium 2023 challenge and for making their high-quality, large-scale mouse V1 recordings publicly available. The structure of this codebase is based on and inspired by bryanlimy/V1T, bryanlimy/ViV1T, ecker-lab/sensorium_2023, sinzlab/neuralpredictors and sinzlab/nnfabrik.
The codebase is divided into (mostly) self-contained folders, each evaluating a different aspect or property of the trained model (direction tuning, size tuning, etc.). The following list shows the overall organisation of the codebase and the code used for each figure and/or analysis presented in the paper.
- Check data/README.md for more information about the datasets and how to store them.
- train.py is the main pipeline to train the model.
- predict.py runs inference with the trained model on the test set(s).
- viv1t/ contains code to construct the model, save and load model weights, compute various metrics to evaluate the model, generate low-dimensional parametric stimuli, etc.
- misc/ contains scripts for extracting metadata and creating visualisations that are shared across the analyses; you may also find some of them useful on their own. See misc/README.md.
- tuning_retinotopy/ estimates the artificial receptive fields (aRFs) of each in silico neuron the models were trained on (Figure 7). The aRFs are used to estimate the centre of the receptive field for subsequent analyses. See tuning_retinotopy/README.md.
- tuning_direction/ evaluates the direction tuning and spatial organisation of the trained models (Figure 2). See tuning_direction/README.md.
- tuning_contextual_modulation/ evaluates the centre-surround contextual modulation properties of the trained models, mostly replicating the in vivo experiments from Keller et al. 2020 using the movie-trained model (Figure 2). See tuning_contextual_modulation/README.md.
- tuning_feedbackRF/ evaluates the feedback-dependent contextual modulation of the trained models, mostly replicating the in vivo experiments from Keller et al. 2020 using the movie-trained model (Figure 3). See tuning_feedbackRF/README.md.
- most_exciting_stimulus/ contains code to find the grating and natural centre-surround stimuli that most excite single-neuron and population responses. It also contains the code to generate the most exciting images and videos (MEIs and MEVs, Figure 5 and Supplemental Figure 4). See most_exciting_stimulus/README.md.
- in_vivo_analysis/ contains code to analyse the in vivo experiments we conducted to verify the predictions made by the movie-trained ViV1T, including low vs. high contrast centre-surround contextual modulation, generating MEIs and MEVs, etc. (Figure 4, Figure 5, Supplemental Figure 3 and Supplemental Figure 4). See in_vivo_analysis/README.md.
- Check .gitignore for the ignored files.
The repository is organised as follows:

    ViV1T-closed-loop/
        data/
            sensorium/
            rochefort-lab/
            README.md
            ...
        docker/
        figures/
        in_vivo_analysis/
        misc/
        most_exciting_stimulus/
        tuning_contextual_modulation/
        tuning_direction/
        tuning_feedbackRF/
        tuning_retinotopy/
        viv1t/
            data/
            model/
            most_exciting_stimulus/
            utils/
            __init__.py
            checkpoint.py
            criterions.py
            metrics.py
            optimizer.py
            scheduler.py
        .gitignore
        pyproject.toml
        README.md
        predict.py
        train.py
        ...
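To set up the environment, first obtain a local copy of the code. A minimal sketch, assuming the repository is hosted at github.com/bryanlimy/ViV1T-closed-loop (adjust the URL if your copy lives elsewhere):

    # clone the repository and move into it (assumed GitHub location)
    git clone https://github.com/bryanlimy/ViV1T-closed-loop.git
    cd ViV1T-closed-loop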
- Create a new conda environment with Python 3.12: `conda create -n viv1t python=3.12`
- Activate the `viv1t` virtual environment: `conda activate viv1t`
- Install all the relevant packages with:

      conda install ffmpeg -c conda-forge
      pip install -e .
- Alternatively, see the Dockerfile we used at docker/Dockerfile.
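To confirm the installation succeeded, a quick sanity check (assuming the editable install above completed without errors) is to import the package and PyTorch:

    # verify that the viv1t package and its PyTorch backend are importable,
    # and report whether a CUDA device is available for training
    python -c "import torch, viv1t; print(torch.__version__, torch.cuda.is_available())"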
- Activate the `viv1t` environment: `conda activate viv1t`
- Train the default ViV1T model. The model checkpoints and logs will be stored in `runs/001_viv1t`:

      python train.py --data_dir=data/ --output_dir=runs/001_viv1t --core=vivit --core_behavior_mode=2 --core_use_causal_attention --core_parallel_attention --readout=gaussian --output_mode=1 --schedule_free --compile --batch_size=1 --wandb=vivit --clear_output_dir
- Check `--help` for all available arguments:

      > python train.py --help
      usage: train.py [-h] [--data_dir DATA_DIR] --output_dir OUTPUT_DIR
                      [--transform_input {0,1,2}] [--transform_output {0,1,2}]
                      [--center_crop CENTER_CROP] [--mouse_ids MOUSE_IDS [MOUSE_IDS ...]]
                      [--limit_data LIMIT_DATA] [--limit_neurons LIMIT_NEURONS]
                      [--num_workers NUM_WORKERS] [--epochs EPOCHS]
                      [--batch_size BATCH_SIZE] [--micro_batch_size MICRO_BATCH_SIZE]
                      [--crop_frame CROP_FRAME] [--device {cpu,cuda,mps}] [--seed SEED]
                      [--deterministic] [--autocast {auto,disable,enable}]
                      [--grad_checkpointing {0,1}] [--restore RESTORE]
                      [--adam_beta1 ADAM_BETA1] [--adam_beta2 ADAM_BETA2]
                      [--adam_eps ADAM_EPS] [--schedule_free]
                      [--adam_warmup_steps ADAM_WARMUP_STEPS] [--adam_r ADAM_R]
                      [--adam_weight_lr_power ADAM_WEIGHT_LR_POWER]
                      [--criterion CRITERION] [--ds_scale {0,1}] [--clip_grad CLIP_GRAD]
                      [--wandb WANDB] [--wandb_id WANDB_ID] [--clear_output_dir]
                      [--verbose {0,1,2,3}] --core CORE
                      [--pretrained_core PRETRAINED_CORE] [--compile] --readout READOUT
                      [--shifter_mode {0,1,2}] [--output_mode {0,1,2,3,4}]

      options:
        -h, --help            show this help message and exit
        --data_dir DATA_DIR   path to directory where the dataset is stored.
        --output_dir OUTPUT_DIR
                              path to directory to log training performance and model checkpoint.
        --transform_input {0,1,2}
                              input transformation
                              0: no transformation
                              1: standardize input
                              2: normalize input
        --transform_output {0,1,2}
                              output transformation
                              0: no transformation
                              1: standardize output per neuron
                              2: normalize output per neuron
        --center_crop CENTER_CROP
                              center crop the video frame to center_crop percentage.
        --mouse_ids MOUSE_IDS [MOUSE_IDS ...]
                              Mouse to use for training. By default we use all 10 mice from the Sensorium 2023 dataset
        --limit_data LIMIT_DATA
                              limit the number of training samples.
        --limit_neurons LIMIT_NEURONS
                              limit the number of neurons to model.
        --num_workers NUM_WORKERS
                              number of workers for DataLoader.
        --epochs EPOCHS       maximum epochs to train the model.
        --batch_size BATCH_SIZE
        --micro_batch_size MICRO_BATCH_SIZE
                              micro batch size to train the model. if the model is being trained on CUDA device and micro batch size 0 is provided, then automatically increase micro batch size until OOM.
        --crop_frame CROP_FRAME
                              number of frames to take from each trial.
        --device {cpu,cuda,mps}
                              Device to use for computation. use the best available device if --device is not specified.
        --seed SEED           random seed.
        --deterministic       use deterministic algorithms in PyTorch
        --autocast {auto,disable,enable}
                              Use torch.autocast in torch.bfloat16 when training the model.
        --grad_checkpointing {0,1}
                              Enable gradient checkpointing for supported models if set to 1.
        --restore RESTORE     pretrained model to restore from before training begins.
        --adam_beta1 ADAM_BETA1
        --adam_beta2 ADAM_BETA2
        --adam_eps ADAM_EPS
        --schedule_free       use schedule-free optimizer
        --adam_warmup_steps ADAM_WARMUP_STEPS
        --adam_r ADAM_R
        --adam_weight_lr_power ADAM_WEIGHT_LR_POWER
        --criterion CRITERION
                              criterion (loss function) to use.
        --ds_scale {0,1}      scale loss by the size of the dataset
        --clip_grad CLIP_GRAD
                              clip gradient norm:
                              0: disable gradient clipping
                              -1: AutoClip (Seetharaman et al. 2020)
                              >0: clip to a specific value.
        --wandb WANDB         wandb group name, disable wandb logging if not provided.
        --wandb_id WANDB_ID   wandb run ID to resume from.
        --clear_output_dir    overwrite content in --output_dir
        --verbose {0,1,2,3}
        --core CORE           The core module to use.
        --pretrained_core PRETRAINED_CORE
                              Path to directory where the pre-trained model is stored
        --compile             torch.compile (part of) the model for faster training
        --readout READOUT     The readout module to use.
        --shifter_mode {0,1,2}
                              0: disable shifter
                              1: learn shift from pupil center
                              2: learn shift from pupil center and behavior variables
        --output_mode {0,1,2,3,4}
                              Output activation:
                              0: no activation
                              1: ELU + 1 activation
                              2: Exponential activation
                              3: SoftPlus activation with learnable beta value
                              4: sigmoid activation
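Before launching a full training run, it can be useful to do a short smoke test on a reduced dataset. The command below is a sketch that only uses flags documented in `--help`; it omits the core-specific flags from the full command above, so the resulting model is for debugging only:

    # short debug run: a single epoch on a handful of training samples,
    # with checkpoints and logs written to runs/debug (hypothetical path)
    python train.py \
      --data_dir=data/ \
      --output_dir=runs/debug \
      --core=vivit \
      --readout=gaussian \
      --output_mode=1 \
      --batch_size=1 \
      --limit_data=16 \
      --epochs=1 \
      --clear_output_dir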
- The model weights trained on the Sensorium 2023 challenge dataset are available at huggingface.co/bryanlimy/ViV1T-closed-loop (Figure 2, Figure 3, Supplemental Figure 1, Supplemental Figure 2 and Supplemental Figure 3).
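To reuse the released checkpoints locally, one option is to pull them from the Hugging Face Hub and then run inference with predict.py. The commands below are a sketch: they assume the huggingface_hub CLI is installed, that runs/pretrained is an acceptable place to store the weights, and that predict.py accepts the same --data_dir and --output_dir arguments as train.py (check `python predict.py --help` for the exact interface):

    # download the released model weights into runs/pretrained (assumed local path)
    pip install huggingface_hub
    huggingface-cli download bryanlimy/ViV1T-closed-loop --local-dir runs/pretrained

    # run inference on the test set(s); the flags are assumed to mirror train.py
    python predict.py --data_dir=data/ --output_dir=runs/pretrained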