Gene Expression INR

This repository contains the algorithms implemented in the paper: Brain-wide interpolation and conditioning of gene expression in the human brain using Implicit Neural Representations.

Paper: https://arxiv.org/abs/2506.11158

Abstract

In this paper, we study the efficacy and utility of recent advances in non-local, non-linear image interpolation and extrapolation algorithms, specifically, ideas based on Implicit Neural Representations (INR), as a tool for analysis of spatial transcriptomics data. We seek to utilize the microarray gene expression data sparsely sampled in the healthy human brain, and produce fully resolved spatial maps of any given gene across the whole brain at a voxel-level resolution. To do so, we first obtained the 100 top AD risk genes, whose baseline spatial transcriptional profiles were obtained from the Allen Human Brain Atlas (AHBA). We adapted Implicit Neural Representation models so that the pipeline can produce robust voxel-resolution quantitative maps of all genes. We present a variety of experiments using interpolations obtained from Abagen as a baseline/reference.

Installation

conda env create -f environment.yml
conda activate inr

Code

Data

The following data are generated by abagen_compare.py, using a customized abagen codebase.

data/abagendata/abagen_output/...*.csv - abagen output data, used for the baseline only
- <atlas>_microarray_<donor_id>.csv - region-aggregated abagen output without interpolation for <donor_id> on <atlas>
- <atlas>_interpolation_microarray_<donor_id>.csv - region-aggregated abagen output with interpolation for <donor_id> on <atlas>
- atlas naming:
  - <atlas> == 246 - BN_Atlas_246_1mm.nii.gz
  - <atlas> == grey - MNI152_T1_1mm_brain_grey_mask_int.nii.gz
  - <atlas> == white - MNI152_T1_1mm_brain_white_mask_int.nii.gz
data/abagendata/train/...*.csv - abagen output data for training, without region aggregation and only preprocessed (useless measurements dropped)
- microarray_<donor_id>.csv - abagen-preprocessed microarray data
- annotation_<donor_id>.csv - abagen-preprocessed annotation
- annotation_<donor_id>_4d.csv - abagen-preprocessed annotation with a 4th dimension added that classifies each sample as grey or white matter; generated by python src/data/generate4d.py
- pc/se_<donor_id>.csv - names and expression data of the selected disease-relevant genes, ordered by pc1 or spectral embedding; generated by python src/data/pc1_se.py
- pc/se_<donor_id>_merged.csv - annotation merged with the selected microarray data and restructured for model training; generated by python src/data/data_merge.py
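The 4th annotation dimension can be illustrated with a minimal sketch. It assumes the grey and white masks are already loaded as 3-D arrays and that sample positions are given as integer voxel indices; the function name tissue_label and the toy masks are hypothetical and are not taken from generate4d.py. The values follow the encoding listed for generate4d.py under Preprocess, Training, and Inference (white 1, grey -1, neither 0).

```python
import numpy as np

def tissue_label(voxels, grey_mask, white_mask):
    """Return 1 for white matter, -1 for grey matter, 0 for neither.

    voxels: (N, 3) integer voxel indices (hypothetical input; converting
    MNI millimetre coordinates to voxel indices is not shown here).
    grey_mask, white_mask: 3-D arrays that are non-zero inside the tissue.
    """
    voxels = np.asarray(voxels, dtype=int)
    i, j, k = voxels[:, 0], voxels[:, 1], voxels[:, 2]
    labels = np.zeros(len(voxels), dtype=int)
    labels[grey_mask[i, j, k] > 0] = -1
    # Assumption: if the two masks overlap, the white label wins.
    labels[white_mask[i, j, k] > 0] = 1
    return labels

# Toy 4x4x4 masks standing in for the MNI152 grey/white masks.
grey = np.zeros((4, 4, 4), dtype=int)
white = np.zeros((4, 4, 4), dtype=int)
grey[0, 0, 0] = 1
white[1, 1, 1] = 1

points = [(0, 0, 0), (1, 1, 1), (2, 2, 2)]
labels = tissue_label(points, grey, white)
print(labels.tolist())
```

The three toy points fall in grey matter, white matter, and neither, in that order.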

Results

nii_<donor_id>/<gene_symbol>_<atlas>_abagen.nii.gz - gene expression for <gene_symbol> mapped onto the <atlas> brain atlas; generated by python src/plots/visualize_abagen.py from <atlas>_interpolation_microarray_<donor_id>.csv
nii_<donor_id>/<gene_symbol>_<atlas>_inr.nii.gz - gene expression for <gene_symbol> mapped onto the <atlas> brain atlas; generated by python inference.py from model_test/<mode>_<gene_symbol>.pth. This result is interpolated at every MNI coordinate in the brain atlas
nii_<donor_id>/<gene_symbol>_<atlas>_inr_avg.nii.gz - the INR map postprocessed with python src/plots/avg_inr_atlas.py, which averages within regions so that the result can be compared with the abagen baseline
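The region averaging behind the _inr_avg files can be sketched as follows. This is an illustration under stated assumptions, not the code in avg_inr_atlas.py: the function name average_by_region is hypothetical, the arrays are toy data in place of loaded nii volumes, and label 0 is assumed to mean background.

```python
import numpy as np

def average_by_region(values, atlas):
    """Replace each voxel value with the mean of its atlas region.

    values: 3-D float array of voxel-level expression (e.g. an INR map).
    atlas:  3-D integer array of region labels; 0 is assumed to be background.
    """
    out = np.zeros_like(values, dtype=float)
    for region in np.unique(atlas):
        if region == 0:
            continue  # leave background voxels at zero
        mask = atlas == region
        out[mask] = values[mask].mean()
    return out

# Toy volume of shape (1, 2, 2): region 1 has two voxels, region 2 has one.
values = np.array([[[1.0, 3.0], [5.0, 7.0]]])
atlas = np.array([[[1, 1], [2, 0]]])
averaged = average_by_region(values, atlas)
print(averaged.tolist())
```

After averaging, every voxel in a region carries the same value, which is the form the region-aggregated abagen output takes and what makes the two maps directly comparable.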

Preprocess, Training, and Inference

src/atlas/...
- filter_nii.py - filter an atlas nii file by a threshold, MNI152_T1_1mm_brain_grey.nii.gz -> MNI152_T1_1mm_brain_grey_mask.nii.gz
- integer_nii.py - convert the atlas to integer values to meet abagen's input requirement, MNI152_T1_1mm_brain_grey_mask.nii.gz -> MNI152_T1_1mm_brain_grey_mask_int.nii.gz
src/data/...: build the training data. Pipeline: generate4d.py -> pc1_se.py -> data_merge.py -> data/abagendata/train/se_<donor_id>_merged.csv
- i_generate4d.py - generate the 4-dimensional training data; the 4th dimension marks whether a point lies in white or grey matter: 1 for white, -1 for grey, 0 for neither
- ii_pc1_se.py - generate the pc1/spectral embedding ordering of the relevant genes
- iii_data_merge.py
  - merge the pc1/spectral embedding order with the gene x, y, z locations for training
- iv_encoding.py
src/plots/...
- similarity_gene.py - compute the similarity matrix from gene values only, under se/pc1 ordering; generates 2 png files
- similarity_brain.py - compute the similarity matrix from brain images, under se/pc1 ordering; generates 2 png files
- visualize_abagen.py - visualize the abagen result as a nii file
- visualize_se.py - generate gif files comparing the separately trained results with the jointly trained result under se ordering; requires the nii files to be generated first by inference.py
- visualize.py - no interpolation; simply maps the measured gene points to a nii file
plot_... - plotting code for the manuscript figures
inference.py - INR interpolation; requires a trained pth file
train_gene_net.py - our main result: trains all genes in one model with the model backbone of your choice; the paper mainly reports results from siren
train_gene_net_sep.py - baseline: trains one model per gene with the model backbone of your choice
train_gene_net_noisy.py - our main result with added noise, for the robustness ablation study: trains all genes in one model with the model backbone of your choice; the paper mainly reports results from siren
train_abagen.py - do not use, old code.
main.sh - train all gene expressions with one command
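For readers new to the siren backbone mentioned above, the following is a minimal NumPy sketch of a SIREN-style forward pass (sine activations with frequency scale w0 = 30 and the weight initialization from the SIREN paper). It is not the repository's implementation. The function names, the layer sizes, the zero bias initialization, and the 5-feature input (x, y, z, tissue label, gene order) are all assumptions made for illustration.

```python
import numpy as np

def init_siren(layer_sizes, w0=30.0, seed=0):
    """Create (weight, bias) pairs using the SIREN initialization scheme."""
    rng = np.random.default_rng(seed)
    params = []
    for idx, (n_in, n_out) in enumerate(zip(layer_sizes[:-1], layer_sizes[1:])):
        # First layer: U(-1/n_in, 1/n_in); later layers: U(-sqrt(6/n_in)/w0, +...).
        bound = 1.0 / n_in if idx == 0 else np.sqrt(6.0 / n_in) / w0
        weight = rng.uniform(-bound, bound, size=(n_in, n_out))
        bias = np.zeros(n_out)  # simplification: biases start at zero
        params.append((weight, bias))
    return params

def siren_forward(x, params, w0=30.0):
    """Apply sin(w0 * (h @ W + b)) on hidden layers and a linear output layer."""
    h = x
    for weight, bias in params[:-1]:
        h = np.sin(w0 * (h @ weight + bias))
    weight, bias = params[-1]
    return h @ weight + bias

# Hypothetical input layout: x, y, z, tissue label, gene order (5 features).
params = init_siren([5, 64, 64, 1])
coords = np.random.default_rng(1).uniform(-1.0, 1.0, size=(10, 5))
pred = siren_forward(coords, params)
print(pred.shape)  # (10, 1)
```

Each row of coords yields one predicted expression value. Training, normalization of the coordinates, and the alternative backbones are outside the scope of this sketch.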

Data Release

Google Drive: https://drive.google.com/drive/folders/1zi8rKqYVd7GsrcfZ-EuZtzsQwL-zA_3V?usp=drive_link

Acknowledgements

We make use of code from Implicit Neural Representations with Periodic Activation Functions, WIRE: Wavelet Implicit Neural Representations, and abagen: A toolbox for the Allen Brain Atlas genetics data. We gratefully acknowledge their significant contributions to the field and their commitment to making high-quality research code publicly available.

Reference

If you find our paper helpful and/or use this code, please cite our publication.

@misc{yu2025brainwideinterpolationconditioninggene,
      title={Brain-wide interpolation and conditioning of gene expression in the human brain using Implicit Neural Representations}, 
      author={Xizheng Yu and Justin Torok and Sneha Pandya and Sourav Pal and Vikas Singh and Ashish Raj},
      year={2025},
      eprint={2506.11158},
      archivePrefix={arXiv},
      primaryClass={q-bio.GN},
      url={https://arxiv.org/abs/2506.11158}, 
}
