This is the official PyTorch implementation of the paper "Facilitating Multimodal Classification via Dynamically Learning Modality Gap".
For replication inquiries or issues, feel free to contact us via email at [[email protected]] or [[email protected]].
The original datasets can be found here: CREMA-D, Kinetics-Sounds.
The code uses the CREMA-D dataset as an example. You can run it with:
python Crema_epoch.py
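
Before launching Crema_epoch.py, it can help to confirm that PyTorch is installed, whether CUDA is available, and that the dataset is where the script expects it. The snippet below is a minimal sanity-check sketch; the ./data/CREMA-D path is an assumption for illustration, not a path fixed by this repository, so adjust it to wherever you extracted the dataset.

    # check_setup.py -- minimal environment/dataset sanity check.
    # NOTE: the ./data/CREMA-D path below is an assumption; point it at
    # your own extraction directory before running Crema_epoch.py.
    from pathlib import Path

    import torch

    # Confirm PyTorch is importable and report whether a CUDA device is visible.
    print(f"PyTorch {torch.__version__}, CUDA available: {torch.cuda.is_available()}")

    # Hypothetical dataset location -- replace with your local CREMA-D directory.
    data_root = Path("./data/CREMA-D")
    if data_root.is_dir():
        # Count files recursively as a rough check that extraction succeeded.
        n_files = sum(1 for p in data_root.rglob("*") if p.is_file())
        print(f"Found {n_files} files under {data_root}")
    else:
        print(f"Dataset directory {data_root} not found; download CREMA-D first.")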