This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Generation
deep-learning
language-model
vision-and-language
multimodal-deep-learning
visual-commonsense-reasoning
visual-commonsense
-
Updated
Jul 1, 2024 - Python