Change the repository type filter
All
Repositories list
14 repositories
mlc-imp
Publicanetqa-code
Publicrosita
PublicROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integrationprophet
PublicImplementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".xmchat
Public- A PyTorch reimplementation of bottom-up-attention models
openvqa
PublicA lightweight, scalable, and general framework for visual question answering researchmcan-vqa
PublicDeep Modular Co-Attention Networks for Visual Question Answeringactivitynet-qa
Publicmmnas
Publicmt-captioning
PublicA PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning