Postdoctoral Fellow at CUHK. Previously Ph.D. in Computer Science at the University of Wisconsin-Madison.
Pinned
- microsoft/RegionCLIP (Public): [CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
- LaVi-Lab/AIM (Public): Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
Python 24
- facebookresearch/ProcedureVRL (Public archive): [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
Python 52
- LaVi-Lab/Visual-Table (Public): [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
- SGG_from_NLS (Public): [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"