Add our ICLR2025 work Dynamic-LLaVA #121

Blank-z0 · 2025-02-27T04:55:22Z

Add paper "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification" Dynamic-LLaVA is the first MLLM acceleration framework that simultaneously sparsifies both vision and language contexts while integrating inference efficiency optimization across different MLLM inference modes into a unified framework. In practice, Dynamic-LLaVA can achieve additional inference efficiency throughout the entire generation process, with negligible understanding and generation ability degradation or even performance gains compared to the full-context inference baselines. GitHub: https://github.com/Osilly/dynamic_llava

DefTruth

LGTM

DefTruth self-requested a review February 27, 2025 05:06

DefTruth approved these changes Feb 27, 2025

View reviewed changes

DefTruth merged commit 4cb8763 into DefTruth:main Feb 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add our ICLR2025 work Dynamic-LLaVA #121

Add our ICLR2025 work Dynamic-LLaVA #121

Blank-z0 commented Feb 27, 2025

DefTruth left a comment

Add our ICLR2025 work Dynamic-LLaVA #121

Add our ICLR2025 work Dynamic-LLaVA #121

Conversation

Blank-z0 commented Feb 27, 2025

DefTruth left a comment

Choose a reason for hiding this comment