I am currently a PhD student at the School of Computer Science and Technology, Zhejiang University, jointly cultivated by Shanghai Artificial Intelligence Laboratory.
I am doing a research internship in the opencompass at the Shanghai Artificial Intelligence Laboratory.
And I'm one of the core contributor and maintainer of VLMEvalKit, you can contact me via [email protected] if you have any problems while using VLMEvalKit to evaluate Video understanding benchmarks or Video-LLMs.
Currently Study on Video Understanding and Large Vision/Video-language Model Evaluation, interested in real-time video understanding.
View the OpenVLM Video Leaderboard to quickly understand the real-world performance of existing MLLMs.