- 🌱 I am Yuxuan Wang (汪宇轩), a research engineer at BIGAI. I completed my Master's degree at Peking University (PKU) and interned at Johns Hopkins University (JHU). Additionally, I conduct part-time research at the University of California, Santa Cruz.
- 🔭 I am keen to explore “o” for “omni”.
- 🤝 I am continually open to all forms of collaborative opportunities.
- 🔋 I am seeking more computational resource support.
-
Peking University
- https://patrick-tssn.github.io
Pinned Loading
-
Awesome-Colorful-LLM
Awesome-Colorful-LLM PublicRecent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics, Fundamental Sciences such as Mathematics, and Ominous.
-
OmniMMI/M4
OmniMMI/M4 Public[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
Python 5
-
OmniMMI/OpenOmniNexus
OmniMMI/OpenOmniNexus Publica fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.
Python 3
-
VideoHallucer
VideoHallucer PublicVideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
Python 27
-
bigai-nlco/VideoLLaMB
bigai-nlco/VideoLLaMB PublicOfficial Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
If the problem persists, check the GitHub status page or contact support.