Skip to content
@MME-Benchmarks

MME Benchmarks

Multimodal LLM Benchmarks of MME series

Pinned Loading

  1. Video-MME Video-MME Public

    ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    509 20

  2. MME-RealWorld MME-RealWorld Public

    ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

    Python 107 8

  3. MME-CoT MME-CoT Public

    MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

    Python 93 3

  4. MME-Unify MME-Unify Public

    MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

    Python 30 1

Repositories

Showing 4 of 4 repositories
  • MME-Unify Public

    MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

    MME-Benchmarks/MME-Unify’s past year of commit activity
    Python 30 1 0 0 Updated Apr 10, 2025
  • MME-CoT Public

    MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

    MME-Benchmarks/MME-CoT’s past year of commit activity
    Python 93 3 1 0 Updated Mar 29, 2025
  • Video-MME Public

    ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    MME-Benchmarks/Video-MME’s past year of commit activity
    509 20 6 0 Updated Mar 28, 2025
  • MME-RealWorld Public

    ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

    MME-Benchmarks/MME-RealWorld’s past year of commit activity
    Python 107 8 2 0 Updated Mar 4, 2025

Top languages

Loading…

Most used topics

Loading…