HyperAI

M2RAG Multimodal Evaluation Benchmark Dataset

Date

2 months ago

Size

5.46 GB

Organization

Publish URL

huggingface.co

M2RAG is a multimodal dataset for evaluating the capabilities of multimodal large language models (MLLMs) in multimodal retrieval scenarios. It aims to evaluate the ability of MLLMs to use multimodal retrieval document knowledge in tasks such as image description, multimodal question answering, fact verification, and image re-ranking.Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".

This dataset combines image and text data to simulate information retrieval and generation tasks in real scenarios, such as news event analysis and visual question answering. It focuses on evaluating the ability of MLLMs to use retrieved document knowledge in multimodal contexts, including understanding of image content, image-text association reasoning, and fact judgment.

M2RAG Benchmark Task Example
M2RAG.torrent
Seeding 1Downloading 0Completed 15Total Downloads 22
  • M2RAG/
    • README.md
      1.45 KB
    • README.txt
      2.9 KB
      • data/
        • M2RAG.zip
          5.46 GB