CompreCap Image Description Dataset

Date: a year ago
Size: 46.29 MB
Organization: Ant Group
Publish URL: github.com
Paper URL: arxiv.org

The CompreCap dataset was created jointly by the University of Science and Technology of China and Ant Group in 2024 to evaluate how accurately and comprehensively large vision-language models (LVLMs) generate detailed image descriptions. It accompanies the paper "Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning". The dataset contains 560 images, each with fine-grained semantic segmentation and annotations of objects, attributes, and relationships that together form a complete directed scene graph.

The dataset builds on the MSCOCO panoptic segmentation dataset, with extensions and improvements. The researchers built a vocabulary of common object categories from several well-known datasets and re-annotated those categories to provide more accurate semantic segmentation masks. To ensure complete annotations, only images whose segmented regions cover more than 95% of the image area were retained. The researchers then manually added detailed attribute descriptions for the objects and annotated the significant relationships between them, forming a complete directed scene graph.
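The 95% coverage rule can be illustrated with a short sketch. This is not the authors' code: it assumes a segmentation stored as a 2D grid of integer segment ids, with the hypothetical value `0` marking pixels that belong to no annotated segment. The function names are illustrative only.

```python
from typing import Sequence

UNLABELED = 0  # assumption: id used for pixels outside every annotated segment


def coverage_ratio(label_map: Sequence[Sequence[int]]) -> float:
    """Return the fraction of pixels that belong to some annotated segment."""
    total = sum(len(row) for row in label_map)
    if total == 0:
        return 0.0
    labeled = sum(1 for row in label_map for px in row if px != UNLABELED)
    return labeled / total


def keep_image(label_map: Sequence[Sequence[int]], threshold: float = 0.95) -> bool:
    """Keep an image only if its segments cover more than `threshold` of its area."""
    return coverage_ratio(label_map) > threshold


# A 10x10 map with 4 unlabeled pixels is 96% covered and passes the filter.
example = [[1] * 10 for _ in range(10)]
for col in range(4):
    example[0][col] = UNLABELED
print(keep_image(example))
```

The comparison is strict because the description says "more than 95%"; whether an image at exactly 95% was kept is not stated in the source.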

CompreCap annotations include semantic segmentation masks of objects, detailed attribute descriptions, and directed relationships between objects. Beyond covering common object categories, the annotations capture complex relationships between objects as directed scene graphs, so the dataset can be used to evaluate the quality of detailed image descriptions comprehensively.
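To make the directed scene graph idea concrete, here is a minimal in-memory sketch. The class and field names are hypothetical and do not reflect the dataset's actual file schema (see the README in the archive for that). Objects are nodes that carry an attribute description, and each relationship is a directed edge from a subject to an object.

```python
from dataclasses import dataclass, field


@dataclass
class ObjectNode:
    name: str              # object category, e.g. "dog"
    description: str = ""  # free-text attribute description


@dataclass
class SceneGraph:
    objects: dict = field(default_factory=dict)    # object id -> ObjectNode
    relations: list = field(default_factory=list)  # (subject_id, predicate, object_id)

    def add_object(self, obj_id: str, name: str, description: str = "") -> None:
        self.objects[obj_id] = ObjectNode(name, description)

    def add_relation(self, subject_id: str, predicate: str, object_id: str) -> None:
        # Edges are directed: subject -> object.
        if subject_id not in self.objects or object_id not in self.objects:
            raise KeyError("both endpoints must be added before the relation")
        self.relations.append((subject_id, predicate, object_id))

    def outgoing(self, subject_id: str) -> list:
        """Return (predicate, object_id) pairs for edges leaving `subject_id`."""
        return [(p, o) for s, p, o in self.relations if s == subject_id]


graph = SceneGraph()
graph.add_object("dog_1", "dog", "a brown dog with a red collar")
graph.add_object("frisbee_1", "frisbee", "a yellow plastic frisbee")
graph.add_relation("dog_1", "is catching", "frisbee_1")
print(graph.outgoing("dog_1"))
```

Because the edges are directed, `graph.outgoing("frisbee_1")` is empty here: the relation runs from the dog to the frisbee, not the other way round.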

CompreCap.torrent
  • CompreCap/
    • README.md (2.05 KB)
    • README.txt (4.11 KB)
    • data/
      • CompreCap.zip (46.29 MB)
