Zebra-CoT Text-to-Image Inference Dataset

Date: 3 months ago

Size: 63.04 GB

Organization: Columbia University, University of Southern California

Paper URL: arxiv.org

Zebra-CoT is a vision-language reasoning dataset jointly released in 2025 by Columbia University, the University of Maryland, the University of Southern California, and New York University. The accompanying paper is "Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning". The dataset aims to help models better understand the logical relationship between images and text, and it can be applied to tasks such as visual question answering and image captioning to improve reasoning ability and accuracy.

The dataset contains 182,384 samples covering four main categories: scientific reasoning, 2D visual reasoning, 3D visual reasoning, and visual logic and strategy games. Each sample contains a logically coherent, interleaved text-image reasoning trace.

Dataset structure:

  • Problem description: A textual statement of the problem.
  • Question images: Zero or more images accompanying the question, depending on its nature.
  • Reasoning images: One or more visual aids that support the intermediate reasoning steps of the solution.
  • Textual reasoning trace: A sequence of textual thoughts with corresponding visual sketches or diagram placeholders.
  • Final answer: The solution to the problem.
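
To make the field list above concrete, here is a minimal Python sketch of how one sample might be held in memory. The class names, attribute names, and the `is_valid` check are assumptions made for illustration; they are not the dataset's actual schema or column names, which should be checked against the README files shipped with the data.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ReasoningStep:
    """One step of the interleaved trace: text, optionally paired with an image."""
    text: str
    image_path: Optional[str] = None  # hypothetical: path to a reasoning image


@dataclass
class ZebraCoTSample:
    """Hypothetical in-memory view of one sample; field names are assumptions."""
    problem: str                                               # problem description
    final_answer: str                                          # final answer
    question_images: List[str] = field(default_factory=list)  # zero or more
    trace: List[ReasoningStep] = field(default_factory=list)  # interleaved trace

    def reasoning_images(self) -> List[str]:
        """Collect the image paths attached to reasoning steps, in order."""
        return [s.image_path for s in self.trace if s.image_path is not None]

    def is_valid(self) -> bool:
        """Check the constraints from the field list: a problem, an answer,
        and at least one reasoning image."""
        return (
            bool(self.problem)
            and bool(self.final_answer)
            and len(self.reasoning_images()) >= 1
        )


# Made-up example content, not taken from the dataset.
sample = ZebraCoTSample(
    problem="What is the area of a right triangle with legs 3 and 4?",
    final_answer="6",
    trace=[
        ReasoningStep("Sketch the triangle and label the legs.", "step_1.png"),
        ReasoningStep("Area is half the product of the legs: 3 * 4 / 2."),
    ],
)
print(sample.is_valid(), sample.reasoning_images())
```

Keeping each text step and its optional image together in one `ReasoningStep` preserves the interleaving order, which is the property the dataset description emphasizes.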

Dataset field distribution map

Zebra-CoT.torrent
  • Zebra-CoT/
    • README.md (1.9 KB)
    • README.txt (3.8 KB)
    • data/
      • Zebra-CoT.zip (63.04 GB)
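
Because the archive is large, it can be useful to inspect it before extracting anything. The following is a minimal sketch using Python's standard `zipfile` module; the helper name `summarize_archive` is hypothetical, and the internal layout of `Zebra-CoT.zip` is not described on this page, so the sketch only reports file counts, sizes, and extensions.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def summarize_archive(zip_path):
    """Return (file_count, total_uncompressed_bytes, extension_counts)
    by reading the archive's index only; nothing is extracted."""
    with zipfile.ZipFile(zip_path) as archive:
        files = [info for info in archive.infolist() if not info.is_dir()]
    total_bytes = sum(info.file_size for info in files)
    extensions = Counter(
        PurePosixPath(info.filename).suffix.lower() for info in files
    )
    return len(files), total_bytes, extensions


# Example call, assuming the torrent was saved to the current directory:
# count, size, exts = summarize_archive("Zebra-CoT/data/Zebra-CoT.zip")
# print(count, size, exts.most_common(5))
```

Reading only the index keeps the check fast and avoids needing free disk space for the uncompressed contents.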
