Envision Multi-Stage Event Visual Generation Dataset

Date

4 days ago

Organization

Shanghai Artificial Intelligence Laboratory

Paper URL

2512.01816

License

MIT

Envision is a dataset of multi-image and text pairs released by the Shanghai Artificial Intelligence Laboratory in 2025. The related research paper is titled "Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights". The dataset is designed to test a model's ability to understand causality and to generate multi-stage events in real-world situations.

The dataset contains 1,000 event sequences and 4,000 text prompts (four stages per sequence), covering six subject areas across the natural sciences and history/culture. The event materials are sourced from textbooks and online resources, selected by experts, and then generated and polished with GPT-4o into narrative prompts with clear causal chains and a progressive stage structure.

Data composition:

  • Subject coverage (6 categories in total)
    • Natural Sciences (75%): Physics, Chemistry, Biology, Meteorology, Geography
    • History and Culture (25%)
  • Causal structure types
    • Continuous causality: continuous changes within a single spatial scene, suited to fine-grained physical and chemical processes.
    • Discrete causality: jumps across stages in time and space, suited to geological evolution, life cycles, and historical events.
Dataset distribution and examples
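
The composition described above (four stage prompts per event sequence, six subject categories, two causal structure types) can be sketched as a small Python record type. This is only an illustrative sketch: the class name `EventSequence`, its field names, the category strings, and the helper `group_shares` are all assumptions made here, not the dataset's published schema, and the prompts in the usage example are placeholders rather than real dataset entries.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple

# Hypothetical labels; the real dataset may name these differently.
STAGES_PER_SEQUENCE = 4
NATURAL_SCIENCES = {"physics", "chemistry", "biology", "meteorology", "geography"}
HISTORY_AND_CULTURE = {"history_and_culture"}
CAUSALITY_TYPES = {"continuous", "discrete"}


@dataclass(frozen=True)
class EventSequence:
    """One event: a subject, a causal structure type, and four stage prompts."""
    subject: str
    causality: str
    stage_prompts: Tuple[str, ...]

    def __post_init__(self) -> None:
        # Reject records that do not match the composition described above.
        if self.subject not in NATURAL_SCIENCES | HISTORY_AND_CULTURE:
            raise ValueError(f"unknown subject: {self.subject!r}")
        if self.causality not in CAUSALITY_TYPES:
            raise ValueError(f"unknown causality type: {self.causality!r}")
        if len(self.stage_prompts) != STAGES_PER_SEQUENCE:
            raise ValueError("each sequence needs exactly four stage prompts")


def group_shares(sequences: Iterable[EventSequence]) -> Dict[str, float]:
    """Return the fraction of sequences in each top-level subject group."""
    counts = Counter(
        "natural_sciences" if s.subject in NATURAL_SCIENCES else "history_and_culture"
        for s in sequences
    )
    total = sum(counts.values())
    return {group: n / total for group, n in counts.items()}


# Usage with placeholder prompts (not real dataset content).
placeholder = ("stage 1", "stage 2", "stage 3", "stage 4")
sample = [
    EventSequence("physics", "continuous", placeholder),
    EventSequence("chemistry", "continuous", placeholder),
    EventSequence("geography", "discrete", placeholder),
    EventSequence("history_and_culture", "discrete", placeholder),
]
shares = group_shares(sample)
```

On a sample like this one, `group_shares` can be used to check that a loaded subset roughly follows the 75% / 25% split between natural sciences and history and culture.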
