Mantis-Instruct Multi-image Instruction Tuning Dataset
Date
8 months ago
Publish URL
Categories
The Mantis-Instruct dataset was released in 2024 by the University of Waterloo, Tsinghua University, and the Shanghai Artificial Intelligence Laboratory.MANTIS: Interleaved Multi-Image Instruction Tuning". A text-image interleaved multimodal dataset focused on multi-image instruction tuning, consisting of 14 subsets containing 721K examples, for training the Mantis model family. The dataset covers a variety of multi-image skills, including coreference, reasoning, comparison, and temporal understanding. It not only includes 10 subsets from existing datasets, but also adds 4 newly curated datasets to expand the model's reasoning skills and other multi-image skills.