HyperAI

ShareGPT4V Large-scale High-quality Image and Text Dataset

Date

a year ago

Size

466.32 MB

Organization

University of Science and Technology of China
Shanghai Artificial Intelligence Laboratory

Publish URL

github.com

License

CC BY-SA 4.0


ShareGPT4V is a large-scale, high-quality dataset of image-text pairs for training vision-language models (VLMs), with the aim of improving image understanding and text generation. It contains 1.2 million image-text pairs intended to align visual and language features and to strengthen instruction following, and it incorporates data from academic tasks such as ScienceQA, TextVQA, and SBU. Models trained with this dataset are reported to show significantly better image-text alignment, a key aspect of multimodal representation learning.

This dataset was released in 2023 by the University of Science and Technology of China and the Shanghai Artificial Intelligence Laboratory.

ShareGPT4V.torrent
Seeding: 1 | Downloading: 1 | Completed: 79 | Total Downloads: 115
  • ShareGPT4V/
    • README.md (1.51 KB)
    • README.txt (3.03 KB)
    • data/
      • ShareGPT4V.zip (466.32 MB)
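
Once the torrent has finished downloading, the archive needs to be unpacked before the data can be used. The following is a minimal sketch using only the Python standard library. The internal layout of ShareGPT4V.zip is not described on this page, so the code makes no assumption about file names inside it: `extract_archive` and `summarize_json` are hypothetical helper names introduced for illustration, and the output directory is an arbitrary choice.

```python
import json
import zipfile
from pathlib import Path


def extract_archive(zip_path, out_dir):
    """Extract a zip archive and return the sorted paths of the files it contained."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(out_dir)
        # Skip directory entries so only real files are reported.
        names = [info.filename for info in archive.infolist() if not info.is_dir()]
    return sorted(names)


def summarize_json(path):
    """Return (top-level type name, number of entries) for a JSON file."""
    with open(path, encoding="utf-8") as handle:
        data = json.load(handle)
    size = len(data) if isinstance(data, (list, dict)) else 1
    return type(data).__name__, size


if __name__ == "__main__":
    # Path follows the torrent listing above; adjust it to your download location.
    archive_path = Path("ShareGPT4V/data/ShareGPT4V.zip")
    if archive_path.exists():
        for name in extract_archive(archive_path, "ShareGPT4V/data/extracted"):
            print(name)
```

If the archive turns out to contain JSON annotation files, calling `summarize_json` on one of the extracted paths gives a quick check of its top-level structure and entry count before loading it into a training pipeline.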