HyperAIHyperAI

Command Palette

Search for a command to run...

PubMedVision Large-Scale Medical VQA Dataset

Date

a year ago

Size

53.54 GB

Organization

Publish URL

github.com

Paper URL

arxiv.org

* This dataset supports online use.Click here to jump.

PubMedVision is a large-scale and high-quality medical multimodal dataset created in 2024 by a research team from Shenzhen Big Data Research Institute, the Chinese University of Hong Kong, and the National Health Data Institute. It contains 1.3 million medical VQA samples.HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale".

This dataset uses sophisticated data processing methods to select medical-related images and informative image descriptions from papers in the international medical journal PubMed, effectively filtering out a large number of medical-irrelevant images and context-irrelevant content. In order to improve the alignment of image and text data, the research team used the large visual model (GPT-4V) to re-describe the images and constructed 10 scene dialogues, rewriting the image and text data into a question-and-answer format, which enhanced the learning of medical visual knowledge.

PubMedVision.torrent
Seeding 1Downloading 0Completed 216Total Downloads 679
  • PubMedVision/
    • README.md
      1.46 KB
    • README.txt
      2.93 KB
      • data/
        • PubMedVision.zip
          53.54 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp