Date

4 years ago

Organization

Publish URL

davar-lab.github.io

Paper URL

arxiv.org

License

Other

Tags

Video Understanding

Visual Question Answering

Image Understanding

Multimodal Representation

LSVTD stands for large-scale video text dataset, which contains 100 videos from 21 natural scenes. The dataset covers a wide range of 13 indoor (such as bookstores, shopping malls) and 9 outdoor scenes, and its diversity is more than 3 times that of the IC15 dataset.

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Discuss on Discord

Date

4 years ago

Organization

Publish URL

davar-lab.github.io

Paper URL

arxiv.org

License

Other

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

2 months ago

THINGS-EEG EEG Dataset

4 months ago

THINGS-MEG Magnetoencephalography Dataset

4 months ago

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

4 months ago

RoVid-X Robot Video Generation Dataset

2 months ago

TransPhy3D Transparent Reflection Synthesis Video Dataset

5 months ago

MCD-rPPG Multi-Camera Remote Photoplethysmography Dataset

5 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

LSVTD Video Text Understanding Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

LSVTD Video Text Understanding Dataset

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RoVid-X Robot Video Generation Dataset

TransPhy3D Transparent Reflection Synthesis Video Dataset

MCD-rPPG Multi-Camera Remote Photoplethysmography Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

LSVTD Video Text Understanding Dataset

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RoVid-X Robot Video Generation Dataset

TransPhy3D Transparent Reflection Synthesis Video Dataset

MCD-rPPG Multi-Camera Remote Photoplethysmography Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RoVid-X Robot Video Generation Dataset

TransPhy3D Transparent Reflection Synthesis Video Dataset

MCD-rPPG Multi-Camera Remote Photoplethysmography Dataset

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RoVid-X Robot Video Generation Dataset

TransPhy3D Transparent Reflection Synthesis Video Dataset

MCD-rPPG Multi-Camera Remote Photoplethysmography Dataset