Date

7 months ago

Paper URL

arxiv.org

License

Apache 2.0

Dataset structure:

The dataset contains 2 subsets, with a total of 11,184 samples:

sft_data: supervised fine-tuning data for video captioning models (9,419 samples)
mcts_vcb: evaluation data with MCTS-generated captions and keypoints, used for the MCTS-VCB benchmark (1,765 samples)
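As a quick sanity check on the counts above, the subset sizes can be tallied in a few lines of Python. This is only an illustrative sketch: the names `SUBSET_SIZES` and `summarize` are hypothetical and not part of any official loader, and the counts are copied from the description rather than read from the dataset files.

```python
# Sample counts as stated in the dataset description (assumed, not read from the files).
SUBSET_SIZES = {
    "sft_data": 9_419,  # supervised fine-tuning data
    "mcts_vcb": 1_765,  # MCTS-VCB benchmark evaluation data
}


def summarize(sizes):
    """Return the total sample count and each subset's share of that total."""
    total = sum(sizes.values())
    shares = {name: count / total for name, count in sizes.items()}
    return total, shares


total, shares = summarize(SUBSET_SIZES)
print(f"total samples: {total}")
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

The total printed here should match the 11,184 samples stated for the dataset as a whole.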

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

FrontierScience Inference Research Task Evaluation Dataset

2 months ago

VideoRewardBench Video Reward Model Evaluation Dataset

2 months ago

OST-Bench Spatiotemporal Scene Understanding Benchmark Dataset

3 months ago

25.58 GB

VAP-Data Visual Action Performance Dataset

2 months ago

X-Dance Image-Driven Dance Motion Dataset

2 months ago

147.3 MB

INFINITY-CHAT Real Open Question Answering Dataset

2 months ago

Arena-Write Writing Generation Evaluation Dataset

2 months ago

AutoDock-GPU_Output Docking Result Dataset

3 months ago

PhysDriver Physiological Test Dataset

2 months ago

AutoCaption Video Caption Benchmark Dataset
