HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
SOTA
Zero-Shot Video Question Answer

Zero-Shot Video Question Answer

The Zero-Shot Video Question Answering task aims to evaluate the ability of large language models to answer questions on specific video data they have never seen before. This task falls under the category of inference, where the model analyzes the content of the video and generates accurate answers, thereby enhancing its application value in multimodal understanding and interaction.

BT-Adapter (zero-shot)

EgoSchema (fullset)

BIMBA-LLaVA-Qwen2-7B

EgoSchema (subset)

FrozenBiLM (with speech)

Video-MME (w/o subs)

Video-RAG (based on LLaVA-Video)

Zero-shot Video Question Answering on LongVideoBench

CinePile: A Long Video Question Answering Dataset and Benchmark

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

HyperAI

Main

GPU

Console
Studio
Docs
Pricing

Pulse

News

Resources

Papers
Notebooks
Datasets
Wiki

Benchmarks

SOTA
LLM Models
GPU Leaderboard

Community

Events

Utility

About Terms of Service Privacy Policy
English

Command Palette

Search for a command to run...

HyperAI
SOTA
Zero-Shot Video Question Answer

Zero-Shot Video Question Answer

The Zero-Shot Video Question Answering task aims to evaluate the ability of large language models to answer questions on specific video data they have never seen before. This task falls under the category of inference, where the model analyzes the content of the video and generates accurate answers, thereby enhancing its application value in multimodal understanding and interaction.

BT-Adapter (zero-shot)

EgoSchema (fullset)

BIMBA-LLaVA-Qwen2-7B

EgoSchema (subset)

FrozenBiLM (with speech)

Video-MME (w/o subs)

Video-RAG (based on LLaVA-Video)

Zero-shot Video Question Answering on LongVideoBench

CinePile: A Long Video Question Answering Dataset and Benchmark

Build the Future of Artificial Intelligence

About

About Us Support Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)