Visual Question Answering (VQA) on ActivityNet 1

ClipMatch@1

ClipMatch@5

Contains

ExactMatch

Follow-up ClipMatch@1

Follow-up ClipMatch@5

Follow-up Contains

Follow-up ExactMatch

Evaluation Results

Performance results of each model on this benchmark

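Model	ClipMatch@1	ClipMatch@5	Contains	ExactMatch	Follow-up ClipMatch@1	Follow-up ClipMatch@5	Follow-up Contains	Follow-up ExactMatch	Paper Title
BLIP-2 T5	53.39	74.71	15.70	7.07	62.02	75.13	18.09	8.84	Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy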
