Visual Question Answering Vqa On Activitynet 1

평가 지표

ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch
Paper TitleRepository
BLIP-2 T553.3974.7115.707.0762.0275.1318.098.84Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
0 of 1 row(s) selected.
Visual Question Answering Vqa On Activitynet 1 | SOTA | HyperAI초신경