Physical Commonsense Reasoning On Physical
평가 지표
Without Audio (Acc %)
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | Without Audio (Acc %) | Paper Title | Repository |
---|---|---|---|
UNITER (Large) | 60.6 ± 2.2 | PACS: A Dataset for Physical Audiovisual CommonSense Reasoning | |
Human | 90.5 ± 3.1 | PACS: A Dataset for Physical Audiovisual CommonSense Reasoning | |
Merlot Reserve (Large) | 68.4 ± 0.7 | PACS: A Dataset for Physical Audiovisual CommonSense Reasoning | |
Majority | 50.4 | PACS: A Dataset for Physical Audiovisual CommonSense Reasoning | |
CLIP/AudioCLIP | 56.3 ± 0.7 | PACS: A Dataset for Physical Audiovisual CommonSense Reasoning | |
Late Fusion | 52.5 ± 1.6 | PACS: A Dataset for Physical Audiovisual CommonSense Reasoning |
0 of 6 row(s) selected.