HyperAI

Video-to-Image Affordance Grounding on OPRA

Metrics

KLD
Top-1 Action Accuracy

Results

Performance results of various models on this benchmark

| Model name | KLD | Top-1 Action Accuracy | Paper Title | Repository |
|---|---|---|---|---|
| Afformer (ViTDet-B encoder) | 1.51 | 52.27 | Affordance Grounding from Demonstration Video to Target Image | |
| Afformer (ResNet-50-FPN encoder) | 1.55 | 52.14 | Affordance Grounding from Demonstration Video to Target Image | |
| Demo2Vec | 2.34 | 40.79 | Demo2Vec: Reasoning Object Affordances From Online Videos | - |