HyperAI

Moment Retrieval on Charades-STA

Metrics

R@1 IoU=0.5
R@1 IoU=0.7
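
The page lists the two metrics without defining them. Under the usual convention for moment retrieval, R@1 IoU=m is the percentage of text queries whose top-ranked predicted segment overlaps the ground-truth segment with a temporal IoU of at least m. The sketch below illustrates that convention; the function names, the (start, end) tuple format in seconds, and the use of `>=` rather than `>` at the threshold are assumptions made for illustration, not details taken from this benchmark's evaluation code.

```python
def temporal_iou(pred, gt):
    """Temporal IoU of two (start, end) segments, given in seconds."""
    # Overlap length is clamped at zero for disjoint segments.
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_1(top1_preds, gts, threshold):
    """Percentage of queries whose top-1 prediction reaches the IoU threshold."""
    # Assumption: a prediction counts as a hit when IoU >= threshold.
    hits = sum(temporal_iou(p, g) >= threshold for p, g in zip(top1_preds, gts))
    return 100.0 * hits / len(gts)


# Hypothetical data: one top-1 prediction and one ground truth per query.
preds = [(2.0, 8.0), (0.0, 5.0), (10.0, 20.0), (0.0, 4.0)]
gts = [(2.0, 12.0), (4.0, 9.0), (10.0, 19.0), (0.0, 4.0)]

print(recall_at_1(preds, gts, 0.5))  # 75.0
print(recall_at_1(preds, gts, 0.7))  # 50.0
```

In this toy example the first query has an IoU of 0.6, so it counts at the 0.5 threshold but not at 0.7, which is why the stricter metric is always the lower (or equal) of the two.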

Results

Performance of various models on this benchmark

Comparison table
| Model name | R@1 IoU=0.5 | R@1 IoU=0.7 |
| --- | --- | --- |
| video-mamba-suite-state-space-model-as-a | 57.18 | 36.05 |
| saliency-guided-detr-for-moment-retrieval-and | 71.10 | 52.80 |
| timesuite-improving-mllms-for-long-video | 48.7 | 24.0 |
| simvtp-simple-video-text-pre-training-with | 44.7 | 26.3 |
| correlation-guided-query-dependency | 58.44 | 36.34 |
| unloc-a-unified-framework-for-video | 60.8 | 38.4 |
| umt-unified-multi-modal-transformers-for | 49.35 | 26.16 |
| llava-mr-large-language-and-vision-assistant | 70.65 | 49.58 |
| saliency-guided-detr-for-moment-retrieval-and | 70.20 | 49.50 |
| ld-detr-loop-decoder-detection-transformer | 62.58 | 41.56 |
| bridging-the-gap-a-unified-video | 59.25 | 36.64 |
| qvhighlights-detecting-moments-and-highlights | 53.63 | 31.37 |
| umt-unified-multi-modal-transformers-for | 48.31 | 29.25 |
| internvideo2-scaling-video-foundation-models | 70.03 | 48.95 |
| videolights-feature-refinement-and-cross-task | 61.96 | 41.05 |
| internvideo2-scaling-video-foundation-models | 68.36 | 45.03 |
| overcoming-weak-visual-textual-alignment-for | 59.48 | 38.33 |
| unimd-towards-unifying-moment-retrieval-and | 63.98 | 44.46 |
| qvhighlights-detecting-moments-and-highlights | 55.65 | 34.17 |
| unloc-a-unified-framework-for-video | 58.1 | 35.4 |
| bam-detr-boundary-aligned-moment-detection | 59.95 | 39.38 |
| timesuite-improving-mllms-for-long-video | 67.1 | 43.0 |
| prior-knowledge-integration-via-llm-encoding | 58.31 | 36.49 |
| query-dependent-video-representation-for | 57.31 | 32.55 |
| flashvtg-feature-layering-and-adaptive-score | 70.32 | 49.87 |