HyperAI초신경

Moment Retrieval On Qvhighlights

평가 지표

R@1 IoU=0.5
R@1 IoU=0.7
mAP
mAP@0.5
mAP@0.75

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름R@1 IoU=0.5R@1 IoU=0.7mAPmAP@0.5mAP@0.75
saliency-guided-detr-for-moment-retrieval-and72.2056.6054.1073.2055.80
prior-knowledge-integration-via-llm-encoding66.7349.9444.0565.7643.91
boundary-denoising-for-video-activity59.2745.07---
bam-detr-boundary-aligned-moment-detection62.7148.6445.3664.5746.33
umt-unified-multi-modal-transformers-for--36.12--
videolights-feature-refinement-and-cross-task70.3655.2547.9469.5349.17
univtg-towards-unified-video-language65.4350.0643.6364.0645.02
bridging-the-gap-a-unified-video64.5348.3143.864.7843.65
query-dependent-video-representation-for62.4044.9839.8662.5239.88
r-2-tuning-efficient-image-to-video-transfer-168.0349.3546.1769.0447.56
flashvtg-feature-layering-and-adaptive-score70.6953.9652.0072.3353.85
모델 1254.536.532.3--
query-dependent-video-representation-for63.0645.1040.1963.0440.10
bam-detr-boundary-aligned-moment-detection64.0748.1246.9165.6147.51
correlation-guided-query-dependency65.4348.3842.8664.5142.77
unloc-a-unified-framework-for-video66.146.7---
ld-detr-loop-decoder-detection-transformer 66.8051.0446.4167.61 46.99
length-aware-detr-for-robust-moment-retrieval63.9451.1047.9365.6549.44
llava-mr-large-language-and-vision-assistant76.5961.4852.7369.4154.40
overcoming-weak-visual-textual-alignment-for60.1243.0540.0863.0840.18
query-dependent-video-representation-for64.146.140.6264.340.5
saliency-guided-detr-for-moment-retrieval-and74.2060.4058.8076.2060.80
qvhighlights-detecting-moments-and-highlights59.7840.3336.1460.5135.36
correlation-guided-query-dependency68.4853.1147.9769.4049.12
query-dependent-video-representation-for63.245.240.063.440.4
umt-unified-multi-modal-transformers-for--38.08--
unloc-a-unified-framework-for-video64.548.8---
bam-detr-boundary-aligned-moment-detection63.8847.9246.6766.3348.22
univtg-towards-unified-video-language58.8640.8635.4757.6035.59
video-mamba-suite-state-space-model-as-a66.6552.1945.1864.3746.68
bridging-the-gap-a-unified-video63.5547.4743.1863.3742.67
internvideo2-scaling-video-foundation-models71.4256.4549.24--