Natural Language Moment Retrieval on TACoS
Evaluation metrics
R@1,IoU=0.3
R@1,IoU=0.5
R@1,IoU=0.7
mIoU
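
The page lists these metrics without defining them. As a minimal sketch, assuming the definitions commonly used in the moment retrieval literature: R@1,IoU=m is the percentage of queries whose top-ranked predicted segment overlaps the ground-truth segment with temporal IoU of at least m, and mIoU is the mean temporal IoU of the top-ranked predictions. The names `temporal_iou` and `evaluate` are hypothetical, segments are assumed to be (start, end) pairs in seconds, and the `>=` comparison is an assumption (some evaluation scripts use a strict `>`).

```python
from typing import Dict, Sequence, Tuple

Span = Tuple[float, float]  # (start_sec, end_sec), assumed start <= end


def temporal_iou(pred: Span, gt: Span) -> float:
    """Intersection over union of two 1-D time intervals."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def evaluate(
    top1_preds: Sequence[Span],
    gts: Sequence[Span],
    thresholds: Sequence[float] = (0.3, 0.5, 0.7),
) -> Dict[str, float]:
    """Return R@1 at each IoU threshold and mIoU, all as percentages."""
    if len(top1_preds) != len(gts) or not gts:
        raise ValueError("need one top-1 prediction per ground-truth query")
    ious = [temporal_iou(p, g) for p, g in zip(top1_preds, gts)]
    n = len(ious)
    scores = {
        # Share of queries whose top-1 prediction reaches the threshold.
        f"R@1,IoU={t}": 100.0 * sum(iou >= t for iou in ious) / n
        for t in thresholds
    }
    scores["mIoU"] = 100.0 * sum(ious) / n
    return scores


if __name__ == "__main__":
    # Toy data, not taken from TACoS: one exact hit, one partial overlap, one miss.
    preds = [(0.0, 10.0), (5.0, 15.0), (0.0, 4.0)]
    gts = [(0.0, 10.0), (10.0, 20.0), (10.0, 20.0)]
    for name, value in evaluate(preds, gts).items():
        print(f"{name}: {value:.2f}")
```

Reporting the scores as percentages matches the scale of the numbers in the results table, although the exact evaluation script behind each entry is not given on this page.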
Results
Performance of each model on this benchmark
| Model | R@1,IoU=0.3 | R@1,IoU=0.5 | R@1,IoU=0.7 | mIoU | Paper Title |
|---|---|---|---|---|---|
| BAM-DETR | 56.69 | 41.54 | 26.77 | 39.31 | BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos |
| GVL (paragraph-level) | 48.29 | 36.07 | - | - | Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos |
| GVL | 45.92 | 34.57 | - | - | Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos |
| SG-DETR | 56.71 | 44.70 | 29.90 | 40.90 | Saliency-Guided DETR for Moment Retrieval and Highlight Detection |
| LD-DETR | 57.61 | 44.31 | 26.24 | 40.30 | LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection |
| UVCOM | - | 36.39 | 23.32 | - | Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection |
| FlashVTG | 53.71 | 41.76 | 24.74 | 37.61 | FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding |
| UniVTG | 51.44 | 34.97 | 21.07 | 35.76 | UniVTG: Towards Unified Video-Language Temporal Grounding |
| SG-DETR (w/ PT) | 58.10 | 46.40 | 33.90 | 42.40 | Saliency-Guided DETR for Moment Retrieval and Highlight Detection |
| VLG-Net | 45.46 | 34.19 | - | - | VLG-Net: Video-Language Graph Matching Network for Video Grounding |
| LLMEPET | 52.73 | 40.12 | 22.78 | 36.55 | Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval |
| CG-DETR | 52.23 | 39.61 | 22.23 | 36.48 | Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding |