HyperAI超神经
首页
资讯
最新论文
教程
数据集
百科
SOTA
LLM 模型天梯
GPU 天梯
顶会
开源项目
全站搜索
关于
中文
HyperAI超神经
Toggle sidebar
全站搜索…
⌘
K
首页
SOTA
Action Segmentation
Action Segmentation On Coin
Action Segmentation On Coin
评估指标
Frame accuracy
评测结果
各个模型在此基准测试上的表现结果
Columns
模型名称
Frame accuracy
Paper Title
Repository
Norton
69.8
Multi-granularity Correspondence Learning from Long-term Noisy Videos
VLM
68.4
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
CBT
53.9
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
MIL-NCE
61.0
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
VideoClip
68.7
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
UnLoc-L
72.8
UnLoc: A Unified Framework for Video Localization Tasks
TACo
68.4
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
-
ActBERT
57.0
ActBERT: Learning Global-Local Video-Text Representations
-
Univl
70.0
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation
0 of 9 row(s) selected.
Previous
Next