HyperAI초신경
Action Segmentation on COIN
Evaluation metric
Frame accuracy
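As a rough illustration of the metric, frame accuracy is usually taken to be the percentage of video frames whose predicted step label matches the ground-truth label. The sketch below assumes integer labels per frame; the function name `frame_accuracy` and the sample labels are hypothetical, and the exact COIN evaluation protocol (for example, how background frames are treated) may differ between papers.

```python
from typing import Sequence


def frame_accuracy(predicted: Sequence[int], ground_truth: Sequence[int]) -> float:
    """Return the percentage of frames whose predicted label equals the ground truth.

    Hypothetical helper for illustration; not taken from any benchmark toolkit.
    """
    if len(predicted) != len(ground_truth):
        raise ValueError("label sequences must have the same length")
    if len(ground_truth) == 0:
        raise ValueError("label sequences must not be empty")
    # Count frames where prediction and annotation agree.
    correct = sum(p == g for p, g in zip(predicted, ground_truth))
    return 100.0 * correct / len(ground_truth)


# Made-up per-frame labels: 4 of 5 frames agree.
score = frame_accuracy([0, 0, 1, 1, 2], [0, 1, 1, 1, 2])
print(score)
```

Reporting the value as a percentage matches the scale of the scores listed on this page.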
Evaluation results
Performance of each model on this benchmark
| Model name | Frame accuracy | Paper Title | Repository |
|---|---|---|---|
| Norton | 69.8 | Multi-granularity Correspondence Learning from Long-term Noisy Videos | |
| VLM | 68.4 | VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | |
| CBT | 53.9 | End-to-End Learning of Visual Representations from Uncurated Instructional Videos | |
| MIL-NCE | 61.0 | End-to-End Learning of Visual Representations from Uncurated Instructional Videos | |
| VideoCLIP | 68.7 | VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding | |
| UnLoc-L | 72.8 | UnLoc: A Unified Framework for Video Localization Tasks | |
| TACo | 68.4 | TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment | - |
| ActBERT | 57.0 | ActBERT: Learning Global-Local Video-Text Representations | - |
| UniVL | 70.0 | UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation | |