HyperAI초신경

Action Segmentation on COIN

Evaluation Metric

Frame accuracy
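
Frame accuracy is commonly understood as the percentage of video frames whose predicted action label matches the ground-truth label. The sketch below illustrates that definition only. The function name `frame_accuracy` and the sample labels are hypothetical, and the exact COIN evaluation protocol (for example, how background frames are handled) is not stated on this page and may differ.

```python
from typing import Sequence


def frame_accuracy(predicted: Sequence[int], ground_truth: Sequence[int]) -> float:
    """Return the percentage of frames whose predicted label equals the ground truth.

    Assumption: every frame counts equally, including background frames.
    """
    if len(predicted) != len(ground_truth):
        raise ValueError("predicted and ground_truth must have the same length")
    if len(ground_truth) == 0:
        raise ValueError("at least one frame is required")
    correct = sum(p == g for p, g in zip(predicted, ground_truth))
    return 100.0 * correct / len(ground_truth)


# Hypothetical per-frame labels for one short video (0 = background).
predicted_labels = [0, 0, 1, 1, 2, 2, 2, 0]
true_labels = [0, 0, 1, 2, 2, 2, 2, 0]
print(frame_accuracy(predicted_labels, true_labels))  # 7 of 8 frames match: 87.5
```

Reporting the value as a percentage matches the scale of the scores listed in the results table.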

Evaluation Results

Performance of each model on this benchmark

Model	Frame accuracy	Paper Title
UnLoc-L	72.8	UnLoc: A Unified Framework for Video Localization Tasks
UniVL	70.0	UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation
Norton	69.8	Multi-granularity Correspondence Learning from Long-term Noisy Videos
VideoCLIP	68.7	VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
VLM	68.4	VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
TACo	68.4	TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
MIL-NCE	61.0	End-to-End Learning of Visual Representations from Uncurated Instructional Videos
ActBERT	57.0	ActBERT: Learning Global-Local Video-Text Representations
CBT	53.9	End-to-End Learning of Visual Representations from Uncurated Instructional Videos
