HyperAI초신경


Open Vocabulary Attribute Detection on OVAD 1

Evaluation Metric

mean average precision
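
As a rough illustration of the metric, the sketch below computes a generic, non-interpolated mean average precision: average precision (AP) is taken per class from a score-ranked list, then averaged over classes. It is not the benchmark's official evaluation code, which may differ (for example in interpolation or in how unannotated attributes are handled). The function names `average_precision` and `mean_average_precision` are hypothetical, and the leaderboard values appear to be percentages, so the result would be multiplied by 100 to compare.

```python
def average_precision(scores, labels):
    """Non-interpolated AP for one class.

    scores: predicted confidences, one per example.
    labels: 1 for a positive example, 0 for a negative one.
    Returns None when the class has no positives (AP is undefined).
    """
    # Rank examples from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precisions = []
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            # Precision at the rank of each positive example.
            precisions.append(hits / rank)
    if not precisions:
        return None
    return sum(precisions) / len(precisions)


def mean_average_precision(scores_per_class, labels_per_class):
    """Mean of per-class AP, skipping classes without positives."""
    aps = [
        average_precision(scores, labels)
        for scores, labels in zip(scores_per_class, labels_per_class)
    ]
    aps = [ap for ap in aps if ap is not None]
    if not aps:
        raise ValueError("no class has a positive example")
    return sum(aps) / len(aps)


if __name__ == "__main__":
    # Two toy classes (assumed data, for illustration only).
    scores = [[0.9, 0.8, 0.7, 0.6], [0.2, 0.9]]
    labels = [[1, 0, 1, 0], [1, 0]]
    print(round(100 * mean_average_precision(scores, labels), 1))
```

Skipping classes with no positive examples is a design choice made here to keep the average well defined; other conventions exist.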

Evaluation Results

Performance of each model on this benchmark

Model	Mean average precision	Paper Title
X-VLM	28.0	Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
CLIP ViT-B16	16.6	Learning Transferable Visual Models From Natural Language Supervision
BLIP	24.3	BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Open CLIP ViT-B32	17.0	Reproducible scaling laws for contrastive language-image learning
OVAD-Baseline-Box	21.4	Open-vocabulary Attribute Detection
BLIP 2 (pretrained)	25.5	BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
ALBEF	21.0	Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
