HyperAI초신경


Sound Prompted Semantic Segmentation On

Evaluation Metrics

mAP

mIoU

Evaluation Results

Performance of each model on this benchmark

Model Name	mAP	mIoU	Paper Title	Repository
DenseAV	32.7	24.7	Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
DAVENet	16.8	18.1	Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input	-
ImageBIND	19.7	20.5	ImageBind: One Embedding Space To Bind Them All
CAVMAE	26.0	17.0	Contrastive Audio-Visual Masked Autoencoder
