HyperAIHyperAI초신경
홈뉴스연구 논문튜토리얼데이터셋백과사전SOTALLM 모델GPU 랭킹컨퍼런스
전체 검색
소개
한국어
HyperAIHyperAI초신경
  1. 홈
  2. SOTA
  3. 시각적 질문 응답
  4. Visual Question Answering On Benchlmm

Visual Question Answering On Benchlmm

평가 지표

GPT-3.5 score

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
GPT-3.5 score
Paper TitleRepository
MiniGPT4-13B34.93MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
InstructBLIP-7B44.63InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
LLaVA-1.5-13B55.53Improved Baselines with Visual Instruction Tuning
Sphinx-V2-1K57.43SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
LLaVA-1.5-7B46.83Visual Instruction Tuning
InstructBLIP-13B45.03InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
MiniGPTv2-7B30.1MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
GPT-4V58.37GPT-4 Technical Report
LLaVA-1-13B43.50Visual Instruction Tuning
Otter-7B39.13Otter: A Multi-Modal Model with In-Context Instruction Tuning
0 of 10 row(s) selected.
HyperAI

학습, 이해, 실천, 커뮤니티와 함께 인공지능의 미래를 구축하다

한국어

소개

회사 소개데이터셋 도움말

제품

뉴스튜토리얼데이터셋백과사전

링크

TVM 한국어Apache TVMOpenBayes

© HyperAI초신경

TwitterBilibili