HyperAIHyperAI超神经
首页资讯最新论文教程数据集百科SOTALLM 模型天梯GPU 天梯顶会
全站搜索
关于
中文
HyperAIHyperAI超神经
  1. 首页
  2. SOTA
  3. 视觉问答
  4. Visual Question Answering On Benchlmm

Visual Question Answering On Benchlmm

评估指标

GPT-3.5 score

评测结果

各个模型在此基准测试上的表现结果

模型名称
GPT-3.5 score
Paper TitleRepository
MiniGPT4-13B34.93MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models-
InstructBLIP-7B44.63InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning-
LLaVA-1.5-13B55.53Improved Baselines with Visual Instruction Tuning-
Sphinx-V2-1K57.43SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models-
LLaVA-1.5-7B46.83Visual Instruction Tuning-
InstructBLIP-13B45.03InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning-
MiniGPTv2-7B30.1MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning-
GPT-4V58.37GPT-4 Technical Report-
LLaVA-1-13B43.50Visual Instruction Tuning-
Otter-7B39.13Otter: A Multi-Modal Model with In-Context Instruction Tuning-
0 of 10 row(s) selected.
HyperAI

学习、理解、实践,与社区一起构建人工智能的未来

中文

关于

关于我们数据集帮助

产品

资讯教程数据集百科

链接

TVM 中文Apache TVMOpenBayes

© HyperAI超神经

津ICP备17010941号-1京公网安备11010502038810号京公网安备11010502038810号
TwitterBilibili