HyperAI

Referring Expression Generation On Coloninst 1

Metrics

Accuray

Results

Performance results of various models on this benchmark

Model Name
Accuray
Paper TitleRepository
LLaVA-Med-v1.5 (w/ LoRA, w/ extra data)70.00LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
LLaVA-Med-v1.5 (w/ LoRA, w/o extra data)73.05LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
LLaVA-v1 (w/ LoRA, w/ extra data)46.85Visual Instruction Tuning
MiniGPT-v2 (w/ LoRA, w/ extra data)70.23MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
MGM-2B (w/o LoRA, w/ extra data)74.30Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
MobileVLM-1.7B (w/o LoRA, w/ extra data)73.14MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
MiniGPT-v2 (w/ LoRA, w/o extra data)72.05MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
MGM-2B (w/o LoRA, w/o extra data)69.81Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
LLaVA-Med-v1.0 (w/o LoRA, w/o extra data)75.07LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
LLaVA-v1 (w/ LoRA, w/o extra data)68.11Visual Instruction Tuning
ColonGPT (w/ LoRA, w/o extra data)80.18Frontiers in Intelligent Colonoscopy
LLaVA-Med-v1.0 (w/o LoRA, w/ extra data)75.25LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Bunny-v1.0-3B (w/ LoRA, w/ extra data)75.08Efficient Multimodal Learning from Data-centric Perspective
LLaVA-v1.5 (w/ LoRA, w/o extra data)70.38Improved Baselines with Visual Instruction Tuning
LLaVA-v1.5 (w/ LoRA, w/ extra data)72.88Improved Baselines with Visual Instruction Tuning
Bunny-v1.0-3B (w/ LoRA, w/o extra data)69.45Efficient Multimodal Learning from Data-centric Perspective
MobileVLM-1.7B (w/ LoRA, w/ extra data)78.03MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
0 of 17 row(s) selected.