Visual Question Answering On Mmhal Bench

评估指标

Hallucination Rate
Score

评测结果

各个模型在此基准测试上的表现结果

模型名称
Hallucination Rate
Score
Paper TitleRepository
RLAIF-V 7B29.23.06RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness-
RLAIF-V 12B29.23.36RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness-
0 of 2 row(s) selected.
Visual Question Answering On Mmhal Bench | SOTA | HyperAI超神经