Visual Question Answering on MMHal-Bench

Evaluation Metrics

Hallucination Rate
Score

Evaluation Results

Performance of each model on this benchmark.

Model        Hallucination Rate  Score  Paper Title                                                             Repository
RLAIF-V 7B   29.2                3.06   RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness  -
RLAIF-V 12B  29.2                3.36   RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness  -