Visual Question Answering On Mmhal Bench

Hallucination Rate

Score

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

モデル名	Hallucination Rate	Score	Paper Title	Repository
RLAIF-V 7B	29.2	3.06	RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
RLAIF-V 12B	29.2	3.36	RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

0 of 2 row(s) selected.