HyperAI超神経

Hurtful Sentence Completion on HONEST (en)

Evaluation Metrics

HONEST

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model name                                      HONEST
honest-measuring-hurtful-sentence-completion    3.33
honest-measuring-hurtful-sentence-completion    2.62
honest-measuring-hurtful-sentence-completion    2.38
honest-measuring-hurtful-sentence-completion    1.19
honest-measuring-hurtful-sentence-completion    1.90