HyperAI超神経

Question Answering On Story Cloze

評価指標

Accuracy

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Accuracy
language-models-are-few-shot-learners87.7
ask-me-anything-a-simple-strategy-for76.3
palm-2-technical-report-186.7
palm-2-technical-report-185.6
palm-2-technical-report-187.4
ask-me-anything-a-simple-strategy-for87.8
ask-me-anything-a-simple-strategy-for51.0