MMLU

Metrics

key

mmlu

mmluhumanities

mmluother

mmlusocialscience

mmlustem

model

num

org

rank

time

Results

Performance results of various models on this benchmark.

	key	mmlu	mmluhumanities	mmluother	mmlusocialscience	mmlustem	model	num	org	rank	time	Paper title	Code
Chat	1.000000	83.000000	87.000000	83.600000	89.800000	75.700000	GPT-4	N/A	OpenAI	1.000000	2023/3/15	-

