Mmlu

评估指标

key
mmlu
mmluhumanities
mmluother
mmlusocialscience
mmlustem
model
num
org
rank
time

评测结果

各个模型在此基准测试上的表现结果

模型名称
key
mmlu
mmluhumanities
mmluother
mmlusocialscience
mmlustem
model
num
org
rank
time
Paper TitleRepository
Chat1.00000083.00000087.00000083.60000089.80000075.700000GPT-4N/AOpenAI1.0000002023/3/15--
0 of 1 row(s) selected.