HyperAI超神经

Theoremqa

评估指标

all
bool
csu0026ee
finance
float
integer
list
llm_model
math
model_url
option
organization
parameters
physics
release_date
updated_time

评测结果

各个模型在此基准测试上的表现结果

模型名称
all
bool
csu0026ee
finance
float
integer
list
llm_model
math
model_url
option
organization
parameters
physics
release_date
updated_time
Paper TitleRepository
API16.646.634.212.311.711.66.8GPT-315.8https://openai.com/index/gpt-3-apps/27.8OpenAIN/A2.32022.3.12023.12.6--
0 of 1 row(s) selected.