HyperAI超神经

Arithmetic Reasoning On Game Of 24

Evaluation Metrics

Success
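
This page does not define Success. In the Tree of Thoughts paper listed below, it is the fraction of puzzles for which the model outputs a valid equation that equals 24 and uses each of the four input numbers exactly once. The following is a minimal sketch under that assumption; the function names `is_solved` and `success_rate` are hypothetical and not taken from any official evaluation code.

```python
import ast
import re
from fractions import Fraction


def _eval(node):
    """Exactly evaluate an AST restricted to +, -, *, / on integer literals."""
    if isinstance(node, ast.Expression):
        return _eval(node.body)
    if isinstance(node, ast.Constant) and type(node.value) is int:
        return Fraction(node.value)
    if isinstance(node, ast.BinOp):
        left, right = _eval(node.left), _eval(node.right)
        if isinstance(node.op, ast.Add):
            return left + right
        if isinstance(node.op, ast.Sub):
            return left - right
        if isinstance(node.op, ast.Mult):
            return left * right
        if isinstance(node.op, ast.Div):
            return left / right  # Fraction raises ZeroDivisionError on /0
    raise ValueError("unsupported expression")


def is_solved(expr, numbers):
    """True if expr uses each input number exactly once and equals 24."""
    used = sorted(int(tok) for tok in re.findall(r"\d+", expr))
    if used != sorted(numbers):
        return False
    try:
        value = _eval(ast.parse(expr.strip(), mode="eval"))
    except (ValueError, SyntaxError, ZeroDivisionError):
        return False
    return value == 24


def success_rate(attempts):
    """Fraction of (expression, numbers) pairs that count as solved."""
    solved = sum(is_solved(expr, nums) for expr, nums in attempts)
    return solved / len(attempts)


attempts = [
    ("(10 - 4) * (13 - 9)", [4, 9, 10, 13]),  # valid: 6 * 4
    ("4 * 6", [4, 9, 10, 13]),                # reaches 24 but uses the wrong numbers
]
print(success_rate(attempts))
```

Exact rational arithmetic via `Fraction` is a design choice here so that solutions relying on intermediate fractions, such as `8 / (3 - 8 / 3)`, are not rejected because of floating-point rounding.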

Evaluation Results

Performance of each model on this benchmark

| Model Name | Success | Paper Title | Repository |
| --- | --- | --- | --- |
| Tree of Thoughts (b=5) | 0.74 | Tree of Thoughts: Deliberate Problem Solving with Large Language Models | |