HyperAI
HyperAI超神经
首页
算力平台
文档
资讯
论文
教程
数据集
百科
SOTA
LLM 模型天梯
GPU 天梯
顶会
开源项目
全站搜索
关于
服务条款
隐私政策
中文
HyperAI
HyperAI超神经
Toggle Sidebar
全站搜索…
⌘
K
Command Palette
Search for a command to run...
算力平台
首页
SOTA
强化学习常识推理
Commonsense Rl On Commonsense Rl
Commonsense Rl On Commonsense Rl
评估指标
Avg #Steps
评测结果
各个模型在此基准测试上的表现结果
Columns
模型名称
Avg #Steps
Paper Title
KG-A2C
49.36 ± 7.50
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
LSTM-A2C
49.21 ± 0.58
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
TNC-A2C
43.27 ± 0.70
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
Human
15.00 ± 3.29
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
Optimal
15.00 ± 2.00
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
0 of 5 row(s) selected.
Previous
Next