
Commonsense RL on Commonsense RL

Evaluation Metric

Avg #Steps

Evaluation Results

Performance of each model on this benchmark

Model | Avg #Steps | Paper Title | Repository
Human | 15.00 ± 3.29 | Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | -
Optimal | 15.00 ± 2.00 | Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | -
KG-A2C | 49.36 ± 7.50 | Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | -
TNC-A2C | 43.27 ± 0.70 | Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | -
LSTM-A2C | 49.21 ± 0.58 | Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | -