Question Answering on StrategyQA

Evaluation Metric

Accuracy

Evaluation Results

Performance of each model on this benchmark

| Model | Accuracy | Paper Title | Repository |
|---|---|---|---|
| Rethinking with retrieval (GPT-3) | 77.73 | Rethinking with Retrieval: Faithful Large Language Model Inference | - |
| SearchChain | - | Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | - |
| PaLM 540B | 76.4 | Transcending Scaling Laws with 0.1% Extra Compute | - |
| Least-to-Most | - | Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | - |
| Self-Evaluation Guided Decoding (Codex, CoT, single reasoning chain, 6-shot gen, 4-shot eval) | 77.2 | - | - |
| SearchChain | - | Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks | - |
| CoA w/o actions | - | Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | - |
| CoA | - | Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | - |
| Least-to-Most | - | Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | - |
| U-PaLM 540B | 76.6 | Transcending Scaling Laws with 0.1% Extra Compute | - |
| Minerva 540B | 61.9 | Transcending Scaling Laws with 0.1% Extra Compute | - |
| PaLM 2 (few-shot, CoT, SC) | 90.4 | PaLM 2 Technical Report | - |