BriefGPT.xyz
Oct, 2022
挑战BIG-Bench任务及连贯思维是否能解决它们
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
HTML
PDF
Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay...
TL;DR
评估语言模型的任务套件BIG-Bench在多步推理方面的表现,通过应用“chain-of-thought”提示,可以提高模型性能,证明多数任务要求此类提示以获得更好的性能,并且此提示与模型规模具有交互作用。
Abstract
big-bench
(Srivastava et al., 2022) is a diverse evaluation suite that focuses on tasks believed to be beyond the capabilities of current
language models
.
→