BriefGPT.xyz
Aug, 2023
TaskLAMA: 探究语言模型的复杂任务理解能力
TaskLAMA: Probing the Complex Task Understanding of Language Models
HTML
PDF
Quan Yuan, Mehran Kazemi, Xin Xu, Isaac Noble, Vaiva Imbrasaite...
TL;DR
通过使用大型语言模型,我们从高质量的人工标注数据集中提取知识,并引入了新的评估指标,发现结构化复杂任务分解能够有效地将复杂任务分解为个别步骤,相对于基准实验的最大改进幅度为280%,但在预测两两时间依赖性方面仍存在困难。
Abstract
structured complex task decomposition
(SCTD) is the problem of breaking down a complex real-world task (such as planning a wedding) into a
directed acyclic graph
over individual steps that contribute to achieving
→