对大型语言模型在受控生成任务中的评估

Oct, 2023

对大型语言模型在受控生成任务中的评估

Evaluating Large Language Models on Controlled Generation Tasks

Jiao Sun, Yufei Tian, Wangchunshu Zhou, Nan Xu, Qian Hu...

TL;DR大型语言模型在生成任务中的可控性和精细硬性约束方面存在挑战。

Abstract

While recent studies have looked into the abilities of large language models in various benchmark tasks, including question generation, reading comprehension, multilingual and etc, there have been few studies looking into the →