BriefGPT.xyz
Mar, 2024
Towards a Zero-Data, Controllable, Adaptive Dialog System
Dirk Väth, Lindsey Vanderlyn, Ngoc Thang Vu
TL;DR
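This work applies conversational tree search to controllable dialog systems, where a dialog tree shapes the behavior of a reinforcement learning agent. It finds that agents trained on synthetic data generated from the dialog tree can match models trained on human data in terms of dialog success.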
Abstract
Conversational tree search (Väth et al., 2023) is a recent approach to controllable dialog systems, where domain experts shape the behavior of a