BriefGPT.xyz
Jul, 2022
使用强化学习进行开放式对话的动态规划
Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning
HTML
PDF
Deborah Cohen, Moonkyung Ryu, Yinlam Chow, Orgad Keller, Ido Greenberg...
TL;DR
本研究利用强化学习技术结合最先进的自然语言理解模型创造了一个实时的对话系统,并在使用谷歌智能助手的实验中,使用众包数据进行训练,显著超越了强化模型,证明其对于自然人对话有较高的开放性和可行性。
Abstract
Despite recent advances in
natural language understanding
and generation, and decades of research on the development of
conversational bots
, building automated agents that can carry on rich open-ended conversatio
→