基于熵的数据过滤来提升神经对话模型

May, 2019

基于熵的数据过滤来提升神经对话模型

Improving Neural Conversational Models with Entropy-Based Data Filtering

Richard Csaky, Patrik Purgai, Gabor Recski

TL;DR采用基于熵的方法，从对话数据集中过滤通用语句，以改善聊天机器人生成开放式回复时的多样性。通过17种评估指标的比较，我们证明使用经过此种过滤的数据集训练对话模型可以提高对话质量。

Abstract

Current neural-network based conversational models lack diversity and generate boring responses to open-ended utterances. priors such as persona, emotion, or topic provide additional information to dialog models to aid response generation, but annotating a dataset with →