Neural generative models have achieved promising performance on dialog generation tasks if given a huge data set. However, the lack of high-quality dialog data and the expensive data annotation process greatly limit their application in real-world settings. We propose a paraphrase augmented response generation (PARG) framework that jointly trains a paraphrase model and a response generation model to improve the dialog generation performance. We also design a method to automatically construct paraphrase training data set based on dialog state and dialog act labels. PARG is applicable to various dialog generation models, such as TSCP (Lei et al., 2018) and DAMD (Zhang et al., 2019). Experimental results show that the proposed framework improves these state-of-the-art dialog models further on CamRest676 and MultiWOZ. PARG also significantly outperforms other data augmentation methods in dialog generation tasks, especially under low resource settings.

该研究提出了一种基于替换词增强的响应生成(PARG)框架，该框架联合训练了一个替换模型和一个响应生成模型，以提高对话生成的性能，并通过对话状态和对话行为标签自动构建替换培训数据集。实验结果表明，所提出的框架进一步改善了CamRest676和MultiWOZ上最先进的对话模型，并在对话生成任务中显着优于其他数据增强方法，特别是在资源不足的情况下。

基於改寫的任務導向對話生成