TL;DR本文提出了一种基于 SWAN(Schematised Weighted Average Nugget)算法的文本对话系统审计通用框架并给出了一些相关参数,从而预防对话系统对用户和社会产生负面影响。
Abstract
We present a simple and generic framework for auditing a given textual conversational system, given some samples of its conversation sessions as its input. The framework computes a SWAN (Schematised Weighted Average Nugget) score based on →