Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

May, 2021

Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

Anne Beyer, Sharid Loáiciga, David Schlangen

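TL;DR: This paper designs a series of test sets to evaluate whether neural language models encode complex, context-dependent linguistic structure such as logical relations, internal consistency, and world knowledge. The study finds that test sets of this kind allow the quality of language models to be assessed more effectively.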

Abstract

Coherent discourse is distinguished from a mere collection of utterances by the satisfaction of a diverse set of constraints, for example choice of expression, logical relation between denoted events, and implicit compatibility with world-knowledge. Do →