BriefGPT.xyz
Sep, 2021
LM-Critic: 无监督语法错误修正的语言模型
LM-Critic: Language Models for Unsupervised Grammatical Error Correction
HTML
PDF
Michihiro Yasunaga, Jure Leskovec, Percy Liang
TL;DR
本文介绍了如何使用预训练语言模型来识别语法是否正确并使用 Break-It-Fix-It 框架进行训练。通过在多个领域的数据集上进行实验,我们发现这种方法在无监督学习和有监督学习下都优于现有方法。
Abstract
Training a model for
grammatical error correction
(GEC) requires a set of labeled ungrammatical / grammatical sentence pairs, but manually annotating such pairs can be expensive. Recently, the
break-it-fix-it
(BI
→