A Minimal-Pair Benchmark for the English Language: BLiMP

Dec, 2019

BLiMP: A Benchmark of Linguistic Minimal Pairs for English

Alex Warstadt, Alicia Parrish, Haokun Liu, Anhad Mohananey, Wei Peng...

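TL;DR: BLiMP is a challenge set for evaluating how well language models understand the major grammatical phenomena of English. The study finds that existing models identify morphological contrasts reliably, but still struggle with the distribution of quantifiers and negative polarity items and with subtle syntactic phenomena such as extraction islands.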

Abstract

We introduce The Benchmark of Linguistic Minimal Pairs (shortened to BLiMP), a challenge set for evaluating what language models (LMs) know about major ...