BriefGPT.xyz
Mar, 2020
使用Likert量表数据进行词汇复杂度预测的新语料库CompLex
CompLex --- A New Corpus for Lexical Complexity Predicition from Likert Scale Data
HTML
PDF
Matthew Shardlow, Michael Cooper, Marcos Zampieri
TL;DR
本文介绍了第一个英语数据集,以连续的词汇复杂度预测为目标,通过使用一种5点Likert量表方案,注释文本中来自三个领域的复杂单词并得出: 9,476个句子的语料库。
Abstract
Predicting which words are considered hard to understand for a given target population is a vital step in many
nlp
applications such as
text simplification
. This task is commonly referred to as
→