Mukayese：土耳其自然语言处理反击

Mar, 2022

Mukayese：土耳其自然语言处理反击

Mukayese: Turkish NLP Strikes Back

Ali Safaya, Emirhan Kurtuluş, Arda Göktoğan, Deniz Yuret

TL;DR本文主要介绍了一个名为Mukayese的NLP基准集，它为土耳其语提供了语言建模、句子段落化和拼写检查等多项基准测试，并且为每个基准测试提供多个数据集和基准值。

Abstract

Having sufficient resources for language X lifts it from the under-resourced languages class, but not necessarily from the under-researched class. In this paper, we address the problem of the absence of organized benchmarks in the turkish language. We demonstrate that languages such as