BriefGPT.xyz
Aug, 2023
僧伽罗语-英语平行词典数据集
Sinhala-English Parallel Word Dictionary Dataset
HTML
PDF
Kasun Wickramasinghe, Nisansa de Silva
TL;DR
为了解决低资源语言缺乏人工标注的问题,本研究提出了三个用于英语和僧伽罗语自然语言处理任务的平行英-僧伽罗词典数据集,并介绍了数据集创建流程和验证数据集质量的实验结果。
Abstract
parallel datasets
are vital for performing and evaluating any kind of multilingual task. However, in the cases where one of the considered language pairs is a
low-resource language
, the existing top-down parallel
→