BriefGPT.xyz
Sep, 2024
AraDiCE:大型语言模型的方言和文化能力基准
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
HTML
PDF
Basel Mousi, Nadir Durrani, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain...
TL;DR
本研究旨在解决阿拉伯语在大型语言模型中方言表现不足的问题,提出了七个合成数据集,并创建了AraDiCE基准,以评估阿拉伯方言和文化意识。研究发现,虽然特定阿拉伯模型在方言任务上表现优于多语言模型,但在方言识别和生成方面仍面临重大挑战,从而彰显了定制训练的重要性。
Abstract
Arabic
, with its rich diversity of dialects, remains significantly underrepresented in Large
Language Models
, particularly in dialectal variations. We address this gap by introducing seven synthetic datasets in d
→