BriefGPT.xyz
Feb, 2024
Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking
Yi Fung, Ruining Zhao, Jae Doo, Chenkai Sun, Heng Ji
TL;DR
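By navigating from Wikipedia documents to their linked pages, the authors build a method for collecting diverse multicultural knowledge and use it to construct the CultureAtlas dataset, which covers a wide range of sub-country geographic regions and ethnolinguistic groups. The dataset supports evaluating language models in multicultural contexts and developing culturally sensitive and culturally aware language models, with the goal of a more inclusive and balanced representation of global cultures in the digital domain.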
Abstract
Pretrained large language models have revolutionized many applications but still face challenges related to cultural bias and a lack of cultural ...