BriefGPT.xyz
Nov, 2023
CPopQA: Ranking Cultural Concept Popularity by LLMs
Ming Jiang, Mansi Joshi
TL;DR
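This study introduces a new few-shot question-answering task (CPopQA) to evaluate how well large language models (LLMs) can statistically rank long-tail cultural concepts (such as holidays), focusing on the popularity of these concepts in the United States and the United Kingdom. It finds that GPT-3.5 shows superior performance in identifying geo-cultural proximity across continents.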
Abstract
Prior work has demonstrated large language models' (LLMs) potential to discern statistical tendencies within their pre-training corpora. Despite that, many examinations of LLMs' knowledge capacity focus on knowle