BriefGPT.xyz
Jun, 2024
易于语言模型的是哪些语言?从学习概率正则语言的角度看
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages
HTML
PDF
Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak...
TL;DR
大规模语言模型的学习能力主要集中在概率语言的学习上,其中正则语言模型的等级和样本字符串的预期长度是学习能力的重要预测因子。
Abstract
What can
large language models
learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way of addressing the above question is to formalize it as a matter of
learnability
→