BriefGPT.xyz
Mar, 2024
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs
Kanishka Misra, Kyle Mahowald
TL;DR
By generalizing from less rare phenomena, language models can learn rare grammatical phenomena rather than relying solely on rote memorization.
Abstract
Language models learn rare syntactic phenomena, but it has been argued that they rely on rote memorization, as opposed to grammatical generalization ...