June 2024

Distributed Rule Vectors in the In-Context Learning of Large Language Models

TL;DR: Large language models perform in-context learning through an information aggregation mechanism in which task vectors are not present; instead, rule vectors encode high-level abstractions of the rules extracted from multiple demonstrations.