Oct 2022

Transcending Scaling Laws with 0.1% Extra Compute

TL;DR: The UL2R method improves the scaling properties of language models with minimal extra compute, demonstrating emergent abilities on challenging BIG-Bench NLP tasks and outperforming PaLM on many few-shot setups.