BriefGPT.xyz
Nov, 2024
学习动态揭示大型语言模型推理中的泛化机制
What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
HTML
PDF
Katie Kang, Amrith Setlur, Dibya Ghosh, Jacob Steinhardt, Claire Tomlin...
TL;DR
本研究探讨了大型语言模型(LLM)微调过程中学习动态对后续泛化的影响,特别是在推理任务中。通过引入“预记忆训练准确度”这一训练指标,本文表明该指标能有效预测测试准确度并指导数据选择,从而在数据效率上实现显著提升。
Abstract
Despite the remarkable capabilities of modern
Large Language Models
(LLMs), the mechanisms behind their problem-solving abilities remain elusive. In this work, we aim to better understand how the
Learning Dynamics
→