BriefGPT.xyz
May, 2024
数据重复有利于 SGD 学习高维多索引函数
Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions
HTML
PDF
Luca Arnaboldi, Yatin Dandi, Florent Krzakala, Luca Pesce, Ludovic Stephan
TL;DR
神经网络通过高维嘈杂的数据识别低维相关结构,我们对其工作原理的数学理解仍然有限。本文研究了使用基于梯度的算法训练的两层浅层神经网络的训练动态,并讨论了它们在具有低维相关方向的多指标模型中学习相关特征的方式。
Abstract
neural networks
can identify low-dimensional
relevant structures
within high-dimensional noisy data, yet our mathematical understanding of how they do so remains scarce. Here, we investigate the
→