跨尺度泛化误差的建设性预测

Sep, 2019

A Constructive Prediction of the Generalization Error Across Scales

Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit

TL;DR本论文提出基于模型缩放的方法来构建适合各类模型和数据规模的函数形式，针对神经网络的泛化误差进行观测并给出了精确预测。

Abstract

The dependency of the generalization error of neural networks on model and dataset size is of critical importance both in practice and for understanding the theory of