BriefGPT.xyz
Nov, 2020
理解双重下降需要进行精细的偏差-方差分解
Understanding Double Descent Requires a Fine-Grained Bias-Variance Decomposition
HTML
PDF
Ben Adlam, Jeffrey Pennington
TL;DR
通过对方差进行可解释的对称分解,探讨了深度学习算法的偏差与方差之间的关系,发现随着网络宽度的增加,偏差单调下降,但方差存在非单调行为,并可以通过集成学习消除互作用导致的方差发散。
Abstract
Classical learning theory suggests that the optimal generalization performance of a
machine learning
model should occur at an intermediate model complexity, with simpler models exhibiting high
bias
and more compl
→