BriefGPT.xyz
May, 2023
ResNets 中的残留缩放优化信号传播
Optimal signal propagation in ResNets through residual scaling
HTML
PDF
Kirsten Fischer, David Dahmen, Moritz Helias
TL;DR
通过有限尺寸理论,研究残差网络的信号传播及其依赖残差分支的伸缩,发现最优伸缩参数范围在最大灵敏度范围内,并给出一个理论框架指导ResNets的最优伸缩。
Abstract
residual networks
(ResNets) have significantly better trainability and thus performance than
feed-forward networks
at large depth. Introducing
sk
→