Infinitely deep neural networks as diffusion processes

May, 2019

Neural Stochastic Differential Equations

Stefano Peluchetti, Stefano Favaro

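TL;DR: This paper studies the depth of neural networks from a distributional perspective. By introducing stochastic differential equations, it addresses problems that arise as layers are stacked, such as the vanishing dependence on the input and the restriction to limited families of functions.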

Abstract

Deep neural networks whose parameters are distributed according to typical initialization schemes exhibit undesirable properties that can emerge as the number of layers increases. These issues include a vanishing dependency on the input and a concentration on restrictive families of fu