学习大型复杂模型的相对自然梯度

Jun, 2016

Relative Natural Gradient for Learning Large Complex Models

Ke Sun, Frank Nielsen

TL;DR通过提取神经元系统的局部组件，定义相对的费舍尔信息度量并演示了如何利用这一概念进一步改进优化，提高神经网络的学习效果。

Abstract

fisher information and natural gradient provided deep insights and powerful tools to artificial neural networks. However related analysis