BriefGPT.xyz
Nov, 2016
具体分布:离散随机变量的连续松弛
The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables
HTML
PDF
Chris J. Maddison, Andriy Mnih, Yee Whye Teh
TL;DR
该论文提供了一种通过引入Concrete随机变量的连续放松方法解决离散状态下无法使用重参数化技巧的问题,使得在离散计算图上也能有效地使用自动微分来产生低方差偏向梯度和低方差无偏梯度以优化损失函数。
Abstract
The
reparameterization trick
enables the optimization of large scale
stochastic computation graphs
via gradient descent. The essence of the trick is to refactor each stochastic node into a differentiable function
→