BriefGPT.xyz
Jun, 2021
Coupled Gradient Estimators for Discrete Latent Variables
Zhe Dong, Andriy Mnih, George Tucker
TL;DR
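This work proposes a new estimator built on importance sampling and statistical couplings, reparameterizes categorical variables as sequences of binary variables, and applies Rao-Blackwellization. The results indicate that the method achieves state-of-the-art performance when training models with discrete latent variables.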
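To make the idea of writing a categorical variable as a sequence of binary variables concrete, here is a minimal sketch. It assumes a stick-breaking encoding, which is only one possible choice; this summary does not say which encoding the authors use. The function names (`stick_breaking_probs`, `sample_binary_sequence`, `decode`) are hypothetical and exist only for illustration.

```python
import random


def stick_breaking_probs(probs):
    """Conditional Bernoulli probabilities q_i = p_i / (1 - sum_{j<i} p_j).

    Assumed stick-breaking encoding: bit i answers "is the category i,
    given that it is not one of 0..i-1?". A K-way categorical needs K-1 bits.
    """
    remaining = 1.0
    qs = []
    for p in probs[:-1]:
        qs.append(p / remaining if remaining > 0 else 1.0)
        remaining -= p
    return qs


def sample_binary_sequence(probs, rng):
    """Draw K-1 independent Bernoulli bits with the stick-breaking probabilities."""
    return [1 if rng.random() < q else 0 for q in stick_breaking_probs(probs)]


def decode(bits):
    """Map a bit sequence back to a category: index of the first 1, else the last category."""
    for i, b in enumerate(bits):
        if b == 1:
            return i
    return len(bits)


if __name__ == "__main__":
    rng = random.Random(0)
    probs = [0.2, 0.3, 0.5]
    counts = [0] * len(probs)
    n = 20000
    for _ in range(n):
        counts[decode(sample_binary_sequence(probs, rng))] += 1
    # Empirical frequencies should be close to the original categorical probabilities.
    print([c / n for c in counts])
```

Bits after the first 1 are ignored by `decode`, so the decoded category follows the original categorical distribution even though every bit is sampled independently. Once the variable is expressed in binary form, estimators designed for binary variables can in principle be applied to each bit.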
Abstract
Training models with discrete latent variables is challenging due to the high variance of unbiased gradient estimators. While low-variance reparameterization gradients of a continuous relaxation can provide an ef