Plackett-Luce分布的低方差黑盒梯度估计

Nov, 2019

Plackett-Luce分布的低方差黑盒梯度估计

Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution

Artyom Gadetsky, Kirill Struminsky, Christopher Robinson, Novi Quadrianto, Dmitry Vetrov

TL;DR本文提出了一种使用控制变量的Plackett-Luce分布来进行离散潜在变量学习模型的随机梯度下降的方法，能够在非可微、离散和连续数据的因果关系学习任务中胜过其他对比松弛优化方法。

Abstract

Learning models with discrete latent variables using stochastic gradient descent remains a challenge due to the high variance of gradient estimates. Modern variance reduction techniques mostly consider categorical distributions and have limited applicability when the number of possible