BriefGPT.xyz
May, 2019
可微分的游戏机制
Differentiable Game Mechanics
HTML
PDF
Alistair Letcher, David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster...
TL;DR
本文针对深度学习建立在梯度下降收敛局部极小值的基础上这一保证在生成对抗网络等存在多个交互损失的情况下失效问题,研究了N人不可微分博弈的动态性,提出了一种新的算法 Symplectic Gradient Adjustment (SGA) 可以在更一般的情境下应用,并有基于理论保证的鲁棒性。
Abstract
deep learning
is built on the foundational guarantee that gradient descent on an objective function converges to local minima. Unfortunately, this guarantee fails in settings, such as
generative adversarial nets
,
→