BriefGPT.xyz
Nov, 2018
可微分游戏中的稳定对手塑造
Stable Opponent Shaping in Differentiable Games
HTML
PDF
Alistair Letcher, Jakob Foerster, David Balduzzi, Tim Rocktäschel, Shimon Whiteson
TL;DR
该论文提出了稳定对手塑造方法,该方法通过插值实现了区分对手学习(LOLA)和稳定对手塑造的最佳属性,并在可微分游戏中表现出卓越的性能。
Abstract
A growing number of learning methods are actually \emph{games} which optimise multiple, interdependent objectives in parallel -- from GANs and intrinsic curiosity to
multi-agent rl
.
opponent shaping
is a powerful
→