BriefGPT.xyz
Oct, 2020
理解 SPIGOT 的机制:用作潜在结构学习的替代导数
Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning
HTML
PDF
Tsvetomila Mihaylova, Vlad Niculae, André F. T. Martins
TL;DR
本文讨论了拉回下游学习目标方法来探索潜在结构学习的原理,从而发现了STE和SPIGOT的原则动机,这导致了相同家族中的新算法,并将已知的和新的拉回估计器与流行的选择进行了实证比较,为实践者提供了新的见识,并揭示了有趣的失败案例。
Abstract
latent structure models
are a powerful tool for modeling language data: they can mitigate the error propagation and annotation bottleneck in pipeline systems, while simultaneously uncovering linguistic insights about the data. One challenge with
→