BriefGPT.xyz
May, 2023
基于概率潜在表示的块局部学习
Block-local learning with probabilistic latent representations
HTML
PDF
David Kappel, Khaleelulla Khan Nazeer, Cabrel Teguemne Fokam, Christian Mayr, Anand Subramoney
TL;DR
通过引入双网络的反向传播方法和将网络中的层激活视作概率分布的参数,本文提出了一种解决反向传播中锁死和权重传输问题的新方法,从而实现对大型网络的分布式高效训练。相应的实验结果表明了其在多种任务和结构上的优越表现。
Abstract
The ubiquitous
backpropagation
algorithm requires sequential updates across blocks of a network, introducing a locking problem. Moreover,
backpropagation
relies on the transpose of weight matrices to calculate up
→