BriefGPT.xyz
Jan, 2023
The Power of External Memory in Increasing Predictive Model Capacity
Cenk Baykal, Dylan J Cutler, Nishanth Dikkala, Nikhil Ghosh, Rina Panigrahy...
TL;DR
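This paper presents a way to introduce sparsity into deep networks: an external table of parameters is attached to the network and sparsely looked up at its layers. It focuses on how to index into the table and how to use the retrieved contents, and it proposes a new alternating-update method that increases the token dimension and improves language modeling.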
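The sparse table lookup summarized above can be illustrated with a small sketch. This is not the authors' implementation: the sign-of-projection hashing used to pick a row, the additive way the row is combined with the hidden vector, and every name here (`make_table`, `make_hyperplanes`, `bucket_index`, `sparse_lookup_layer`) are assumptions chosen only to show the general idea that one layer touches a single row of a large table per input.

```python
import random


def make_table(num_rows, dim, seed=0):
    """Hypothetical external parameter table: num_rows vectors of length dim."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(num_rows)]


def make_hyperplanes(num_bits, dim, seed=1):
    """Fixed random hyperplanes used to hash a hidden vector (an assumption)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def bucket_index(hidden, hyperplanes):
    """Map a hidden vector to a row index from the signs of its projections."""
    index = 0
    for plane in hyperplanes:
        dot = sum(h * p for h, p in zip(hidden, plane))
        index = (index << 1) | (1 if dot >= 0 else 0)
    return index


def sparse_lookup_layer(hidden, table, hyperplanes):
    """Read exactly one table row and add it to the hidden vector."""
    row = table[bucket_index(hidden, hyperplanes)]
    return [h + r for h, r in zip(hidden, row)]


NUM_BITS, DIM = 4, 8
table = make_table(2 ** NUM_BITS, DIM)          # 16 rows, one per hash bucket
hyperplanes = make_hyperplanes(NUM_BITS, DIM)
hidden = [0.5, -1.0, 0.25, 0.0, 1.5, -0.75, 0.1, 0.9]
output = sparse_lookup_layer(hidden, table, hyperplanes)
```

In this sketch the table can grow (more hash bits, more rows) without increasing the work done per input, since each call still reads a single row. That is the sense in which the lookup is sparse.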
Abstract
One way of introducing sparsity into deep networks is by attaching an external table of parameters that is sparsely looked up at different layers of the network. By storing the bulk of the parameters in the exter