Distilling Word Embeddings: An Encoding Approach

June 2015

Distilling Word Embeddings: An Encoding Approach

Lili Mou, Ge Li, Yan Xu, Lu Zhang, Zhi Jin

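TL;DR: This paper proposes an encoding approach for distilling task-specific knowledge from high-dimensional word embeddings, motivated by the need for lightweight, high-performance neural networks in resource-restricted systems. Experiments show that distilling knowledge from cumbersome embeddings outperforms directly training neural networks with small embeddings, maintaining high accuracy while substantially reducing model complexity.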

Abstract

Distilling knowledge from a well-trained cumbersome network to a small one has recently become a new research topic, as lightweight neural networks with high performance are particularly in need in various resource-restricted systems. This paper addresses the problem of →