We first observe a potential weakness of continuous vector representations of
symbols in neural machine translation. Specifically, the continuous vector
representation of a symbol, that is, its word embedding vector, encodes multiple
dimensions of similarity, equivalent to encoding more than one me