code generation maps a program description to executable source code in a
programming language. Existing approaches mainly rely on a recurrent neural
network (RNN) as the decoder. However, we find that a program contains
significantly more tokens than a natural language sentence, and t