The dominant neural machine translation models are based on the
encoder-decoder structure, and many of them rely on an unconstrained receptive
field over source and target sequences. In this paper, we study a new
architecture that breaks with both conventions. Our simplified architectur