In the machine learning fields, recurrent neural network (RNN) has become a
popular architecture for sequential data modeling. However, behind the
impressive performance, RNNs require a large number of parameters for both
training and inference. In this paper, we are trying to reduce t