Pointer-generator networks have been used successfully for abstractive
summarization. In addition to generating novel words, they allow the model
to copy words from the input text, which lets it handle out-of-vocabulary words.
In this paper, we point out two key shortcomings of the summa