Collections of research article data harvested from the web have become
common recently since they are important resources for experimenting on tasks
such as named entity recognition, text summarization, or keyword generation. In
fact, certain types of experiments require collections t