TL;DR通过构建多语言数据集 Global Voices,提取社交网络数据以对 15 种语言中的英文总结进行低成本的评估。并通过对人类进行调查和筛选,研究了翻译质量对跨语言总结的影响,同时还对 ROUGE 指标进行了限制的分析。
Abstract
We construct Global Voices, a multilingual dataset for evaluating
cross-lingual summarization methods. We extract social-network descriptions of
Global Voices news articles to cheaply collect evaluation data for into-English
and from-English summarization in 15 languages. Especially, f