Emilio Ferrara, Pasquale De Meo, Giacomo Fiumara, Robert Baumgartner
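TL;DR: This paper surveys existing applications of Web data extraction, dividing them into enterprise-level and Social Web-level applications, and discusses their importance for business intelligence and data analytics as well as their potential for collecting and analyzing structured data from social media.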
Abstract
Web data extraction is an important problem that has been studied by means of
different scientific tools and in a broad range of applications. Many
approaches to extracting data from the Web have been designed to solve specific
problems and operate in ad-hoc domains. Other approaches,