BriefGPT.xyz
Oct, 2020
社交媒体医疗实体链接语料库COMETA
COMETA: A Corpus for Medical Entity Linking in the Social Media
HTML
PDF
Marco Basaldella, Fangyu Liu, Ehsan Shareghi, Nigel Collier
TL;DR
COMETA语料库是一个新的英文生物医学实体提及数据集,其对于医疗术语和语言的复杂性有很高的覆盖率和质量,作者通过2种不同的评估场景,对20种不同的实体链接方法进行测试,发现没有完美的方案,基于不同数据视角的特性融合是最佳方法。
Abstract
Whilst there has been growing progress in
entity linking
(EL) for general language, existing datasets fail to address the complex nature of
health terminology
in layman's language. Meanwhile, there is a growing n
→