BriefGPT.xyz
Feb, 2021
Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case
Adam Dahlgren Lindström, Suna Bensch, Johanna Björklund, Frank Drewes
TL;DR
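This paper proposes a method based on probing tasks: classifiers are trained on the embeddings in order to compare a range of recent text-image semantic embeddings. The approach exposes problems in these embeddings, and the paper proposes solutions to them. Experiments show that visual-semantic embeddings improve recognition accuracy by more than 12% over unimodal embeddings.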
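To make the idea of a probing task concrete, here is a minimal sketch, not the authors' setup. It assumes a logistic-regression probe and uses synthetic vectors in place of real frozen embeddings; the names `make_embeddings`, `train_probe`, and `probe_accuracy`, and the `signal` parameter, are hypothetical. The probe's held-out accuracy is read as a rough indication of how much of a property the embedding encodes.

```python
import math
import random


def make_embeddings(labels, signal, rng, dim=8):
    """Synthetic stand-in for frozen embeddings (an assumption, not real data).

    Dimension 0 is shifted by +/- `signal` depending on the label; all other
    dimensions are pure Gaussian noise.
    """
    vectors = []
    for t in labels:
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        v[0] += signal if t == 1 else -signal
        vectors.append(v)
    return vectors


def train_probe(X, y, epochs=50, lr=0.1):
    """Fit a logistic-regression probe with plain stochastic gradient descent."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for x, t in zip(X, y):
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            p = 1.0 / (1.0 + math.exp(-z))
            g = p - t  # gradient of the log loss with respect to z
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
            b -= lr * g
    return w, b


def accuracy(w, b, X, y):
    """Fraction of examples whose predicted class matches the label."""
    correct = 0
    for x, t in zip(X, y):
        z = sum(wi * xi for wi, xi in zip(w, x)) + b
        correct += int((z > 0) == (t == 1))
    return correct / len(y)


def probe_accuracy(signal, seed=0, n=200):
    """Train a probe on n examples and report accuracy on n held-out examples."""
    rng = random.Random(seed)
    y_train = [rng.randint(0, 1) for _ in range(n)]
    y_test = [rng.randint(0, 1) for _ in range(n)]
    X_train = make_embeddings(y_train, signal, rng)
    X_test = make_embeddings(y_test, signal, rng)
    w, b = train_probe(X_train, y_train)
    return accuracy(w, b, X_test, y_test)


if __name__ == "__main__":
    # An embedding that encodes the property versus one that does not.
    print("informative embedding:  ", probe_accuracy(signal=3.0))
    print("uninformative embedding:", probe_accuracy(signal=0.0))
```

Comparing the two accuracies mirrors, in miniature, how probing results for different embeddings can be set side by side: the embeddings stay fixed and only the small classifier is trained.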
Abstract
Semantic embeddings have advanced the state of the art for countless natural language processing tasks, and various extensions to multimodal domains, such as visual-semantic embeddings, have been proposed. While