BriefGPT.xyz
Jun, 2016
将文本描述转化为高层视觉表征
Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions
HTML
PDF
Fabio Carrara, Andrea Esuli, Tiziano Fagni, Fabrizio Falchi, Alejandro Moreo Fernández
TL;DR
本文介绍了一种利用神经网络模型Text2Vis在视觉特征空间中实现基于短文本描述信息的图像搜索方法,并通过针对文本和视觉损失函数的优化来提高搜索效率和精确度,并在MS-COCO数据集上进行了初步结果呈现。
Abstract
In this paper we tackle the problem of
image search
when the query is a short
textual description
of the image the user is looking for. We choose to implement the actual search process as a similarity search in a
→