BriefGPT.xyz
Aug, 2019
Language Features Matter: Effective Language Representations for Vision-Language Tasks
Andrea Burns, Reuben Tan, Kate Saenko, Stan Sclaroff, Bryan A. Plummer
TL;DR
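This paper studies how language and visual features should be handled in vision-language (VL) tasks and proposes best practices that give language a larger role: using an average-embedding language model, training on multiple tasks, and adopting the Graph Oriented Vision-Language Embedding (GrOVLE) to incorporate language features.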
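As a rough illustration of what an average-embedding language model means, the sketch below represents a sentence as the mean of its word vectors. The toy vectors, the `average_embedding` helper, and the zero-vector fallback for unknown words are all assumptions made for this example; they are not taken from the paper, which may use different embeddings and preprocessing.

```python
from typing import Dict, List

# Hypothetical 3-dimensional word vectors, for illustration only.
TOY_VECTORS: Dict[str, List[float]] = {
    "a": [0.0, 0.0, 1.0],
    "dog": [1.0, 0.0, 0.0],
    "runs": [0.0, 1.0, 0.0],
}


def average_embedding(
    tokens: List[str], vectors: Dict[str, List[float]], dim: int
) -> List[float]:
    """Return the element-wise mean of the vectors of all known tokens."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        # Assumption: fall back to a zero vector when no token is known.
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


# Word order is ignored: the sentence is just the mean of its word vectors.
sentence_vector = average_embedding("a dog runs".split(), TOY_VECTORS, 3)
```

In a real VL pipeline the resulting sentence vector would typically be passed through further layers before being compared with image features; that step is omitted here.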
Abstract
Shouldn't language and vision features be treated equally in vision-language (VL) tasks? Many VL approaches treat the language component as an afterthought, using simple language models that are either built upon fixed