Mar, 2022
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models
Feng Li, Hao Zhang, Yi-Fan Zhang, Shilong Liu, Jian Guo...
TL;DR
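This paper surveys vision-language intelligence chronologically. It summarizes three periods of development: task-specific methods, vision-language pre-training methods, and larger models empowered by large-scale weakly labeled data. It also discusses future trends.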
Abstract
This paper presents a comprehensive survey of vision-language (VL) intelligence from the perspective of time. This survey is inspired by the remarkable progress in both