BriefGPT.xyz
Oct, 2023
MiniGPT-v2:大型语言模型作为视觉语言多任务学习的统一接口
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
HTML
PDF
Jun Chen, Deyao Zhu, Xiaoqian Shen, Xiang Li, Zechun Liu...
TL;DR
利用MiniGPT-v2建立一个统一的界面,有效地处理各种视觉-语言任务,包括图像描述、视觉问答和视觉定位等,并通过使用唯一标识符提高模型在每个任务中的学习效率。
Abstract
large language models
have shown their remarkable capabilities as a general interface for various language-related applications. Motivated by this, we target to build a unified interface for completing many
vision-langu
→