BriefGPT.xyz
May, 2023
语言空间中的图像:探索大语言模型在视觉和语言任务中的适用性
Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks
HTML
PDF
Sherzod Hakimov, David Schlangen
TL;DR
本篇文章研究了如何通过联合对话模型和语言模型使其能够有效地处理视觉信息,解决了在有限样本时视觉-语言任务的问题,使输出更易于解释。
Abstract
large language models
have demonstrated robust performance on various language tasks using zero-shot or few-shot learning paradigms. While being actively researched,
multimodal models
that can additionally handle
→