BriefGPT.xyz
Jun, 2024
让任何多模态大型语言模型都能高效地进行上下文学习
AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-Context Learning
HTML
PDF
Jun Gao, Qian Qiao, Ziqiang Cao, Zili Wang, Wenjie Li
TL;DR
通过聚合多模态演示的图像信息到相应的语言部分的密集潜在空间,我们提出了一种称为AIM的通用轻量级框架来解决多模态ICL的两个问题。
Abstract
in-context learning
(ICL) facilitates Large Language Models (LLMs) exhibiting emergent ability on downstream tasks without updating billions of parameters. However, in the area of
multi-modal large language models
→