Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara
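TL;DR: By introducing learnable perturbations (prompts), we propose a method called Instruct Me More (InMeMo) that enhances the performance of visual in-context learning, improving mIoU scores on foreground segmentation and single object detection by 7.35 and 15.13, respectively.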
Abstract
Large-scale models trained on extensive datasets have emerged as the preferred approach due to their high generalizability across various tasks. In-context learning (ICL), a popular strategy in natural language