BriefGPT.xyz
Nov, 2023
视觉背景提示
Visual In-Context Prompting
HTML
PDF
Feng Li, Qing Jiang, Hao Zhang, Tianhe Ren, Shilong Liu...
TL;DR
本文介绍了一种通用的视觉上下文提示框架,以支持涂鸦、方框和点等各种提示类型,并进一步改进以支持任意数量的上下文。通过在COCO和SA-1B上进行联合训练,我们的模型在COCO上达到57.7 PQ,在ADE20K上达到23.2 PQ。
Abstract
in-context prompting
in large language models (LLMs) has become a prevalent approach to improve zero-shot capabilities, but this idea is less explored in the vision domain. Existing
visual prompting methods
focus
→