Hyojin Bahng, Ali Jahanian, Swami Sankaranarayanan, Phillip Isola
TL;DR通过视觉提示来适应视觉中的大规模模型,这种方法在适应预先训练模型方面非常有效。
Abstract
Prompting has recently become a popular paradigm for adapting language models to downstream tasks. Rather than fine-tuning model parameters or adding task-specific heads, this approach steers a model to perform a new task simply by adding a text prompt to the model's inputs. In this paper, we explore the question: can we create prompts with pixels instead? I