Feb, 2024
Point and Instruct: Enabling Precise Image Editing by Unifying Direct Manipulation and Text Instructions
Alec Helbling, Seongmin Lee, Polo Chau
TL;DR
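Combining direct manipulation with text instructions enables precise image editing. Users visually mark objects and locations, then refer to them in text instructions, pairing the descriptive power of natural language with the spatial precision of direct manipulation.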
Abstract
Machine learning has enabled the development of powerful systems capable of editing images from natural language instructions. However, in many common scenarios it is difficult for users to specify precise image