HyperAIHyperAI
2 months ago

Visual Prompting via Image Inpainting

Bar, Amir ; Gandelsman, Yossi ; Darrell, Trevor ; Globerson, Amir ; Efros, Alexei A.
Visual Prompting via Image Inpainting
Abstract

How does one adapt a pre-trained visual model to novel downstream taskswithout task-specific finetuning or any model modification? Inspired byprompting in NLP, this paper investigates visual prompting: given input-outputimage example(s) of a new task at test time and a new input image, the goal isto automatically produce the output image, consistent with the given examples.We show that posing this problem as simple image inpainting - literally justfilling in a hole in a concatenated visual prompt image - turns out to besurprisingly effective, provided that the inpainting algorithm has been trainedon the right data. We train masked auto-encoders on a new dataset that wecurated - 88k unlabeled figures from academic papers sources on Arxiv. We applyvisual prompting to these pretrained models and demonstrate results on variousdownstream image-to-image tasks, including foreground segmentation, singleobject detection, colorization, edge detection, etc.

Visual Prompting via Image Inpainting | Latest Papers | HyperAI