This paper studies the problem of object discovery -- separating objects from
the background without manual labels. Existing approaches utilize appearance
cues, such as color, texture, and location, to group pixels into object-like
regions. However, by relying on appearance alone, thes