The ability to ask questions is a powerful tool for gathering information in
order to learn about the world and resolve ambiguities. In this paper, we
explore the novel problem of generating discriminative questions to help
disambiguate visual instances. Our work can be seen as a complement and new
extension to the rich body of research on image captioning and que