Whether what you see in Figure 1 is a "flamingo" or a "bird" is the question
we ask in this paper. While fine-grained visual classification (FGVC) strives
to arrive at the former, for the majority of us non-experts, just "bird" would
probably suffice. The real question is therefore: