Fine-grained image classification has emerged as a significant challenge
because objects in such images exhibit small inter-class visual differences
but large variations in pose, lighting, and viewpoint. Most existing
work focuses on highly customized feature extraction via dee