In the last decade, convolutional neural networks have become gargantuan.
Pre-trained models, when used as initializers, make it possible to fine-tune
ever larger networks on small datasets. Consequently, not all the convolutional features
that these fine-tuned models detect are requisite for th