Mixture-of-Experts (MoE) models have shown promising potential for parameter-efficient scaling across various domains. However, their adoption in Computer Vision remains limited and often requires large-scale datasets comprising billions of samples. In this study, we investigate t