masked image modeling (MIM)-based models, such as SdAE, CAE, GreenMIM, and
MixAE, have explored different strategies to enhance the performance of Masked
Autoencoders (MAE) by modifying prediction, loss functions, or incorporating
additional architectural components. In this paper, we