multimodal machine learning has gained significant attention in recent years
due to its potential for integrating information from multiple modalities to
enhance learning and decision-making processes. However, it is commonly
observed that unimodal models outperform multimodal models,