Feb 2024

Multimodal Transformer with a Low-Computational-Cost Guarantee

TL;DR: Transformer-based models have significantly improved performance on multimodal understanding tasks, but they incur high computational cost. The Low-Cost Multimodal Transformer (LoCoMT) is introduced to reduce that cost while matching or outperforming existing models.