Oct, 2023

GraFT: Gradual Fusion Transformer for Multimodal Re-Identification

TL;DR: Gradual Fusion Transformer (GraFT) is proposed for multimodal object re-identification (ReID). It uses learnable fusion tokens to capture both modality-specific and object-specific features, and it optimizes the ReID feature embedding space through a novel training paradigm combined with an augmented triplet loss. GraFT is reported to surpass established ReID benchmarks, and integrated neural network pruning lets it trade off model size against performance.
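
For context on the triplet loss mentioned above, here is a minimal sketch of the standard triplet margin loss in plain Python. It is not GraFT's augmented variant, whose details are not given in this summary. The function names, the choice of Euclidean distance, and the margin of 0.3 are assumptions made for illustration only.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.3):
    """Standard triplet margin loss (illustrative, not the paper's augmented form).

    Penalizes the embedding when the anchor is not closer to the positive
    (same identity) than to the negative (different identity) by at least
    `margin`. The margin value here is an assumed placeholder.
    """
    d_ap = euclidean(anchor, positive)
    d_an = euclidean(anchor, negative)
    return max(0.0, d_ap - d_an + margin)


# Toy 2-D embeddings (hypothetical values).
anchor = [0.0, 0.0]
positive = [0.0, 1.0]   # distance 1 from the anchor
negative = [3.0, 4.0]   # distance 5 from the anchor

# Well-separated triplet: the margin is already satisfied, so the loss is zero.
loss_easy = triplet_loss(anchor, positive, negative)

# Swapping the roles violates the margin and yields a positive loss.
loss_hard = triplet_loss(anchor, negative, positive)
```

In a ReID setting the three vectors would be feature embeddings of images, with the anchor and positive sharing an identity. Minimizing this loss pulls same-identity embeddings together and pushes different identities apart.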