BriefGPT.xyz
Feb, 2024
TAMM:三适配器多模态学习用于3D形状理解
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
HTML
PDF
Zhihao Zhang, Shengcao Cao, Yu-Xiong Wang
TL;DR
通过TriAdapter Multi-Modal Learning(TAMM),在多模态预训练中引入了三个协同适配器,以更有效地利用2D图像和语言模态,缩小3D形状数据集的规模限制,提高对3D形状的理解和表示学习。
Abstract
The limited scale of current 3D shape datasets hinders the advancements in
3d shape understanding
, and motivates
multi-modal learning
approaches which transfer learned knowledge from data-abundant 2D image and la
→