关键词3d vision-language grounding
搜索结果 - 2
  • SceneVerse:面向基于场景的三维视觉语言学习的规模化
    PDF5 months ago
  • 3D-VisTA: 预训练的 Transformer 用于 3D 视觉和文本对齐
    PDFa year ago
Prev
Next