BriefGPT.xyz
May, 2024
Reason3D:基于大规模语言模型的3D分割搜索和推理
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
HTML
PDF
Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang
TL;DR
Reason3D是一种新型的多模态大型语言模型,通过点云数据和文本提示作为输入,生成文本回答和分割遮罩,实现3D推理分割、分层搜索、精确引用和问题回答等高级任务。
Abstract
Recent advancements in
multimodal large language models
(LLMs) have shown their potential in various domains, especially concept reasoning. Despite these developments, applications in understanding
3d environments
→