BriefGPT.xyz
Apr, 2024
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms
Diandian Guo, Manxi Lin, Jialun Pei, He Tang, Yueming Jin...
TL;DR
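The TriTemp-OR framework integrates three modalities (images, point clouds, and language), incorporates temporal dynamics, and draws on large language models to build a comprehensive understanding of surgical scenes, predicting relations and generating scene graphs.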
Abstract
A comprehensive understanding of surgical scenes allows for monitoring of the surgical process, reducing the occurrence of accidents and enhancing efficiency for medical professionals. Semantic modeling within op…