BriefGPT.xyz
Dec, 2023
将手术视频编码为隐式时空图,用于对象与解剖驱动的推理
Encoding Surgical Videos as Latent Spatiotemporal Graphs for Object and Anatomy-Driven Reasoning
HTML
PDF
Aditya Murali, Deepak Alapatt, Pietro Mascagni, Armine Vardazaryan, Alain Garcia...
TL;DR
利用潜在时空图对外科视频进行建模,以表示其组成的解剖结构和工具随时间的变化,通过添加长期时间边增加对手术场景演化的建模,并引入新颖的图编辑模块,评估了两项下游任务,取得了强大的结果,证明了学到的表示的质量和灵活性。
Abstract
Recently,
spatiotemporal graphs
have emerged as a concise and elegant manner of representing video clips in an object-centric fashion, and have shown to be useful for
downstream tasks
such as action recognition.
→