BriefGPT.xyz
May, 2024
CLIP中的语言增强技术对多模态医学图像的改进解剖检测
Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images
HTML
PDF
Mansi Kakkar, Dattesh Shanbhag, Chandan Aladahalli, Gurunath Reddy M
TL;DR
使用多模态的医学影像,利用视觉语言模型(CLIP)自动生成整体身体的标准化分区和器官列表,相较于基线模型(PubMedCLIP),提高性能达到47.6%。
Abstract
vision-language models
have emerged as a powerful tool for previously challenging
multi-modal classification
problem in the medical domain. This development has led to the exploration of
→