BriefGPT.xyz
Mar, 2024
通过梯度引导的模型扰动增强医学视觉问答任务的泛化能力
Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation
HTML
PDF
Gang Liu, Hongyang Li, Zerui He, Shenjun Zhong
TL;DR
通过利用预训练的视觉语言模型,并结合数据增强、正则化方法以及基于梯度引导的参数扰动,该研究提出了一种改善医学可视化问答任务的模型泛化性能的方法,并在两个数据集上获得了有竞争力的结果。
Abstract
Leveraging
pre-trained visual language models
has become a widely adopted approach for improving performance in downstream visual question answering (VQA) applications. However, in the specialized field of
medical vqa
→