预训练的ViT模型在医疗图像中得到了多用途的表示

Mar, 2023

预训练的ViT模型在医疗图像中得到了多用途的表示

Pretrained ViTs Yield Versatile Representations For Medical Images

Christos Matsoukas, Johan Fredin Haslum, Magnus Söderberg, Kevin Smith

TL;DR本研究探讨了视觉 Transformer 在医学图像分类中的优劣，并发现使用预训练模型时，视觉 Transformer 可以与卷积神经网络媲美，成为 CNN 的一种可行替代方法。

Abstract

convolutional neural networks (CNNs) have reigned for a decade as the de facto approach to automated medical image diagnosis, pushing the state-of-the-art in classification, detection and segmentation tasks. Over