BriefGPT.xyz
Nov, 2021
Florence: A New Foundation Model for Computer Vision
Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai...
TL;DR
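This study introduces Florence, a computer vision foundation model that learns universal visual-language representations from web-scale image-text data. It can be readily adapted to a range of computer vision tasks, such as classification, retrieval, object detection, image captioning, video retrieval, and action recognition, and achieves state-of-the-art results on many transfer-learning benchmarks.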
Abstract
Automated visual understanding of our diverse and open world demands computer vision models to generalize well with minimal customization for specific tasks, similar to human vision.
computer vision foundation mo