Qwen-VL: 具备多功能能力的前沿大规模视觉语言模型

Aug, 2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

Jinze Bai, Shuai Bai, Shusheng Yang, Shijie Wang, Sinan Tan...

TL;DR介绍了Qwen-VL系列，这是一组大规模视觉语言模型，旨在感知和理解文本和图像，以提高多模态人工智能的性能。

Abstract

We introduce the qwen-vl series, a set of large-scale vision-language models designed to perceive and understand both text and images. Com