BriefGPT.xyz
Jul, 2021
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation
Jing Liu, Xinxin Zhu, Fei Liu, Longteng Guo, Zijia Zhao...
TL;DR
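This paper proposes a cross-modal omni-perception pre-trainer that uses a multi-task pretraining strategy to learn cross-modal understanding and generation over images, text, and audio from data at different granularities.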
Abstract
In this paper, we propose an omni-perception pre-trainer (OPT) for cross-modal understanding and generation, by jointly modeling visual, t...