BriefGPT.xyz
Apr, 2023
MoMo: 一种用于文本、图像和多模态表示的共享编码器模型
MoMo: A shared encoder Model for text, image and multi-Modal representations
HTML
PDF
Rakesh Chada, Zhaoheng Zheng, Pradeep Natarajan
TL;DR
本文提出了一种自主监督的共享编码器模型,在数据、内存和运行时效率高的同时,在几个视觉、语言和多模式基准测试中取得了强大结果。
Abstract
We propose a
self-supervised shared encoder model
that achieves strong results on several visual, language and
multimodal
benchmarks while being data, memory and run-time efficient. We make three key contribution
→