BriefGPT.xyz
Jun, 2023
A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation
Jeremy Gwinnup, Kevin Duh
TL;DR
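This survey reviews the literature on vision-language pre-training from the perspective of multimodal machine translation, examining common architectures, pre-training objectives, and datasets, and explores how large pre-trained models can be applied to multimodal machine translation tasks.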
Abstract
Large language models such as BERT and the GPT series started a paradigm shift that calls for building general-purpose models via pre-training on large datasets, followed by fine-tuning on task-specific datasets.