CVPR, Apr 2024
Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning
Rongjie Li, Yu Wu, Xuming He
TL;DR: A second-stage tuning guided by Image-Conditioned Caption Correction (ICCC) improves zero-shot vision-language reasoning performance.