BriefGPT.xyz
Jun, 2023
视觉推理与基础合理性:看、记住和推理
Look, Remember and Reason: Visual Reasoning with Grounded Rationales
HTML
PDF
Apratim Bhattacharyya, Sunny Panchal, Mingu Lee, Reza Pourreza, Pulkit Madan...
TL;DR
该研究旨在通过模仿人类视觉问题解决中的“看、记住、推理”模式,引入基于视觉输入的原理来整合低级视觉能力,使现有的大型语言模型能够在视觉推理问题上取得竞争性表现。
Abstract
large language models
have recently shown human level performance on a variety of reasoning tasks. However, the ability of these models to perform complex
visual reasoning
has not been studied in detail yet. A ke
→