BriefGPT.xyz
Dec, 2021
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Yue He, Chen Chen, Jing Zhang, Juhua Liu, Fengxiang He...
TL;DR
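This work proposes a graph convolutional network-based textual reasoning (GTR) method and applies it to scene text recognition. The method exploits the spatial relationships between pixels to improve recognition performance and achieves state-of-the-art results on six challenging benchmarks.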
Abstract
Existing scene text recognition (STR) methods typically use a language model to optimize the joint probability of the 1D character sequence predicted by a visual recognition (VR) model, which ignore the 2D spatial context of...