Mar, 2020
Scene Text Recognition via Transformer
Xinjie Feng, Hongxun Yao, Yuankai Yi, Jun Zhang, Shengping Zhang
TL;DR
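This paper proposes a simple but extremely effective transformer-based method for scene text recognition. It needs only spatial attention and no image rectification, feeds nothing but convolutional feature maps into the transformer as word embeddings, and achieves markedly superior performance in large-scale experiments.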
Abstract
Scene text recognition with arbitrary shape is very challenging due to large variations in text shapes, fonts, colors, backgrounds, etc. Most state-of-the-art algorithms rectify the input image into the normalize…