BriefGPT.xyz
Nov, 2021
多模态文本识别网络:视觉和语义特征之间的交互增强
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
HTML
PDF
Byeonghu Na, Yoonsik Kim, Sungrae Park
TL;DR
本篇论文介绍了一种名为MATRN(Multi-modAl Text Recognition Network)的新方法,通过促进视觉和语义特征之间的互动,提高了文字识别的性能,并证明其在7项基准测试上取得了最先进的表现。
Abstract
linguistic knowledge
has brought great benefits to
scene text recognition
by providing semantics to refine character sequences. However, since
li
→