Jiawei Wang, Shunchi Zhang, Kai Hu, Chixiang Ma, Zhuoyao Zhong...
TL;DR通过将Contextual Text Block Detection任务作为图生成问题,利用DQ-DETR和Dynamic Relation Transformer等先进技术,该研究提出了一种图生成框架,能够以高效准确的方式检测上下文文本块,取得了最先进的结果。
Abstract
contextual text block detection (CTBD) is the task of identifying coherent text blocks within the complexity of natural scenes. Previous methodologies have treated CTBD as either a visual relation extraction challenge within computer vision or as a sequence modeling problem from the pe