BriefGPT.xyz
Aug, 2023
Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations
Xiaolei Diao, Daqian Shi, Jian Li, Lida Shi, Mingzhe Yue...
TL;DR
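The paper constructs an image dataset of ancient Chinese characters with radical-level and character-level annotations, and proposes a zero-shot optical character recognition baseline model based on character decomposition and recombination. Experiments demonstrate the effectiveness of both the dataset and the baseline model.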
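The recombination idea can be illustrated with a minimal sketch. It is not the paper's baseline: it assumes some upstream recognizer has already predicted a list of radicals for an image, and it then picks the dictionary character whose radical multiset overlaps most with that prediction. The dictionary `RADICAL_DICT`, the function `recombine`, and the Jaccard-style score are all hypothetical names and choices made for illustration.

```python
from collections import Counter

# Hypothetical radical dictionary (illustrative only, not the paper's annotation set).
RADICAL_DICT = {
    "好": ["女", "子"],
    "明": ["日", "月"],
    "林": ["木", "木"],
    "休": ["亻", "木"],
}


def recombine(predicted_radicals, radical_dict=RADICAL_DICT):
    """Return (character, score) for the best multiset overlap with the prediction.

    The score is a multiset Jaccard index: shared radicals / all radicals.
    Returns (None, 0.0) when no dictionary entry shares any radical.
    """
    predicted = Counter(predicted_radicals)
    best_char, best_score = None, 0.0
    for char, radicals in radical_dict.items():
        reference = Counter(radicals)
        overlap = sum((predicted & reference).values())  # multiset intersection
        union = sum((predicted | reference).values())    # multiset union
        score = overlap / union if union else 0.0
        if score > best_score:
            best_char, best_score = char, score
    return best_char, best_score


if __name__ == "__main__":
    print(recombine(["日", "月"]))
    print(recombine(["木", "木"]))
```

Because the lookup depends only on radicals, a character that never appeared as a whole in training could still be recovered, provided its radicals are recognized and it is listed in the dictionary.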
Abstract
Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community