BriefGPT.xyz
Jan, 2021
使用多模态特征进行设备端文档分类
On-Device Document Classification using multimodal features
HTML
PDF
Sugam Garg, Harichandana, Sumit Kumar
TL;DR
本文介绍了一种将光学字符识别(OCR)与模型架构集成的新型分类文档的方法,用于在设备上进行分类,防止私人用户数据传输到服务器,并展示在FOOD-101多模态数据集上,将模型压缩30%后展示了竞争性的结果。
Abstract
From small screenshots to large videos,
documents
take up a bulk of space in a modern smartphone.
documents
in a phone can accumulate from various sources, and with the high storage capacity of mobiles, hundreds
→