BriefGPT.xyz
Jun, 2024
基于CNN和分层注意力的可解释图像标题生成
Explainable Image Captioning using CNN- CNN architecture and Hierarchical Attention
HTML
PDF
Rishi Kesav Mohan, Sanjay Sureshkumar, Vignesh Sivasubramaniam
TL;DR
利用可解释AI的方法来进行图像标题生成,通过使用CNN解码器和分层注意力机制的新架构,提高生成速度和准确性,并且向模型中添加可解释性,使其在应用中更加可信赖。模型通过MSCOCO数据集进行训练和评估,并且文章中给出了定量和定性结果。
Abstract
image captioning
is a technology that produces text-based descriptions for an image.
deep learning
-based solutions built on top of feature recognition may very well serve the purpose. But as with any other machin
→