BriefGPT.xyz
Sep, 2023
TMac: 音频事件分类的时间多模态图学习
TMac: Temporal Multi-Modal Graph Learning for Acoustic Event Classification
HTML
PDF
Meng Liu, Ke Liang, Dayu Hu, Hao Yu, Yue Liu...
TL;DR
我们提出了一种基于时态多模态图学习技术的音频事件分类方法TMac,通过建模这种时态信息,我们构建了每个音频事件的时态图,通过利用图学习技术来捕捉模态内部和模态间的动态信息,实现了优于其他最先进模型的性能。
Abstract
audiovisual data
is everywhere in this digital age, which raises higher requirements for the
deep learning models
developed on them. To well handle the information of the
→