BriefGPT.xyz
Oct, 2023
LLark: 一个用于音乐的多模态基础模型
LLark: A Multimodal Foundation Model for Music
HTML
PDF
Josh Gardner, Simon Durand, Daniel Stoller, Rachel M. Bittner
TL;DR
音乐理解和LLark的多模态模型的数据集创建、多模态架构、以及基于开源音乐数据和模型进行训练的结果和代码。
Abstract
Music has a unique and complex structure which is challenging for both expert humans and existing AI systems to understand, and presents unique challenges relative to other forms of audio. We present
llark
, an
instructi
→