BriefGPT.xyz
Ask
alpha
关键词
crossmodal supervision
搜索结果 - 1
SyncVSR: 数据高效的视觉语音识别与端到端跨模态音频令牌同步
Visual Speech Recognition (VSR) aims to interpret spoken content from visual cues, and SyncVSR presents an end-to-end le
→
PDF
18 days ago
Prev
Next