Yueen Ma, Zixing Song, Yuzheng Zhuang, Jianye Hao, Irwin King
TL;DR综合调查了深度学习、多模态模型、视觉-语言-动作模型、具身人工智能的快速发展。
Abstract
deep learning has demonstrated remarkable success across many domains, including computer vision, natural language processing, and reinforcement learning. Representative artificial neural networks in these fields span convolutional neural networks, Transformers, and deep Q-networks. Bu