BriefGPT.xyz
Jun, 2021
隐空间中的基于模型的规划的时间预测编码
Temporal Predictive Coding For Model-Based Planning In Latent Space
HTML
PDF
Tung Nguyen, Rui Shu, Tuan Pham, Hung Bui, Stefano Ermon
TL;DR
本文使用时间预测编码等方法,构建了一种信息论方法的强化学习模型,可帮助解决高维度观测值与复杂背景的问题。
Abstract
High-dimensional observations are a major challenge in the application of model-based
reinforcement learning
(MBRL) to real-world environments. To handle high-dimensional sensory inputs, existing approaches use
represen
→