预测信息加速强化学习

Jul, 2020

Predictive Information Accelerates Learning in RL

Kuang-Huei Lee, Ian Fischer, Anthony Liu, Yijie Guo, Honglak Lee...

TL;DR本文通过使用有监督训练的压缩表示学习了强化学习环境动态的预测信息，通过提高样本效率使得 Soft Actor-Critic 代理人可以大幅度地改善在连续控制任务中的表现。

Abstract

The predictive information is the mutual information between the past and the future, I(X_past; X_future). We hypothesize that capturing the predictive information is useful in RL, since the ability to model what