Information theory is a powerful tool to express principles to drive
autonomous systems because it is domain invariant and allows for an intuitive
interpretation. This paper studies the use of the predictive information (PI),
also called excess entropy or effective measure complexity,
通过引入准确的表示学习机制 ——Predictive Information QT-Opt(PI-QT-Opt),我们研究了预测信息对机器人智能代理的建模以及其在从大量数据中培养具备各种技能的通用代理方面的重要性。实验结果表明,这种机制的应用能有效地提高任务求解的速度,并实现对无尝试性新任务的更好的转移学习。