元训练智能体实现贝叶斯最优智能体

Oct, 2020

元训练智能体实现贝叶斯最优智能体

Meta-trained agents implement Bayes-optimal agents

Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic...

TL;DR该研究通过在一些预测和赌博任务上的实验，发现元学习可以作为近似数值逼近贝叶斯最优智能体的一般技术。实验结果表明，memory-based meta-learning可以使一些不可解的任务变得可解。

Abstract

memory-based meta-learning is a powerful technique to build agents that adapt fast to any task within a target distribution. A previous theoretical study has argued that this remarkable performance is because the meta-training protocol incentivises agents to behave →