BriefGPT.xyz
Dec, 2016
基于贝叶斯优化的因素化情境策略搜索
Factored Contextual Policy Search with Bayesian Optimization
HTML
PDF
Peter Karkus, Andras Kupcsik, David Hsu, Wee Sun Lee
TL;DR
提出基于贝叶斯优化的因式化上下文策略搜索方法来提高机器人学习数据效率,通过将通常考虑的文本刻画为目标类型上下文和环境类型上下文两个部分,从而实现经验在目标类型上下文中直接泛化。初步结果表明,该方法在模拟玩具问题上可以更快地泛化策略。
Abstract
Scarce data is a major challenge to scaling
robot learning
to truly complex tasks, as we need to generalize locally learned policies over different "contexts".
bayesian optimization
approaches to
→