BriefGPT.xyz
Jun, 2022
Interactively Learning Preference Constraints in Linear Bandits
David Lindner, Sebastian Tschiatschek, Katja Hofmann, Andreas Krause
TL;DR
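The paper uses the Adaptive Constraint Learning algorithm to solve sequential decision-making problems with unknown constraints that reflect expensive-to-evaluate human preferences, in particular identifying the safety and comfort constraints expressed in driving behavior. In a driving simulation, the algorithm is more efficient than the other algorithms it is compared against.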
Abstract
We study sequential decision-making with known rewards and unknown constraints, motivated by situations where the constraints represent ex…