BriefGPT.xyz
Jun, 2023
Effect-Invariant Mechanisms for Policy Generalization
Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters
TL;DR
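This work proposes a relaxation of the full-invariance assumption on conditional distributions, called effect invariance, and proves that it is a sufficient condition for zero-shot and few-shot policy generalization.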
Abstract
Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to ...