BriefGPT.xyz
Mar, 2013
广义汤普森抽样用于顺序决策和因果推断
Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference
HTML
PDF
Pedro A. Ortega, Daniel A. Braun
TL;DR
该论文讨论了Thompson采样如何是贝叶斯策略不确定性建模的自然后果、如何用于多个自适应智能体之间的交互研究和如何应用于推断环境中的因果关系等,在自适应顺序决策和因果推断问题中可能不仅是有用的启发式方法,而且也是一个原则性的方法。
Abstract
Recently, it has been shown how sampling actions from the predictive distribution over the optimal action-sometimes called
thompson sampling
-can be applied to solve
sequential adaptive control
problems, when the
→