具函数噪声的连续状态空间中保护隐私的Q学习

Jan, 2019

具函数噪声的连续状态空间中保护隐私的Q学习

Private Q-Learning with Functional Noise in Continuous Spaces

Baoxiang Wang, Nidhi Hegde

TL;DR通过在训练中迭代地向价值函数添加函数噪声，本文在连续空间中考虑了保护差分隐私强化学习算法的价值函数逼近器，并证明了其隐私保证和近似最优性。

Abstract

We consider privacy-preserving algorithms for deep reinforcement learning. State-of-the-art methods that guarantee differential privacy are not extendable to very large state spaces because the noise level necessary to ensure privacy would scale to infinity. We address the problem of p