A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning

May, 2024

Abdulaziz Almuzairee, Nicklas Hansen, Henrik I. Christensen

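TL;DR: SADA, a generalized recipe for data augmentation, improves the stability and generalization of Q-learning algorithms trained from visual observations, and applies to a wide variety of augmentations.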

Abstract

$Q$-learning algorithms are appealing for real-world applications due to their data-efficiency, but they are very prone to overfitting and training instabilities when trained from visual observations. Prior work, namely SVEA, finds that selective application of data augmentation can im