BriefGPT.xyz
Jun, 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko, Jungseul Ok
TL;DR
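Uses network distillation to inject a semantic-consistency prior into deep reinforcement learning, improving sample efficiency and generalization.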
Abstract
In deep reinforcement learning (RL), data augmentation is widely considered as a tool to induce a set of useful priors about semantic consistency and improve