TL;DR本研究中,我们采用了三种 Deep Q-Networks 算法,分别使用了智能采样策略来解决 URRLC 消息的发送问题,证明了方差和最大熵探索的效率比标准的贪婪探索方法更高。
Abstract
The quality of data driven learning algorithms scales significantly with the
quality of data available. One of the most straight-forward ways to generate
good data is to sample or explore the data source intelligently.