BriefGPT.xyz
Jul, 2023
结构化信用分配与协调探索
Structural Credit Assignment with Coordinated Exploration
HTML
PDF
Stephen Chung
TL;DR
使用Boltzmann机器或经常性网络进行协调探索,从而加快多个基于REINFORCE的随机和离散单元的训练速度,甚至超过直接传递估计器(STE)反向传播算法。
Abstract
A biologically plausible method for training an
artificial neural network
(ANN) involves treating each unit as a stochastic
reinforcement learning
(RL) agent, thereby considering the network as a team of agents.
→