BriefGPT.xyz
Feb, 2024
使用深度强化学习和行为规范掌握官旦游戏
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
HTML
PDF
Yifan Yanggong, Hao Pan, Lei Wang
TL;DR
提出了一种名为GuanZero的框架,通过蒙特卡洛方法和深度神经网络使AI代理能够掌握Guandan游戏,主要贡献在于通过精心设计的神经网络编码方案调节代理的行为,通过与最先进的方法进行比较证明了该框架的有效性。
Abstract
games
are a simplified model of reality and often serve as a favored platform for
artificial intelligence
(AI) research. Much of the research is concerned with game-playing agents and their decision making proces
→