Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating

Feb, 2024

Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating

Yifan Yanggong, Hao Pan, Lei Wang

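TL;DR: The paper proposes GuanZero, a framework that uses Monte Carlo methods and deep neural networks to let AI agents master the game of Guandan. Its main contribution is regulating the agents' behavior through a carefully designed neural network encoding scheme. Comparisons with state-of-the-art methods demonstrate the framework's effectiveness.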

Abstract

Games are a simplified model of reality and often serve as a favored platform for artificial intelligence (AI) research. Much of the research is concerned with game-playing agents and their decision-making proces