BriefGPT.xyz
Jun, 2024
对比稀疏自编码器解释国际象棋智能体的规划
Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents
HTML
PDF
Yoann Poupart
TL;DR
基于对对局轨迹的对比稀疏自编码器(CSAE)提取和解释对国际象棋代理计划有意义的概念,通过定性分析CSAE特性并提出自动特性分类法,进一步利用合理性检查评估算法的质量。
Abstract
ai led chess systems
to a superhuman level, yet these systems heavily rely on black-box algorithms. This is unsustainable in ensuring
transparency
to the end-user, particularly when these systems are responsible
→