随机格上的表征学习

Apr, 2025

Representation Learning on a Random Lattice

Aryeh Brill

TL;DR本研究解决了深度神经网络学习的表征不够可解释的问题。提出了将特征视为嵌入数据分布的学习坐标系统的几何视角，采用随机格模型分析其特性。研究结果表明，学习到的特征可以分为上下文、成分和表面特征，提供了对机制可解释性的见解，并指引了未来的研究方向。

Abstract

Decomposing a deep neural network's learned representations into Interpretable Features could greatly enhance its safety and reliability. To better understand features, we adopt a Geometric Perspective, viewing t