BriefGPT.xyz
Oct, 2023
DeepDecipher:大规模语言模型中神经元激活的访问和研究
DeepDecipher: Accessing and Investigating Neuron Activation in Large Language Models
HTML
PDF
Albert Garde, Esben Kran, Fazl Barez
TL;DR
通过API和界面,DeepDecipher为调查Transformer模型MLP层中的神经元提供先进的可解释性技术,使大型语言模型(LLMs)更具透明度和可靠性,可以帮助研究人员、工程师和开发人员快速诊断问题、审计系统并推动该领域的发展。
Abstract
As
large language models
(LLMs) become more capable, there is an urgent need for
interpretable
and
transparent tools
. Current methods are
→