深层ReLU神经网络在逐步可分数据上的可解释全局极小值

May, 2024

深层ReLU神经网络在逐步可分数据上的可解释全局极小值

Interpretable global minima of deep ReLU neural networks on sequentially separable data

Thomas Chen, Patricia Muñoz Ewald

TL;DR明确构建了零损失神经网络分类器，以累积参数来表示权重矩阵和偏差向量，在输入空间上递归地作用于截断映射，针对训练数据的配置包括小且良好分离的聚类和可线性分离的等价类，最佳情况下在M维空间的Q个类别数据中，全局最小化能够用Q(M+2)个参数来描述。

Abstract

We explicitly construct zero loss neural network classifiers. We write the weight matrices and bias vectors in terms of cumulative parameters, which determine →