简化贝叶斯深度学习中的预测过程

Nov, 2024

简化贝叶斯深度学习中的预测过程

Streamlining Prediction in Bayesian Deep Learning

Rui Li, Marcus Klasson, Arno Solin, Martin Trapp

TL;DR本研究针对贝叶斯深度学习（BDL）中预测计算效率不足的问题，提出了一种通过单次前向传播而无需采样的方法。通过对激活函数的局部线性化和线性层的局部高斯近似，我们能够分析性地计算后验预测分布的近似值，从而提升预测效率。我们的方案在多层感知机和变压器模型（如ViT和GPT-2）上的回归和分类任务中表现良好。

Abstract

The rising interest in Bayesian Deep Learning (BDL) has led to a plethora of methods for estimating the posterior distribution. However, efficient computation of inferences, such as predictions, has been largely overlooked with →