BriefGPT.xyz
Jun, 2024
优化超参数与检验点模型权重的改进
Improving Hyperparameter Optimization with Checkpointed Model Weights
HTML
PDF
Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat...
TL;DR
该论文提出了一种用于神经网络的超参数优化方法,使用已训练权重的记录检查点来引导未来的超参数选择,借助灰盒超参数优化方法,通过嵌入权重到高斯过程深度核代理模型来提高效率,并通过一个置换不变图元网络实现数据效率。
Abstract
When training
deep learning models
, the performance depends largely on the selected hyperparameters. However,
hyperparameter optimization
(HPO) is often one of the most expensive parts of model design. Classical
→