BriefGPT.xyz
May, 2023
当梯度下降与无导数优化相遇:黑盒场景中的完美匹配
When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario
HTML
PDF
Chengcheng Han, Liqing Cui, Renyu Zhu, Jianing Wang, Nuo Chen...
TL;DR
本文介绍了一种融合梯度下降和无导数优化的全新方法GDFO,并使用知识蒸馏的方式成功地将梯度下降引入黑盒调整中以优化任务特定的连续提示。实验结果表明,GDFO可以取得显著的性能提升。
Abstract
Large
pre-trained language models
(PLMs) have garnered significant attention for their versatility and potential for solving a wide spectrum of
natural language processing
(NLP) tasks. However, the cost of runnin
→