BriefGPT.xyz
Jul, 2024
加速语言模型评估
On Speeding Up Language Model Evaluation
HTML
PDF
Jin Peng Zhou, Christian K. Belardi, Ruihan Wu, Travis Zhang, Carla P. Gomes...
TL;DR
利用低秩分解的多臂赌博算法,我们的方法能够在仅使用通常所需资源的5-15%情况下,显著降低资源消耗,并且能够识别出性能最好的方法,从而降低成本85-95%。
Abstract
large language models
(LLMs) currently dominate the field of
natural language processing
(NLP), representing the state-of-the-art across a diverse array of tasks. Developing a model of this nature, from training
→