BriefGPT.xyz
Jun, 2024
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar...
TL;DR
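This work compares reasoning strategies for language models by taking the compute budget into account alongside performance metrics. It finds that the success of complex reasoning strategies stems not only from clever algorithm design but also, to a large degree, from the compute resources allocated to them.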
Abstract
A diverse array of reasoning strategies has been proposed to elicit the capabilities of large language models. However, in this paper, we point out that traditional evaluations which focus solely on performance metrics...