BriefGPT.xyz
Jul, 2024
CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Jing Yao, Xiaoyuan Yi, Xing Xie
TL;DR
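Using the CLAVE framework and the ValEval dataset, we study how to evaluate the values of large language models, and find that combining fine-tuned models with prompt-based large models strikes a better balance in value evaluation.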
Abstract
The rapid progress in large language models (LLMs) poses potential risks such as generating unethical content. Assessing LLMs' values can help expose their misalignment, but relies on reference-free