ABC Align：大型语言模型的安全性与准确性对齐

Aug, 2024

ABC Align：大型语言模型的安全性与准确性对齐

ABC Align: Large Language Model Alignment for Safety & Accuracy

Gareth Seneque, Lap-Hang Ho, Ariel Kuperman, Nafise Erfanian Saeedi, Jeffrey Molendijk

TL;DR本研究解决了大型语言模型对齐问题的缺失，提出了一种新颖的方法——ABC Align，通过整合大型媒体组织的标准和偏好，实现对模型的优化。研究发现，该方法有效降低了偏见，提高了准确性，同时保持了推理能力，具有重要的应用潜力。

Abstract

Alignment of Large Language Models (LLMs) remains an unsolved problem. Human preferences are highly distributed and can be captured at multiple levels of abstraction, from the individual to diverse populations. O