BriefGPT.xyz
Feb, 2024
A Roadmap to Pluralistic Alignment
Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell Gordon, Niloofar Mireshghallah...
TL;DR
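Pluralistic alignment of AI systems is an important problem. This paper proposes a roadmap for testing pluralistic alignment in language models. Drawing on several experiments and on empirical evidence from other work, it shows that current alignment techniques are limited with respect to pluralistic alignment, and it stresses the need for further research on the topic.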
Abstract
With increased power and prevalence of AI systems, it is ever more critical that AI systems are designed to serve all, i.e., people with diverse values and perspectives. However, aligning models to serve pluralis