Oct, 2023
AI对齐:一项综合调查
AI Alignment: A Comprehensive Survey
TL;DRAI alignment aims to build AI systems in accordance with human intentions and values, addressing the risks of misaligned systems with superhuman capabilities through forward and backward alignment methodologies.