Oct 2023

AI Alignment: A Comprehensive Survey

TL;DR: AI alignment aims to make AI systems behave in accordance with human intentions and values. It addresses the risks posed by misaligned systems with superhuman capabilities through forward and backward alignment methodologies.