CoMPosT: LLM模拟中描绘和评估卡通画

Oct, 2023

CoMPosT: LLM模拟中描绘和评估卡通画

CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations

Myra Cheng, Tiziano Piccardi, Diyi Yang

TL;DR使用CoMPosT框架对LLM模拟进行表征，评估其扁平化夸张的程度，并发现GPT-4模拟政治和边缘化群体以及一般、无争议话题时高度易于夸张。

Abstract

Recent work has aimed to capture nuances of human behavior by using llms to simulate responses from particular demographics in settings like social science experiments and public opinion surveys. However, there are currently no established ways to discuss or evaluate the quality of suc