This paper investigates biases of Large Language Models (LLMs) through the lens of grammatical gender. Drawing inspiration from seminal work in psycholinguistics on how grammatical gender influences language perception, we use multilingual LLMs to revisit and expand upon the foundational experiments of Boroditsky (2003). Treating LLMs as a novel tool for examining psycholinguistic biases related to grammatical gender, we prompt a model to describe nouns with adjectives in various languages that have grammatical gender. We then examine adjective co-occurrences across genders and languages, and train a binary classifier to predict a noun's grammatical gender from the adjectives an LLM uses to describe it. Surprisingly, we find that a simple classifier can not only predict noun gender above chance but also transfer across languages. We show that while LLMs may describe words differently in different languages, they are biased similarly.
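
To make the classification setup concrete, the following is a minimal sketch of predicting grammatical gender from a bag of adjectives and then evaluating on a second language. It is not the authors' implementation: the abstract does not say which classifier is used, so a small Naive Bayes model stands in here. The adjective lists are invented toy data, and the sketch assumes adjectives from both languages have been mapped into a shared vocabulary (English, for illustration).

```python
import math
from collections import Counter

def train_nb(samples):
    """Fit a multinomial Naive Bayes model.

    samples: list of (adjectives, gender) pairs, gender in {"m", "f"}.
    """
    word_counts = {"m": Counter(), "f": Counter()}
    class_counts = Counter()
    for adjectives, gender in samples:
        class_counts[gender] += 1
        word_counts[gender].update(adjectives)
    vocab = set(word_counts["m"]) | set(word_counts["f"])
    return word_counts, class_counts, vocab

def predict(model, adjectives):
    """Return the more likely gender for a list of adjectives."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    scores = {}
    for gender in ("m", "f"):
        # Log prior plus Laplace-smoothed log likelihood of each adjective.
        score = math.log(class_counts[gender] / total)
        denom = sum(word_counts[gender].values()) + len(vocab)
        for adj in adjectives:
            if adj in vocab:  # adjectives unseen in training are skipped
                score += math.log((word_counts[gender][adj] + 1) / denom)
        scores[gender] = score
    return max(scores, key=scores.get)

def accuracy(model, samples):
    """Fraction of samples whose gender is predicted correctly."""
    correct = sum(predict(model, adjs) == g for adjs, g in samples)
    return correct / len(samples)

# Hypothetical toy data, NOT taken from the paper: adjectives an LLM might
# produce for nouns in "language A" (training) and "language B" (transfer).
train_lang_a = [
    (["strong", "big", "sturdy"], "m"),
    (["heavy", "hard", "strong"], "m"),
    (["elegant", "pretty", "slender"], "f"),
    (["delicate", "pretty", "small"], "f"),
]
test_lang_b = [
    (["big", "heavy", "rough"], "m"),
    (["small", "elegant", "shiny"], "f"),
]

model = train_nb(train_lang_a)
print("cross-language accuracy:", accuracy(model, test_lang_b))
```

Training on one language and scoring on another mirrors the transfer question posed in the abstract. The toy data is constructed to be separable, so the accuracy it yields says nothing about the paper's actual results.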

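This work studies bias in large language models (LLMs) through the lens of grammatical gender. It uses multilingual LLMs to revisit and extend the foundational experiments of Boroditsky (2003), and finds that a simple classifier can not only predict noun gender but also transfer across languages, indicating that LLMs exhibit similar biases in different languages.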

An Elegant Bridge: Multilingual LLMs Are Biased Similarly in Different Languages