As AI systems become increasingly powerful, the need for safe AI has become more pressing. Humans are an attractive model for AI Safety: as the only known agents capable of general intelligence, they perform robustly even under conditions that deviate significantly from prior experienc