Anthropic studied what gives an AI system its ‘personality’ — and what makes it ‘evil’
is The Verge’s senior AI reporter. An AI beat reporter for more than five years, she has also had her work appear in CNBC, MIT Technology Review, Wired UK, and other outlets.
On Friday, Anthropic debuted research unpacking how and why an AI system’s “personality” (its tone, responses, and overarching motivation) changes. Researchers also tracked what makes a model “evil.”
The Verge spoke with Jack Lindsey, an Anthropic researcher working on interpretability, who has also been tapped to lead the