Benchmarking LLM social skills with an elimination game
Published on: 2025-07-18 03:54:41
Elimination Game Benchmark: Social Reasoning, Strategy, and Deception in Multi-Agent LLM Dynamics
The Elimination Game is a multi-player tournament that tests LLMs on social reasoning, strategy, and deception. Players hold public and private conversations, form alliances, and vote to eliminate one another round by round until only two remain. A jury of eliminated players then casts the deciding votes to crown the winner. The benchmark goes beyond simple dialogue by creating an environment in which models must navigate:
Public vs. Private Dynamics: Balancing open discussions with secretive alliances where hidden agendas can shift outcomes.
Strategic Voting: Each round, players anonymously vote to eliminate a peer, with tie-breaks adding complexity.
Jury Persuasion: Finalists must convince th...
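The round structure described above (vote, eliminate, repeat until two remain, then a jury vote) can be sketched in a few lines of Python. This is only an illustrative skeleton, not the benchmark's actual code: the names `play`, `tally`, `random_policy`, and the `VotePolicy` signature are hypothetical, the conversation phases are omitted, and the random tie-break is an assumption, since the excerpt says only that tie-breaks exist.

```python
import random
from collections import Counter
from typing import Callable, List, Tuple

# Hypothetical policy signature: (voter, candidates, rng) -> chosen name.
# In the benchmark an LLM would make this choice after the conversations.
VotePolicy = Callable[[str, List[str], random.Random], str]


def random_policy(voter: str, candidates: List[str], rng: random.Random) -> str:
    """Placeholder policy: pick any candidate uniformly at random."""
    return rng.choice(candidates)


def tally(votes: List[str], rng: random.Random) -> str:
    """Return the most-voted name; ties are broken at random (an assumption)."""
    counts = Counter(votes)
    top = max(counts.values())
    tied = sorted(name for name, n in counts.items() if n == top)
    return rng.choice(tied)


def play(
    players: List[str],
    policy: VotePolicy = random_policy,
    seed: int = 0,
) -> Tuple[str, List[str]]:
    """Run elimination rounds until two remain, then let the jury pick a winner."""
    if len(players) < 3:
        raise ValueError("need at least three players so that a jury exists")
    rng = random.Random(seed)
    active = list(players)
    jury: List[str] = []
    while len(active) > 2:
        # Each active player votes for someone other than themselves.
        votes = [policy(p, [q for q in active if q != p], rng) for p in active]
        eliminated = tally(votes, rng)
        active.remove(eliminated)
        jury.append(eliminated)
    # Eliminated players vote for one of the two finalists.
    jury_votes = [policy(j, active, rng) for j in jury]
    return tally(jury_votes, rng), jury
```

Calling `play(["A", "B", "C", "D", "E"])` returns the winner together with the elimination order. Swapping `random_policy` for a function that queries a model is where the social reasoning the benchmark measures would enter.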