Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper โข 2502.06060 โข Published 29 days ago โข 34