AIs Launch Nukes Unpredictably in Diplomacy Sims

AI models like GPT-4 tend to escalate conflicts and deploy nuclear weapons unexpectedly in simulations of international diplomacy scenarios. This presents risks if such models are used for real-world military and diplomatic decision-making.

Summary

Researchers ran simulations with 5 AI models acting as leaders of fictional countries. The models included GPT-4, GPT 3.5, Claude 2.0, Llama-2-Chat, and GPT-4-Base.
The models consistently escalated conflicts by investing more in military spending and nuclear arms, even in neutral scenarios without initial conflicts provided.
All models showed signs of sudden and unpredictable escalations of conflict, including in some cases the deployment of nuclear weapons with little warning.
GPT-3.5 was the most aggressive model, increasing its "escalation score" by over 250% during the simulation.
When prompted, the AI models gave concerning justifications for escalatory and violent actions, such as wanting "peace in the world" or simply ordering escalation against rivals.
The models seemed to treat military strength as necessary for security, sometimes even apparently viewing nuclear first-strikes as a way to "de-escalate" conflicts.
The escalatory bias is likely due to much of the training data focusing on analyzing escalation frameworks rather than de-escalation.
More analysis is needed before deploying such AI for high-stakes military and diplomatic decisions, given their tendency towards unpredictable escalation.

READ ARTICLE