This paper asks under what conditions do social corrections promote engagement from corrected users, allowing for greater insight into how users respond to debunking messages (even if such responses are negative).
This paper reports results from a three-round Delphi study conducted late 2025 with 272 international AI experts who rated 24 AI risks on harm probability and severity, sector and actor vulnerability, actor responsibility, and overall concern.
Sinan Aral |
Director, MIT IDE; Professor, MIT Sloan School of Management
Harang Ju |
Assistant Professor, Johns Hopkins University
Michelle Vaccaro
Michael Caosun
Jared R. Curhan
This research reveals that principles from human negotiation theory remain crucial even in AI–AI contexts and that AI-specific technical strategies also create success. The results suggest the need to establish a new theory of AI negotiation that integrates classic negotiation theory with AI-specific negotiation theories to better understand autonomous negotiations and optimize agent performance.
Artificial intelligence (AI) is reshaping society, from video generation to medical diagnosis, coding agents to autonomous vehicles. Yet researchers, policymakers, and technology companies lack shared terminology for discussing AI risks. This paper addresses this challenge by creating a comprehensive catalog of AI risks.
These findings highlight the need to address LLMs’ shortcomings in handling exceptions in order to guide the development of agentic AI toward models that can effectively align with human judgment and simultaneously adapt to novel contexts.
Sinan Aral |
Director, MIT IDE; Professor, MIT Sloan School of Management
David Holtz |
Assistant Professor, Columbia Business School
P. Alex Dow | Microsoft Research
In this paper, the research team presents results from what they believe is the first ever field experiment to randomize the introduction of a reputation system into a massive online marketplace.
This paper studies prompt adaptation—how users adjust their inputs in response to evolving model behavior—using a common experimental design applied to two preregistered tasks with 3,750 total participants who submitted nearly 37,000 prompts.