LLM Agents Fail to Beat Classifiers at Predicting Reactions
A new benchmark of 120K+ AI personas simulating 1,511 real humans shows LLM agents can predict social media reactions, but fail to outperform simpler text classifiers—raising questions about persona-based simulation value.