Microsoft Research
Microsoft's Webwright Agent Hits 60.1% on Odysseys Bench
Microsoft Research unveils Webwright, a terminal-native web agent framework that nearly doubles GPT-5.4's baseline score on the Odysseys benchmark, jumping from 33.5% to 60.1%.