Microsoft

Microsoft Fara1.5 Agents Beat OpenAI, Gemini on Web Tasks

Microsoft's new Fara1.5 family of browser computer-use agents (4B/9B/27B) outperforms OpenAI Operator and Gemini 2.5 Computer Use on the Online-Mind2Web benchmark, marking a major step for open-weight web agents.

Microsoft has released Fara1.5, a new family of browser-native computer-use agents available in 4B, 9B, and 27B parameter sizes. According to benchmark results published alongside the release, the Fara1.5 models outperform OpenAI's Operator and Google's Gemini 2.5 Computer Use on the Online-Mind2Web benchmark — a key evaluation suite for agents that interact with live websites the way a human user would.

The release signals Microsoft's intent to compete aggressively in the rapidly expanding category of agentic AI systems, where models don't merely answer questions but actively navigate, click, type, and complete tasks across real web interfaces.

What Fara1.5 Does Differently

Unlike conversational LLMs, browser computer-use agents must perceive a webpage, plan multi-step actions, and execute them reliably across the unpredictable terrain of modern web apps. Fara1.5 is purpose-built for this loop: visual understanding of rendered pages, grounded action generation (clicks, scrolls, form fills), and recovery from errors when pages don't behave as expected.

By shipping three model sizes — 4B, 9B, and 27B — Microsoft is targeting different deployment profiles. The 4B variant is small enough for edge or latency-sensitive inference, while the 27B model is positioned as the flagship capable of complex, multi-step browsing tasks. This tiering mirrors the strategy Meta and Mistral have used with open-weight LLMs, but applied to the agentic domain where most competitors (OpenAI Operator, Anthropic's Claude Computer Use, Gemini 2.5) remain closed APIs.

Online-Mind2Web Benchmark Performance

Online-Mind2Web is a live-web evaluation that tests agents against real websites with dynamic content, anti-bot measures, and authentic UI complexity. Outperforming OpenAI Operator and Gemini 2.5 Computer Use on this benchmark is significant because both competitors are backed by frontier-scale infrastructure and proprietary training pipelines.

If Microsoft's numbers hold up to independent replication, Fara1.5 represents a meaningful efficiency win: smaller models matching or beating much larger closed systems on grounded web tasks. This pattern — specialization beating scale — has been recurring across the agentic AI space throughout 2025 and 2026.

Why It Matters for Synthetic Media and Authenticity

Browser agents that can autonomously navigate the web have direct implications for digital authenticity. The same capabilities that let an agent book a flight or fill out a research form can be repurposed to mass-produce synthetic engagement, scrape and remix media, or generate convincing fake activity at scale. As agentic systems become more capable, content provenance, watermarking, and bot-detection systems will face mounting pressure.

On the constructive side, capable agents can automate the verification of media authenticity — cross-referencing claims against source databases, retrieving original assets, and assembling evidence chains. Microsoft has been investing heavily in content credentials (C2PA) and provenance tooling, and an in-house agent stack could integrate with those authenticity pipelines.

Strategic Context

Microsoft's release lands amid an arms race in computer-use agents. OpenAI's Operator, Anthropic's Claude with computer use, Google's Gemini 2.5 Computer Use, and a wave of open-source projects have all pushed the state of the art forward in the past year. By releasing multiple model sizes — and by reportedly outperforming the closed competition on a respected benchmark — Microsoft is positioning itself as a serious player not just in foundation models but in the application layer where agents will run.

For enterprises, Fara1.5 offers something the closed API competitors largely don't: the prospect of running browser agents on controlled infrastructure, behind a firewall, with auditable behavior. That is increasingly important for regulated industries that want to deploy agentic automation without sending sensitive workflow data through third-party APIs.

What to Watch Next

Key questions remain: How does Fara1.5 perform on longer-horizon tasks beyond Online-Mind2Web? What licensing terms accompany the model weights? And how will it integrate with Microsoft's broader Copilot and Azure AI agent stack? Independent reproductions of the benchmark numbers and red-team evaluations of misuse potential will be critical in the coming weeks.

For now, Fara1.5 marks another step in the shift from chat-based AI to action-based AI — and Microsoft has staked a competitive claim in that future.

View Source

Stay informed on AI video and digital authenticity. Follow Skrew AI News.