UiPath Screen Agent Powered by Claude Opus 4.5 Receives Top Ranking on OSWorld-Verified Benchmark for Agentic Automation
Agent and agentic benchmarks validate AI’s effectiveness in real use cases and task environments, giving enterprises the confidence to deploy AI across multiple workflows. The OSWorld benchmark provides a unified, integrated computer environment for assessing open-ended computer tasks that involve arbitrary applications. It uses a first-of-its-kind scalable, real computer environment for multimodal agents, providing validation across 369 computer tasks involving web and desktop apps in open domains, OS file I/O, and workflows spanning multiple operating systems and applications.
A core technology for UiPath ScreenPlay, UiPath Screen Agent uses common large language models (LLMs) that allow for the use of natural language to simply and easily create user interfaces (UI) to automate and execute end-to-end complex tasks. The ranking of UiPath Screen Agent powered by Claude Opus 4.5 validates its effectiveness, weighing its performance against both general-purpose and specialized computer-using models, as well as other agentic frameworks evaluated in the benchmark.
“Having had an early look at UiPath ScreenPlay, we’re excited about its potential to meaningfully improve how we scale automation. Its adaptive intelligence could support our growing partner ecosystem while helping reduce ongoing maintenance so our teams can stay focused on growth,” said
This milestone builds on UiPath’s continued progress in advancing UI automation with agentic AI, following the
“Organizations need the confidence that their large-scale commitments to AI will pay off, which is where benchmarks can be incredibly helpful in validating specific use cases and critical workflows,” said
For more on UiPath Screen Agent, UiPath ScreenPlay the UiPath Platform, click here.
About
View source version on businesswire.com: https://www.businesswire.com/news/home/20260114950785/en/
UiPath Media Contact
pr@uipath.com
UiPath Investor Relations
investor.relations@uipath.com
Source: