UiPath Screen Agent
Search documents
UiPath Screen Agent Powered by Claude Opus 4.5 Receives Top Ranking on OSWorld-Verified Benchmark for Agentic Automation
Businesswire· 2026-01-14 21:32
Core Insights - UiPath Screen Agent powered by Claude Opus 4.5 achieved the No. 1 ranking on the OSWorld-Verified benchmark, validating its effectiveness for enterprise-wide agentic AI deployments [1][3] Group 1: Benchmarking and Validation - The OSWorld benchmark assesses the effectiveness of AI in real use cases and task environments, providing a unified environment for evaluating multimodal agents across 369 computer tasks [2] - The benchmark uses a scalable, real computer environment to validate agentic AI across various applications, including web and desktop apps [2] Group 2: Technology and Applications - UiPath Screen Agent utilizes large language models (LLMs) to create user interfaces for automating complex tasks, demonstrating its effectiveness against both general-purpose and specialized models [3] - The ranking of UiPath Screen Agent reflects its performance in comparison to other agentic frameworks evaluated in the benchmark [3] Group 3: Industry Impact and Future Outlook - The achievement of the No. 1 ranking builds on UiPath's progress in UI automation with agentic AI, following a previous ranking of No. 2 for UiPath Screen Agent powered by OpenAI GPT-5 [5] - The company emphasizes the importance of benchmarks in providing confidence for organizations making large-scale AI commitments, highlighting its ongoing investment in enterprise-grade capabilities [6]