Windows AI assistant gets a free upgrade: it can operate the computer, log in to web pages, and generate code

Core Insights
- Microsoft has officially updated Windows Copilot and made the AI assistant available to all users for free. The update also enhances Microsoft 365 Copilot with a new feature called "Researcher", which includes "Computer Use" capabilities [1]

Group 1: New Features and Capabilities
- The "Computer Use" capability enables smarter research, deeper insights, and more comprehensive reports by securely accessing internal enterprise data that sits behind login authentication [1]
- The Researcher agent can use code to generate PowerPoint presentations, spreadsheets, or applications [1]
- Users can enrich work reports with private meeting notes, documents, and chat records [1]

Group 2: Technical Implementation
- The Researcher capability is supported by a series of new tools that the Researcher agent can orchestrate. These tools connect to a sandbox environment that provides a screenshot of each operation [2]
- When an operation is required, a virtual machine running on Windows 365 is started. It is isolated from the internal network and from user devices, which keeps the operation secure [4]
- The virtual machine runs in a temporary sandbox environment with all necessary components pre-installed, and user credentials are neither stored nor transmitted outside the sandbox [4]

Group 3: Performance Testing
- The Researcher with Computer Use was evaluated on the GAIA and BrowseComp benchmarks. On complex multi-step browsing tasks it showed a 44% performance improvement over the current version [6]
- On the GAIA test, performance improved by 6%, demonstrating the model's ability to find, verify, and reason with real-world data [6]
- Examples of tasks completed by the Researcher include piecing together information from several web pages to answer complex questions [6]

Group 4: Competitive Context
- Microsoft has not disclosed the raw test scores, which makes the absolute size of the improvements difficult to assess [7]
- The performance metrics can only be compared indirectly with OpenAI's DeepResearch results, with recent data from Qwen providing a reference point [7]
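
To illustrate why the missing raw scores matter, here is a minimal Python sketch of what the reported 44% and 6% gains would mean under different starting points. It assumes the percentages are relative improvements (not percentage-point gains), which the source does not confirm. The baseline values and the names `improved_score`, `score_table`, and `HYPOTHETICAL_BASELINES` are invented for illustration and are not Microsoft's figures.

```python
# Assumption: "44%" and "6%" are relative gains over the current version.
# The baselines below are hypothetical; Microsoft has not published raw scores.
HYPOTHETICAL_BASELINES = [10.0, 30.0, 50.0]  # percent of tasks solved


def improved_score(baseline: float, relative_gain: float) -> float:
    """Return the score after applying a relative gain (0.44 means +44%)."""
    return baseline * (1.0 + relative_gain)


def score_table(relative_gain: float) -> list[tuple[float, float]]:
    """Pair each hypothetical baseline with the score implied by the gain."""
    return [
        (baseline, round(improved_score(baseline, relative_gain), 2))
        for baseline in HYPOTHETICAL_BASELINES
    ]


if __name__ == "__main__":
    reported_gains = [("multi-step browsing", 0.44), ("GAIA", 0.06)]
    for label, gain in reported_gains:
        print(f"{label}: reported relative gain {gain:.0%}")
        for baseline, new_score in score_table(gain):
            delta = new_score - baseline
            print(f"  baseline {baseline:5.1f} -> {new_score:5.1f} "
                  f"(+{delta:.1f} points)")
```

The same relative gain translates into very different absolute gains depending on the baseline, which is why the figures cannot be lined up directly against published scores from other systems.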