Workflow
OpenAI的Agent来了,被批“鸡肋”升级?
2 1 Shi Ji Jing Ji Bao Dao·2025-07-18 11:26

Core Viewpoint - The AI Agent competition is intensifying, but there remains a gap between capability and practicality, as demonstrated by OpenAI's recent launch of ChatGPT Agent, which aims to serve as a comprehensive assistant for complex tasks [1][5]. Group 1: Product Features and Performance - ChatGPT Agent integrates the visual interaction capabilities of Operator with the information synthesis abilities of DeepResearch, allowing it to manage visual browsers, text browsers, and code terminals simultaneously [2]. - The Agent can perform complex task chains, such as automating office tasks, generating meeting briefs, conducting competitive analysis, planning weekly menus, and creating detailed research reports [3]. - In performance tests, ChatGPT Agent achieved a pass@1 score of 41.6% in the HLE test and an overall accuracy of 45.54% in the SpreadsheetBench test, outperforming Microsoft's Copilot in Excel [3]. Group 2: User Experience and Feedback - Despite impressive performance metrics, user experiences have been mixed, with some reporting that the Agent's task completion rate is around 50%, and efficiency issues have been noted, such as a task taking significantly longer than manual completion [4]. - The PPT generation feature has received criticism for its aesthetic quality, being deemed inferior to other general-purpose agents [4]. - Concerns have been raised regarding the security of connecting the Agent to private data sources like Google Drive and Gmail, with potential risks highlighted if errors occur in sensitive transactions [4]. Group 3: Market Position and Future Outlook - The release of ChatGPT Agent appears to be more of a routine upgrade rather than a groundbreaking innovation, reflecting a shift in focus from dramatic technological breakthroughs to refining existing product shortcomings [5]. - The AI competition is entering a new phase where the emphasis is on practical usability and user willingness to pay for services, rather than just performance metrics [5]. - OpenAI is exploring sustainable business models amid high operational costs and the need for reliable server performance, indicating that the true potential of AI Agents will only be realized once user trust and functionality are established [6].