OpenAI Launches ChatGPT Agent, Only to See It Criticized as "of Marginal Use"
21st Century Business Herald · 2025-07-18 12:21

Core Viewpoint - The article discusses OpenAI's recent launch of ChatGPT Agent, highlighting its potential as a versatile assistant while also addressing mixed user reactions to its practicality and performance [1][8].

Group 1: Product Features and Capabilities
- ChatGPT Agent integrates the visual interaction capabilities of Operator with the information synthesis abilities of DeepResearch, aiming to handle complex tasks across various scenarios [3][4].
- The agent can automate office tasks, analyze calendars, generate meeting briefs, conduct competitive analysis, and create editable presentations [4].
- It can also assist with personal tasks, such as planning weekly menus and facilitating online shopping [5].
- In performance tests, ChatGPT Agent achieved a pass@1 score of 41.6% in the HLE test and an overall accuracy of 45.54% in the SpreadsheetBench test, outperforming Microsoft's Copilot in Excel [6][7].

Group 2: User Reception and Criticism
- Despite the impressive performance metrics, user experiences have been disappointing, with some users reporting that tasks took significantly longer than expected, raising efficiency concerns [9][11].
- The PPT generation feature received negative feedback for its aesthetic quality, and there are significant security concerns regarding the agent's access to sensitive data [12].
- The functionality is currently limited to Pro, Plus, and Team users, with a low usage quota that does not align with its "all-in-one assistant" branding [12].

Group 3: Industry Trends and Future Outlook
- The article suggests that AI competition is shifting from "brute force" technological advances to refining existing products and addressing user needs [13].
- OpenAI's approach appears to be a large-scale public test rather than a full commercial rollout, indicating a search for a sustainable business model amid high operational costs [13].
- The company acknowledges the risks associated with the model's capabilities, particularly in sensitive areas, and has implemented safety measures to mitigate these risks [13].