Workflow
一手体验:首款通用Agent产品Manus,效果如何?
虎嗅APP·2025-03-06 10:23

Core Viewpoint - The article discusses the launch of Manus, the first general-purpose AI agent product, which is perceived as a significant advancement in AI capabilities, surpassing existing models like OpenAI's DeepResearch and Claude's Computer Use [2][5][8]. Group 1: Manus Overview - Manus is described as a groundbreaking project that combines the best features of existing AI models and can perform complex tasks such as coding and task planning [5][6]. - It has achieved a high score in the GAIA (General AI Assistants) benchmark, surpassing OpenAI's DeepResearch [8][10]. Group 2: GAIA Benchmark - GAIA is a benchmark testing system for general AI assistants, introduced in 2023 by Meta AI and Hugging Face, consisting of 466 carefully designed questions [10][11]. - The benchmark assesses various capabilities, including web search, tool usage, programming, and document processing, with a success rate of 90% for humans and only 15% for GPT-4 at the first level [14][13]. Group 3: Manus Capabilities - Manus can decompose complex tasks into manageable steps and execute them autonomously in the cloud, providing users with real-time updates on progress [22][24][36]. - An example task involved converting a PDF paper into a PowerPoint presentation, where Manus successfully extracted information, summarized it, and formatted it according to specific requirements [25][40]. Group 4: User Experience - The user interface of Manus is designed for intuitive interaction, allowing users to see the progress of tasks in real-time, enhancing the overall experience [37][38]. - Users have reported high satisfaction with the output quality, noting that Manus can produce well-structured and visually appealing documents [41][60]. Group 5: Competitive Landscape - Manus is positioned as a strong competitor in the AI space, with its capabilities leading to comparisons with existing models like OpenAI's DeepResearch, which, while high quality, lacks the same level of readability and interactivity [56][57]. - The article emphasizes the rapid advancements in AI technology, suggesting that Manus represents a new height in agent engineering [69][70].