Workflow
Baidu Intelligent Cloud Qianfan
Pitfalls and Solutions in Agent Development (Yin Jie, Senior Product Manager, Baidu Intelligent Cloud)
Sou Hu Cai Jing· 2025-10-14 03:57
Core Insights
- The report discusses the challenges and solutions in the development of Agents, highlighting the contrast between ideal expectations and real-world difficulties [1][2].

Pre-Launch Phase
- Common pitfalls include unclear goals, neglecting data tools, lack of valuable business scenarios, and insufficient ROI evaluation [9][10].
- Solutions involve focusing on small, pain-point-driven topics, ensuring data accessibility and quality, clarifying customer needs, and setting quantifiable ROI metrics [9][10][11].

Development Phase
- Issues faced during development include model selection difficulties, improper usage, cost overruns, vague prompts, chaotic knowledge management, and weak security measures [2][20].
- Strategies to address these include utilizing platforms like Baidu's Qianfan for model selection, designing clear prompts akin to PRD writing, optimizing knowledge management, and establishing a robust security framework [2][20][26].

Post-Launch Phase
- Common problems after launch include lack of monitoring alerts, inadequate scaling and disaster recovery mechanisms, and insufficient user feedback systems [2][20].
- Recommendations include identifying resource dependencies, configuring redundant capacities, establishing comprehensive logging and monitoring systems, and enhancing user feedback mechanisms for continuous optimization [2][20].

Overall Development Approach
- The development of Agents should adhere to a multi-faceted principle, balancing key elements to ensure high availability and continuous improvement, ultimately creating intelligent agents that meet user needs [1][2].
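
The advice to write prompts the way one writes a PRD can be illustrated with a small sketch. The summary does not say which sections such a prompt should contain, so the `PromptSpec` class and its fields (`role`, `goal`, `constraints`, `output_format`) are assumptions chosen for illustration, not a format taken from the report or from Qianfan.

```python
from dataclasses import dataclass, field


@dataclass
class PromptSpec:
    """Hypothetical PRD-style prompt spec; the field names are illustrative."""
    role: str
    goal: str
    constraints: list[str] = field(default_factory=list)
    output_format: str = "plain text"

    def render(self) -> str:
        # Emit each section under an explicit label so nothing is left implicit.
        lines = [f"Role: {self.role}", f"Goal: {self.goal}"]
        if self.constraints:
            lines.append("Constraints:")
            lines.extend(f"- {c}" for c in self.constraints)
        lines.append(f"Output format: {self.output_format}")
        return "\n".join(lines)


# Made-up example values, not taken from the report.
spec = PromptSpec(
    role="Customer-support agent for an online store",
    goal="Answer order-status questions using only the order database",
    constraints=["Refuse questions outside order status", "Never reveal internal IDs"],
    output_format="One short paragraph",
)
prompt = spec.render()
print(prompt)
```

Keeping the sections as named fields makes a vague prompt visible early: an empty goal or a missing constraint shows up as a gap in the spec rather than as unpredictable agent behavior later.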
Large Model Special Topic: 2025 Research Report on Technical Capability Testing of Large-Model Agent Development Platforms
Sou Hu Cai Jing· 2025-08-14 15:48
Core Insights
- The report evaluates the technical capabilities of four major AI model development platforms: Alibaba Cloud's Bailian, Tencent Cloud's Intelligent Agent Development Platform, Kouzi, and Baidu Intelligent Cloud Qianfan, focusing on RAG capabilities, workflow capabilities, and agent capabilities [1][7][8].

RAG Capability Testing
- RAG capability testing assesses knowledge enhancement mechanisms, including multi-modal knowledge processing, task complexity adaptation, and interaction mechanism completeness [7][8].
- In text question answering, all platforms were highly accurate, exceeding 80% accuracy on multi-document responses, although some platforms showed stability issues during API calls [20][21].
- Baidu Intelligent Cloud Qianfan exhibited stable performance in complex query scenarios for structured data, while Tencent Cloud refused 100% of out-of-knowledge-base questions [21][23].
- The platforms differed in how they handled refusal and clarification [21][22].

Workflow Capability Testing
- Workflow capability testing focuses on dynamic parameter extraction, exception rollback, intent recognition, and fault tolerance [35][36].
- The end-to-end accuracy for workflow processes ranged from 61.5% to 93.3%, with Tencent Cloud leading in intent recognition accuracy at 100% [36][37].
- The platforms demonstrated basic usability in workflow systems, but there is room for improvement in complex information processing [38][39].

Agent Capability Testing
- Agent capability testing evaluates the ability to call tools, focusing on intent understanding, operational coordination, feedback effectiveness, and mechanism completeness [44][45].
- All platforms achieved high single-tool call completion rates (83%-92%), but multi-tool collaboration and prompt calling showed potential for improvement [48][50].
- Tencent Cloud's Intelligent Agent Development Platform excelled in tool call success rates due to its robust ecosystem and process optimization [49][50].
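
The accuracy and refusal figures quoted above are ratios over sets of test questions. The summary does not give the report's scoring rules, so the sketch below uses assumed definitions: accuracy is measured over questions the knowledge base can answer, and refusal rate over questions it cannot. The `Case` record and the sample data are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Case:
    in_knowledge_base: bool  # whether the answer exists in the knowledge base
    refused: bool            # whether the platform declined to answer
    correct: bool            # whether a given answer matched the reference


def accuracy(cases):
    """Share of in-knowledge-base questions answered correctly."""
    answerable = [c for c in cases if c.in_knowledge_base]
    if not answerable:
        return 0.0
    return sum(c.correct and not c.refused for c in answerable) / len(answerable)


def refusal_rate(cases):
    """Share of out-of-knowledge-base questions that were refused."""
    unanswerable = [c for c in cases if not c.in_knowledge_base]
    if not unanswerable:
        return 0.0
    return sum(c.refused for c in unanswerable) / len(unanswerable)


# Made-up records, not data from the report.
cases = [
    Case(True, False, True),
    Case(True, False, True),
    Case(True, False, False),
    Case(True, False, True),
    Case(False, True, False),
    Case(False, True, False),
]
print(accuracy(cases), refusal_rate(cases))
```

Separating the two denominators matters: a platform that refuses everything would score a perfect refusal rate while its accuracy on answerable questions collapses, so the two numbers need to be read together.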
Reporter's Hands-On Test | Agents Hit the "Accelerator" as Tech Giants Vie to Become the MCP "App Store"
Bei Ke Cai Jing· 2025-04-30 08:40
Core Insights
- The launch of Manus and the popularity of the Model Context Protocol (MCP) have accelerated the development of intelligent agents among major companies since April 2025 [1][24].
- Various companies have introduced MCP services, enhancing the capabilities of their intelligent agents and breaking down software barriers, leading to improved efficiency and accuracy [3][24].

Group 1: Company Developments
- Alibaba Cloud launched its MCP service on April 9, 2025, followed by Ant Group, ByteDance, and Baidu introducing their respective MCP integrations throughout April [1].
- By April 29, 2025, multiple domestic companies, including Yingmi Fund and Guangfa Securities, had begun offering services through Alibaba's MCP platform, covering areas such as fund advisory and stock analysis [3][19].
- Baidu's integration of MCP into its products allows users to complete transactions directly through intelligent agents, marking a significant step in e-commerce capabilities [13][16].

Group 2: Performance Testing
- Initial tests of Alibaba's MCP service showed a limited range of services, but subsequent tests revealed a growing number of providers and functionalities [3][19].
- The intelligent agent created by the reporter was able to recommend specific funds after integrating with Yingmi Fund's MCP service, showcasing the enhanced capabilities of MCP [5][4].
- ByteDance's intelligent agent demonstrated significant improvements in task execution speed and accuracy after integrating MCP, completing complex tasks in a fraction of the time compared to previous methods [9][12].

Group 3: Market Trends and Challenges
- The integration of MCP services is transforming platforms into application stores for AI, with companies exploring new business models and user engagement strategies [23][24].
- The varying number of MCP services across different platforms indicates a competitive landscape, with each company aiming to enhance its offerings [19][20].
- Concerns regarding the security of MCP have been raised, highlighting the need for robust measures to protect user data and ensure safe interactions between intelligent agents [29][30].
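
For readers unfamiliar with what an agent sends when it uses an MCP service, the sketch below builds a tool invocation message. MCP messages follow JSON-RPC 2.0, and tools are invoked with the `tools/call` method. The tool name `recommend_fund` and its `risk_level` argument are hypothetical and do not describe any provider mentioned in the article.

```python
import json


def build_tool_call(request_id, tool_name, arguments):
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# `recommend_fund` and its arguments are hypothetical, not a real provider's tool.
request = build_tool_call(1, "recommend_fund", {"risk_level": "low"})
payload = json.dumps(request)
print(payload)
```

Because every provider exposes tools through the same message shape, an agent platform can list many third-party services side by side, which is what makes the "app store" comparison in the article apt.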