Agent 热潮年度回望：一切火爆早有预兆

Core Insights - The article discusses the rapid acceleration of AI agents since the beginning of 2026, highlighting key variables that have driven this concentrated explosion in the field [1] - It draws parallels between the current excitement around AI agents and the early discussions surrounding the internet in 1999, emphasizing a shift in organizational structures and the role of humans [2] - The narrative indicates a transition from excitement to a more grounded understanding of the practical challenges and engineering details involved in deploying AI agents in real-world environments [4][5] Group 1: Development and Challenges of AI Agents - The past year has been termed the "Year of the Agent," marking a paradigm shift where models are not just for conversation but can actively perform tasks, plan, and even write code [4] - Despite initial excitement, real-world applications reveal challenges such as model drift, unclear permission boundaries, and unpredictable costs, making them unsuitable for serious workflows [4] - The complexity of integrating agents into existing systems is highlighted, as they face diverse toolsets and commercial boundaries, complicating the establishment of standardized protocols [6][7] Group 2: Protocol and Architecture - The first systematic attempts in the agent direction stem from protocols like MCP and A2A, aiming to create unified interfaces for model integration and cross-platform collaboration [6][7] - The article emphasizes the importance of establishing a layered architecture for agents, where a cognitive core handles understanding and planning, while execution capabilities are clearly defined and controlled [9][10] - The shift from creating specialized agents for each scenario to a more modular approach allows for reusable execution capabilities, enhancing efficiency and governance [10][11] Group 3: Skills and Density - The concept of "skills" has evolved from simple plugins to a more structured framework where skills are defined as callable, constrained, and auditable actions within a system [11][17] - The article posits that the density of skills—how many high-quality skills are available—will determine the effectiveness of AI agents, as a higher density allows for more complex problem-solving capabilities [19][20] - The comparison to the mobile internet era suggests that the true value lies not in the number of skills but in their interconnectivity and ability to be reused across different models and systems [20] Group 4: Memory and Continuity - The introduction of memory is seen as a crucial advancement, allowing agents to maintain context and continuity across tasks, which is essential for long-term collaboration [22][25] - The article distinguishes between different types of memory, emphasizing the need for persistent memory that encompasses task status, long-term context, and decision history [23][24] - This capability transforms agents from being one-time tools to systems that can accumulate organizational knowledge and provide ongoing value [25] Group 5: The Role of Open Source Models - The rise of open-source large models in China is highlighted as a significant factor in changing the power dynamics within the AI landscape, enabling developers to integrate these models into real workflows [26][29] - The article notes that local deployment of models allows for greater control and customization, particularly in sensitive industries like healthcare and finance [29][30] - Open-source models lower barriers to experimentation and innovation, facilitating the development of vertical agents tailored to specific industry needs [30]