Core Insights - The article discusses a new paradigm in AI agents that can autonomously create tools to fulfill tasks without human intervention, showcasing significant advancements in self-evolving capabilities [1][2][21]. Group 1: Agent Capabilities - The new agent, powered by Gemini 3 Pro, demonstrated superior performance in the Humanity's Last Exam (HLE), achieving scores nearly 20 points higher than other agents using disclosed methods [2][12]. - This agent can generate tools on-the-fly, creating 128 unique tools during its evaluation across various benchmarks, indicating a self-sufficient evolution process [12][13]. - The agent's performance improved significantly with the use of previously created tools, demonstrating a clear trend of diminishing returns after reaching a stable number of tools [13][15]. Group 2: Evolution Framework - The research introduces a novel framework called In-situ Self-evolving Agent, which allows the agent to evolve during the inference phase without external supervision, relying on internal feedback and past experiences [21][27]. - This approach contrasts with traditional self-evolving methods that depend heavily on pre-defined training and expert supervision, making it more adaptable and efficient in real-world applications [22][24]. Group 3: Tool Utilization - The agent prioritizes tool creation as a means of evolution, which directly influences its capabilities and performance, allowing it to handle a wide range of tasks effectively [36][40]. - The framework emphasizes the importance of tools in determining the agent's operational boundaries, ensuring high-quality feedback through code execution [37][38]. Group 4: Research and Development - The research was conducted by a team from Yunjue Technology, founded by former Alibaba executive Peng Chao, with a focus on wearable general intelligence [53][56]. - The project was completed with a modest budget of 150,000 yuan, highlighting the potential for impactful research with limited resources [60][61]. Group 5: Open Source and Industry Impact - The self-evolving framework is open-source, allowing for community engagement and further development, which could lead to significant advancements in AI capabilities [49][75]. - The article suggests that the integration of this self-evolving agent could address the challenges of cost, safety, and adaptability in AI applications, particularly in consumer-facing scenarios [62][71].
Skills刚火,就有零Skill的Agent来了…
3 6 Ke·2026-01-26 11:40