DeepSeek's Next Big Move Revealed: Agents Are the Next Step
QbitAI (量子位) · 2025-09-05 01:49

Core Viewpoint

DeepSeek is reportedly developing a new model with enhanced AI Agent capabilities, expected to launch by the end of this year [3][8].

Group 1: Model Development

- DeepSeek's recent update in August introduced DeepSeek-V3.1, which features improved Agent capabilities through Post-Training optimization, enhancing performance in tool usage and agent tasks [5][11].
- The upcoming model is designed to execute complex operations with minimal prompts and can self-evolve based on historical actions [7][8].
- The transition from DeepSeek V3 to V3.1 over nine months indicates a focus on incremental improvements rather than major version changes [9][10].

Group 2: Performance Metrics

- DeepSeek-V3.1 shows significant performance improvements in various benchmarks compared to its predecessors:
  - SWE-bench: 66.0 (V3.1) vs. 45.4 (V3) and 44.6 (R1)
  - SWE-bench Multilingual: 54.5 (V3.1) vs. 29.3 (V3) and 30.5 (R1)
  - Terminal-Bench: 31.3 (V3.1) vs. 13.3 (V3) and 5.7 (R1) [12].
- In search agent evaluations, V3.1 also demonstrated comprehensive performance enhancements over R1 [12].

Group 3: Future Outlook

- The introduction of DeepSeek R1 has significantly influenced the global large model industry, marking a pivotal moment in its development [15].
- The concept of AI agents is gaining traction, with predictions that by mid-2025, nearly all large model products will incorporate agent functionalities [16][18].
- There is speculation about the potential reduction in price barriers for AI agents if DeepSeek leads this initiative [19].
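
To make the size of the benchmark jumps reported under Performance Metrics easier to compare, the point gains can be worked out with a few lines of Python. This is only an illustrative sketch: the scores are copied from the figures quoted above, and the names `SCORES` and `gains` are hypothetical helpers introduced here, not part of any DeepSeek tooling.

```python
# Scores as quoted in the summary above, ordered as (V3.1, V3, R1).
SCORES = {
    "SWE-bench": (66.0, 45.4, 44.6),
    "SWE-bench Multilingual": (54.5, 29.3, 30.5),
    "Terminal-Bench": (31.3, 13.3, 5.7),
}


def gains(scores):
    """Return the point gain of V3.1 over V3 and over R1 for each benchmark."""
    result = {}
    for name, (v31, v3, r1) in scores.items():
        result[name] = {
            # Round to one decimal to match the precision of the reported scores.
            "vs_v3": round(v31 - v3, 1),
            "vs_r1": round(v31 - r1, 1),
        }
    return result


if __name__ == "__main__":
    for name, g in gains(SCORES).items():
        print(f"{name}: +{g['vs_v3']} points over V3, +{g['vs_r1']} points over R1")
```

On these figures, the gains over V3 are 20.6 points on SWE-bench, 25.2 on SWE-bench Multilingual, and 18.0 on Terminal-Bench. The largest gain over R1 is on Terminal-Bench, at 25.6 points.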