全球前二、国内第一!钉钉AI重大技术突破 DeepResearch国际权威测评超越OpenAI、Claude
智通财经网·2025-11-12 04:09

Core Insights - Dingtalk-DeepResearch, developed by the DingTalk team, achieved a significant milestone by ranking second globally and first domestically in the DeepResearch Bench test with a score of 48.49, surpassing major systems like OpenAI and Claude [1][6]. Group 1: Technological Advancements - The Dingtalk-DeepResearch system represents a dual breakthrough in achieving international benchmarks and practical application, marking a significant advancement for Chinese enterprise-level AI technology [6]. - The system integrates a multi-agent deep research framework designed for real enterprise scenarios, effectively combining deep research generation, heterogeneous table parsing, and multi-modal report generation [7]. - It features a three-layer architecture that supports parallel processing and multi-stage reasoning for complex tasks, such as automatically parsing and converting intricate factory production tables into insightful analysis reports [7]. Group 2: Continuous Learning and Adaptation - Dingtalk-DeepResearch employs an online learning mechanism that allows the AI to evolve continuously, adapting to dynamic enterprise scenarios without manual intervention [8]. - The system can autonomously learn and remember user preferences regarding report formats and styles, enhancing its output to align with user needs over time [8]. Group 3: Quality Assurance and Optimization - The Dingtalk-DeepResearch includes the DingAutoEvaluator system, which conducts multi-dimensional quality checks on generated reports, ensuring data accuracy, logical coherence, and compliance with tool usage standards [9]. - Any identified issues are fed back into the training process to optimize the model, creating a continuous improvement loop from generation to evaluation and optimization [9]. Group 4: Practical Applications - The system has been successfully applied in various real business scenarios, particularly in supply chain and manufacturing, providing intelligent analysis and decision support [10]. - Dingtalk-DeepResearch can quickly analyze complex cross-departmental table data in supply chains and convert raw operational data in manufacturing into visual analysis reports for predictive maintenance [10]. - The CTO of DingTalk emphasized that the system combines adaptive optimization and multi-modal reasoning to address complex and evolving business tasks, making advanced AI technology more relevant to actual production needs [10].