Core Viewpoint
- The Deepseek R1 update strengthens the model's deep thinking capabilities and places it alongside top international models such as OpenAI-o3 and Gemini-2.5-Pro-0506. This is expected to accelerate growth in domestic computing power demand and the deployment of edge models [1][2].

Summary by Sections

Performance Improvement
- Deepseek R1-0528 iterates on performance through improved training methods. It shows significant gains in deep thinking across various benchmarks and closely matches leading international models [3].
- The distilled model, Deepseek-R1-0528-Qwen3-8B, performs strongly in mathematical testing, ranking just below Deepseek-R1-0528 and comparable to Qwen3-235B [3].
- The updated model reduces hallucination rates by approximately 45-50% in tasks such as rewriting and reading comprehension. It is also optimized for different writing styles, which lets it generate more structured long-form content [3].

Commercialization Potential
- Better deep thinking and writing, together with lower hallucination rates, are expected to improve user experience. That could raise user penetration and daily usage frequency, which in turn would drive growth in the domestic computing power industry [4].
- The strong performance of the distilled model is expected to accelerate the deployment of large models on edge devices such as smartphones, PCs, and smart glasses, raising the intelligence level of these devices [4].

Catalyst
- The iterative upgrade of Deepseek's large model performance serves as a catalyst for further advancements in the field [5].
Guotai Haitong | Electronics: Deepseek R1 Update Accelerates the Expansion of Commercial Scenarios
Guotai Haitong Securities Research · 2025-06-02 12:31