DeepSeek's Minor Version Is a Major Upgrade: New R1 Model's Coding Ability Rivals OpenAI o3
Di Yi Cai Jing · 2025-05-29 03:04
Core Insights
- DeepSeek has released a minor version upgrade of its R1 model, named DeepSeek-R1-0528. Its coding capabilities have improved significantly and now nearly match those of OpenAI's o3-high model [1][5]
- Developers report better results on writing tasks, with output that reads more naturally and is better formatted than in previous versions [7]
- Context recall has improved for contexts up to 32K, but performance declines at 60K [7][8]

Performance Metrics
- On the LiveCodeBench benchmark, DeepSeek-R1-0528 achieved a Pass@1 score of 73.1, ranking fourth among the models tested [3]
- The top three models in the same test were o4-mini (High) with 80.2, o3 (High) with 75.8, and o4-mini (Medium) with 74.2 [3]

Developer Feedback
- Developers have described the upgrade as a significant victory for open source [4]
- Some developers ran their own tests comparing DeepSeek-R1 with Claude-4 and found R1 superior in certain respects, such as the visual effects of a simulated collision [5]
- Developers are looking ahead to the next major version, R2, and hope it will bring improvements in context length and multimodal capabilities [8]
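
For readers unfamiliar with the Pass@1 figure cited under Performance Metrics, the following is a minimal sketch of how such a score is commonly computed. It assumes the widely used unbiased pass@k estimator; the article does not describe how LiveCodeBench itself calculates the number, and the function names and sample data below are hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n sampled solutions, c of them correct.

    Assumption: the standard unbiased estimator 1 - C(n-c, k) / C(n, k).
    For k = 1 this reduces to the fraction of correct samples, c / n.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: any draw of k contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int = 1) -> float:
    """Average pass@k over all problems, expressed as a percentage."""
    scores = [pass_at_k(n, c, k) for n, c in results]
    return 100.0 * sum(scores) / len(scores)


# Hypothetical data: (samples drawn, samples correct) for four problems.
results = [(10, 10), (10, 5), (10, 0), (10, 7)]
print(round(benchmark_pass_at_k(results, k=1), 1))  # 55.0
```

Under this definition, a Pass@1 of 73.1 would mean that a single sampled solution passes all tests for roughly 73 percent of the benchmark problems on average.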