开源模型升级
Search documents
DeepSeek上新,又一次“开源的巨大胜利”
第一财经· 2025-05-29 04:52
Core Viewpoint - The recent upgrade of the DeepSeek R1 model, specifically the release of DeepSeek-R1-0528, has significantly improved its coding capabilities, making it competitive with OpenAI's o3-high model [1]. Group 1: Model Performance - The DeepSeek-R1-0528 model has shown remarkable improvements in code execution and generation, achieving performance levels comparable to leading models in the Live CodeBench testing platform [1]. - In the current leaderboard, DeepSeek-R1-0528 ranks fourth with a Pass@1 score of 73.1, indicating its strong performance in coding tasks [4]. Group 2: Developer Feedback - Developers have expressed that the upgrade represents a significant victory for open-source initiatives, highlighting the model's enhanced writing capabilities and more natural output [6]. - Testing by developers has shown that the new model performs better in specific tasks, such as text recall within a 32K context, although performance declines in a 60K context [6]. Group 3: Future Expectations - There is anticipation for the next version, R2, with developers hoping for improvements in context length and multimodal capabilities, which are crucial for practical applications [7]. - The industry speculates that DeepSeek's approach to versioning may differ from competitors, focusing on training data adjustments rather than structural updates [7].