Core Viewpoint
- The article surveys the competitive landscape of coding models and reports that the new version of DeepSeek's R1 (DeepSeek-R1-0528) has surpassed Claude Opus 4 in web programming, which it reads as a shift in which models dominate coding in the AI space [1][2].

Group 1: Model Performance
- DeepSeek-R1-0528 is reported with a score of 73.4 on coding tasks and a fourth-place overall ranking, while Claude Opus 4 is reported with a score of 1418 and a sixth-place ranking [4][27].
- By category, DeepSeek-R1 ranked fourth on difficult prompts and fifth in mathematics among open-source models [27][28].

Group 2: User Experience
- DeepSeek-R1 is described as more user-friendly than Claude for domestic users because it is free and easily accessible [23][24].
- The model's coding capabilities have improved significantly, although there is still room for further improvement [23].

Group 3: Additional Achievements
- DeepSeek-R1 was recognized as the best open-source text model released under the MIT license, ranking sixth overall in the coding model arena [25][26].
- The article also mentions a new model, Kimi-Dev, which achieved a state-of-the-art score of 60.4% on open-source coding benchmarks, outperforming DeepSeek-R1 [29][30].

Crowd-Tested Web Programming Ranking: DeepSeek-R1 Overtakes Claude 4 to Take the Global Top Spot
量子位 (QbitAI) · 2025-06-17 07:41