Crushing Google! DeepSeek Open-Sources the First "Math Olympiad Gold Medal" AI
Ge Long Hui · 2025-11-28 07:09

Core Insights
- DeepSeek has launched a new model, DeepSeekMath-V2, which is the first open-source model to reach the International Mathematical Olympiad (IMO) gold medal level [2][4]
- The model has shown strong performance across several benchmarks, outperforming Google's Gemini DeepThink series in some areas [2][4]

Performance Metrics
- In the Basic benchmark subset, DeepSeekMath-V2 scored nearly 99%, well above Gemini DeepThink's 89% [4]
- In the Advanced subset, DeepSeekMath-V2 scored 61.9%, slightly below Gemini DeepThink's 65.7%, which still indicates competitive performance [4]
- The model achieved gold medal level in IMO 2025 by solving 5 out of 6 problems. It also reached gold level in CMO 2024 and scored 118 in Putnam 2024, close to the maximum score of 120 [4][7]

Technological Advancements
- DeepSeekMath-V2 introduces a self-verifying mathematical reasoning approach, marking a significant milestone in AI mathematical reasoning [10]
- The model features a new training mechanism that includes:
  1. A reliable verifier that checks each step of theorem proofs for logical consistency [10]
  2. A generator that learns to self-improve by identifying and correcting issues during the proof generation process [11]
  3. An evolving verification capability that adapts as the generator improves, focusing on difficult-to-verify proofs for further training [11]

Industry Impact
- The release of DeepSeekMath-V2 is seen as a strategic move in a competitive landscape, coinciding with releases from other major players like OpenAI and Google [10]
- Because the model is open-sourced under the Apache 2.0 license, developers worldwide can explore and fine-tune the gold medal-level model, breaking the monopoly of closed-source models in top-tier mathematical reasoning [10]
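
The verifier-and-generator loop described under Technological Advancements can be illustrated with a toy sketch. This is not DeepSeek's code or training procedure: the names `Review`, `verify`, `refine`, and `self_verify_loop` are hypothetical, and a "proof" here is just a list of addition claims so that every step can be checked mechanically. The sketch only shows the general shape of "check every step, then repair what the verifier flags"; it does not model the third component, the verifier evolving alongside the generator.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical toy representation: each proof step claims that a + b == c.
Step = Tuple[int, int, int]


@dataclass
class Review:
    flagged: List[int]  # indices of the steps the verifier rejects
    score: float        # fraction of steps the verifier accepts


def verify(proof: List[Step]) -> Review:
    """Toy verifier: checks every step, not only the final answer."""
    flagged = [i for i, (a, b, c) in enumerate(proof) if a + b != c]
    score = 1.0 - len(flagged) / len(proof) if proof else 0.0
    return Review(flagged, score)


def refine(proof: List[Step], review: Review) -> List[Step]:
    """Toy generator update: repair the first flagged step only."""
    fixed = list(proof)
    if review.flagged:
        i = review.flagged[0]
        a, b, _ = fixed[i]
        fixed[i] = (a, b, a + b)
    return fixed


def self_verify_loop(proof: List[Step], max_rounds: int = 3):
    """Alternate verification and repair until the verifier accepts
    every step or the round budget runs out."""
    history = []
    for _ in range(max_rounds):
        review = verify(proof)
        history.append(review.score)
        if not review.flagged:
            break
        proof = refine(proof, review)
    return proof, history


# A draft with two faulty steps (indices 1 and 2).
draft = [(1, 2, 3), (2, 2, 5), (3, 4, 8)]
final_proof, scores = self_verify_loop(draft)
print(final_proof)
print(scores)
```

In this toy setting the verifier's score rises each round as flagged steps are repaired. In a real system both the verifier and the generator would be learned models rather than hand-written rules, and the verifier's judgments would themselves be uncertain.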