Seek .(SKLTY)
Crushing Google! DeepSeek open-sources the first "Olympiad gold medal" AI
Ge Long Hui · 2025-11-28 07:09
Core Insights
- DeepSeek has launched a new model, DeepSeekMath-V2, which is the first open-source model to reach the International Mathematical Olympiad (IMO) gold medal level [2][4]
- The model has shown superior performance in various benchmarks, outperforming Google's Gemini DeepThink series in some areas [2][4]

Performance Metrics
- In the Basic benchmark, DeepSeekMath-V2 scored nearly 99%, significantly higher than Gemini DeepThink's 89% [4]
- In the Advanced subset, Math-V2 scored 61.9%, slightly lower than Gemini DeepThink's 65.7%, indicating competitive performance [4]
- The model achieved gold medal level in IMO 2025 by solving 5 out of 6 problems, and also reached gold level in CMO 2024 and scored 118 in Putnam 2024, close to the maximum score of 120 [4][7]

Technological Advancements
- DeepSeekMath-V2 introduces a self-verifying mathematical reasoning approach, marking a significant milestone in AI mathematical reasoning [10]
- The model features a new training mechanism that includes:
  1. A reliable verifier that checks each step of theorem proofs for logical consistency [10]
  2. A generator that learns to self-improve by identifying and correcting issues during the proof generation process [11]
  3. An evolving verification capability that adapts as the generator improves, focusing on difficult-to-verify proofs for further training [11]

Industry Impact
- The release of DeepSeekMath-V2 is seen as a strategic move in a competitive landscape, coinciding with releases from other major players like OpenAI and Google [10]
- The open-source nature of the model under the Apache 2.0 license allows global developers to explore and fine-tune the gold medal-level model, breaking the monopoly of closed-source models in top-tier mathematical reasoning [10]
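The verifier and generator roles described above can be illustrated with a toy loop. This is only a sketch under loud assumptions: a "proof" here is a list of integer additions, the verifier flags steps whose claimed sum is wrong, and the generator simply recomputes flagged steps. The names `Step`, `verify`, `refine` and `generate_with_verification` are hypothetical and are not taken from DeepSeek's code or paper.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Step:
    terms: tuple[int, ...]  # numbers being added in this toy "proof step"
    claimed: int            # the sum the step asserts


def verify(proof: list[Step]) -> list[int]:
    """Toy verifier: return the indices of steps whose claimed sum is wrong."""
    return [i for i, step in enumerate(proof) if sum(step.terms) != step.claimed]


def refine(proof: list[Step], flagged: list[int]) -> list[Step]:
    """Toy generator update: recompute only the steps the verifier flagged."""
    fixed = list(proof)
    for i in flagged:
        fixed[i] = replace(proof[i], claimed=sum(proof[i].terms))
    return fixed


def generate_with_verification(draft: list[Step], max_rounds: int = 3) -> tuple[list[Step], int]:
    """Alternate verification and refinement until no step is flagged.

    Returns the final proof and the number of refinement rounds used.
    """
    proof = draft
    for round_no in range(max_rounds):
        flagged = verify(proof)
        if not flagged:
            return proof, round_no
        proof = refine(proof, flagged)
    return proof, max_rounds


# The second step is deliberately wrong (5 + 4 is not 10).
draft = [Step((2, 3), 5), Step((5, 4), 10), Step((9, 1), 10)]
final, rounds = generate_with_verification(draft)
```

In this toy setting the verifier is exact, so one refinement round is enough. A learned verifier can itself be wrong, which is presumably why the summary stresses making the verifier reliable and continuing to train it on difficult-to-verify proofs.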
Not just a "test-taker"! DeepSeek's latest model breaks through the limits of mathematical reasoning and outperforms Gemini DeepThink in some areas
Tai Mei Ti APP · 2025-11-28 05:45
Core Insights
- DeepSeek has released its latest mathematical model, DeepSeek Math-V2, which has generated significant excitement in the AI community due to its self-verifying capabilities in deep reasoning, particularly in mathematics [1][2].

Model Performance
- Math-V2 demonstrates strong theorem-proving abilities, distinguishing itself from previous models that merely solved problems without rigorous reasoning [2].
- The model achieved gold medal-level results on IMO 2025 and CMO 2024, and scored 118 out of 120 on Putnam 2024 [2].

Benchmarking Results
- In the IMO-ProofBench evaluation, Math-V2 scored 99%, outperforming Google's Gemini Deep Think (89%) and GPT-5 (59%) [3].
- In advanced testing, Math-V2 scored 61.9%, just behind Gemini Deep Think's 65.7% [3].

Community Impact
- The release of Math-V2 has sparked discussions across social media platforms and communities, highlighting its potential to automate verification-heavy tasks in programming languages [5][8].
- Experts in the AI field have praised DeepSeek's return and the significance of Math-V2, indicating a shift from the "chatbot" era to the "reasoner" era in AI development [8][9].
New breakthrough! DeepSeek launches a new model; Sci-Tech AI ETF (588790) fluctuates in positive territory
Xin Lang Cai Jing · 2025-11-28 03:15
Group 1: Market Performance
- The Shanghai Stock Exchange Sci-Tech Innovation Board Artificial Intelligence Index increased by 0.22% as of November 28, 2025, with notable gains from companies such as Zhongke Xingtou (up 4.13%) and Hongsoft Technology (up 3.00%) [1]
- The Sci-Tech AI ETF (588790) showed a mixed performance, with a recent price of 0.76 yuan and a cumulative increase of 1.75% over the past week as of November 27, 2025 [1]
- The trading volume for the Sci-Tech AI ETF was 1.17 billion yuan, with a turnover rate of 1.96% [1]

Group 2: AI Industry Development
- China's generative artificial intelligence is in a rapid development phase, with improving fundamentals for AI-related companies across both software and hardware sectors [2]
- The demand for AI applications continues to grow, and domestic computing power is rising quickly, indicating a clear development trend in China's AI sector [2]
- By 2026, the focus will shift towards the application and innovation of AI, as the large model market begins to consolidate [2]

Group 3: Fund Performance and Composition
- The Sci-Tech AI ETF has grown by 2.848 billion yuan in scale over the past six months [3]
- The number of fund shares outstanding increased by 318 million this month [3]
- The fund's latest daily net outflow was 110 million yuan, but 11 of the past 19 trading days saw net inflows, totaling 249 million yuan [3]
- The index tracks 30 major companies in the AI sector, with the top ten stocks accounting for 70.92% of the index [3]
DeepSeek launches a new model; some Moore Threads IPO shares left unsubscribed | Tech Weathervane
21 Shi Ji Jing Ji Bao Dao · 2025-11-28 02:05
Group 1: Technology Developments
- DeepSeek launched a new mathematical reasoning model, DeepSeekMath-V2, which achieved gold medal levels in international competitions and demonstrated the feasibility of self-verifying reasoning paths [2]
- Quark AI glasses were released by Alibaba, featuring dual-chip design and various models priced from 1,899 to 3,799 yuan [4]
- Tianfu Communication announced its mass production capabilities for 800G and 1.6T high-speed optical engines, with ongoing investments in R&D for performance optimization [6]

Group 2: Corporate Restructuring and Workforce Changes
- HP announced a global layoff plan affecting 4,000 to 6,000 employees, approximately 10% of its workforce, to streamline operations and enhance productivity through AI [3]
- ByteDance is in negotiations to sell its subsidiary, Shanghai Mutong Technology, to Saudi Arabia's Savvy Games Group, with the deal potentially valued at 14.5 billion yuan [5]

Group 3: Market and Investment Activities
- Dongxin Co. reported a strategic cooperation framework agreement with a leading domestic cloud service provider, focusing on various technological solutions [12]
- Haichang New Materials plans to acquire a 51% stake in Shenzhen Xinwei Communications for approximately 234.6 million yuan, gaining control over the company [15]
- Wuwen Chip has completed nearly 500 million yuan in A+ round financing, attracting significant investment from both state-owned and market-oriented funds [16]

Group 4: Regulatory and Industry Insights
- The National Development and Reform Commission addressed the rapid growth and potential "bubble" in the humanoid robot industry, noting over 150 companies in the sector with a growth rate exceeding 50% [7]
- The Chinese Electronic Technology Standardization Institute clarified that existing 3C certified power banks will not be affected by new safety standards, easing consumer concerns [10]
DeepSeek launches a new model; some Moore Threads IPO shares left unsubscribed | Fresh Morning Tech
21 Shi Ji Jing Ji Bao Dao · 2025-11-28 01:56
Group 1: Technology Developments
- DeepSeek launched a new mathematical reasoning model, DeepSeekMath-V2, which achieved gold medal levels in major math competitions, showcasing the feasibility of self-verifying reasoning paths [2]
- Quark AI glasses were released by Alibaba, featuring advanced hardware and dual operating systems, with prices starting from 1,899 yuan [4]
- Tianfu Communication announced its capability for mass production of 800G and 1.6T high-speed optical engines, with ongoing investments in R&D for performance optimization [6]

Group 2: Corporate Restructuring and Acquisitions
- HP announced a global layoff plan affecting 4,000 to 6,000 employees, approximately 10% of its workforce, to streamline operations and enhance productivity through AI [3]
- ByteDance is in negotiations to sell its subsidiary, Shanghai Mutong Technology, to Saudi Arabia's Savvy Games Group, with the deal's outcome uncertain [5]
- Haichang New Materials plans to acquire a 51% stake in Shenzhen Xinwei Communications for approximately 234.6 million yuan, gaining control over the company [15]

Group 3: Market Trends and Responses
- The National Development and Reform Commission highlighted the rapid growth of humanoid robots, which are expanding at over 50% annually, while cautioning against market saturation and product redundancy [7]
- Hongmeng Zhixing reported a surge in online attacks against the company, asserting that it will pursue legal action against those spreading false information [8]
- The Chinese Electronic Technology Standardization Institute clarified that existing 3C certified power banks will remain valid despite rumors of new standards coming into effect [10]

Group 4: Financial Activities
- Muxi Co. announced its IPO plans, aiming to raise 3.904 billion yuan, potentially becoming the second domestic GPU company listed on the A-share market [13]
- Moore Threads reported that some shares in its IPO were abandoned by investors, with 29,302 shares worth approximately 3.3486 million yuan left unsubscribed [14]
- Wuwen Chip completed nearly 500 million yuan in A+ round financing, attracting investments from various state-owned and market-oriented funds [16]
GPT-5 in trouble: DeepSeek open-sources the world's first Olympiad gold medal AI, going head-to-head with Google
36 Ke · 2025-11-28 01:55
Core Insights
- DeepSeek has launched its new model, DeepSeekMath-V2, which reached gold medal level on IMO 2025, showcasing capabilities that rival or even surpass Google's IMO gold medal model [1][3][22]
- This is the first open-source IMO gold medal model, marking a significant advancement in AI [1][24]

Model Performance
- DeepSeekMath-V2 demonstrated strong theorem-proving abilities, solving 5 out of 6 problems in the IMO 2025, achieving a gold medal level [3][4]
- In the CMO 2024, it also reached gold medal level, and in the Putnam 2024, it scored 118 out of 120, surpassing the highest human score of 90 [3][4]

Comparison with Competitors
- DeepSeekMath-V2 outperformed Google's Gemini Deep Think in the ProofBench-Basic tests and closely followed it in the ProofBench-Advanced tests [5][22]
- The model's performance indicates a significant leap in capabilities compared to existing models like OpenAI's GPT-5 and Gemini 2.5-Pro [26][28]

Self-Verification Mechanism
- A key breakthrough of DeepSeekMath-V2 is its self-verification capability, allowing it to self-assess and improve its proofs [12][36]
- The model employs a "three-in-one" system consisting of a Generator, Verifier, and Meta-Verifier to enhance its proof quality [15][16]

Training Methodology
- The training process involved a high-compute search strategy, generating numerous candidate proofs and validating them rigorously [32][35]
- The model's ability to self-correct and refine its proofs through multiple iterations significantly improved its performance [38]

Implications for AI Development
- The success of DeepSeekMath-V2 suggests a shift in AI from merely mimicking human responses to emulating human thought processes, emphasizing the importance of self-reflection in achieving advanced AI [36][37]
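The "generate numerous candidate proofs and validate them" idea under Training Methodology above resembles a sample-then-filter search. The sketch below is a toy illustration only: candidates are integer additions that are sometimes off by one, and the "verifier" is an exact check. `sample_candidate`, `verifier_accepts` and `search` are hypothetical names, not DeepSeek's implementation.

```python
import random

Candidate = tuple[int, int, int]  # (a, b, claimed value of a + b)


def sample_candidate(rng: random.Random) -> Candidate:
    """Toy generator: propose a sum, sometimes with an off-by-one error."""
    a, b = rng.randint(1, 9), rng.randint(1, 9)
    error = rng.choice([0, 0, 1])  # roughly one in three candidates is wrong
    return a, b, a + b + error


def verifier_accepts(candidate: Candidate) -> bool:
    """Toy verifier: accept only candidates whose claimed sum is correct."""
    a, b, claimed = candidate
    return a + b == claimed


def search(n: int, seed: int = 0) -> tuple[list[Candidate], list[Candidate]]:
    """Sample n candidates and keep the ones the verifier accepts."""
    rng = random.Random(seed)
    candidates = [sample_candidate(rng) for _ in range(n)]
    accepted = [c for c in candidates if verifier_accepts(c)]
    return candidates, accepted


candidates, accepted = search(n=16)
```

Spending more compute here simply means raising `n`: the more candidates are sampled, the more likely at least one survives verification, provided the verifier itself can be trusted.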
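The first open-source model to win a math Olympiad gold medal! DeepSeek's new model draws praise online: publishing the technical documentation is remarkable!
Hua Er Jie Jian Wen · 2025-11-28 00:46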
Core Insights
- DeepSeek has launched its latest open-source mathematical reasoning model, DeepSeekMath-V2, which has achieved gold medal level on the highly competitive International Mathematical Olympiad (IMO) 2025, marking a significant breakthrough in open-source AI capabilities in complex reasoning [1][3].

Group 1: Model Performance
- DeepSeekMath-V2 solved 5 out of 6 problems in the simulated IMO 2025, becoming the first open-source model to reach gold medal level in such a prestigious competition [1].
- The model also demonstrated top-tier performance in other challenging mathematics competitions, reaching gold medal level in the Chinese Mathematical Olympiad (CMO) and scoring 118 out of 120 in the Putnam Mathematics Competition 2024, surpassing the highest human score of 90 [3].

Group 2: Innovation in Training Framework
- The model employs an innovative self-verification training framework, which includes a dedicated verifier that assesses the quality of the proof process rather than just the correctness of the final answer [2][11].
- To prevent overfitting, DeepSeek has implemented a dynamic evolution strategy that scales up verification compute and automatically labels difficult proofs, ensuring that the verifier and generator evolve in sync [12].

Group 3: Open Source and Community Impact
- DeepSeekMath-V2's weights are publicly available under the Apache 2.0 license, allowing researchers and developers to download and utilize the model freely, which is seen as a significant step towards the democratization of AI [2][4].
- The release has sparked discussions about the potential impact of open-source models on the commercial viability of closed-source products, particularly concerning major players like NVIDIA [2].
DeepSeek's new release: "Olympiad gold medal level"
Di Yi Cai Jing · 2025-11-28 00:40
Core Insights
- DeepSeek has released a new model, DeepSeek-Math-V2, which is the first open-source model to achieve International Mathematical Olympiad (IMO) gold medal level performance [3][5]
- The model outperforms Google's Gemini DeepThink in certain benchmarks, showcasing its capabilities in mathematical reasoning [5][9]

Performance Metrics
- DeepSeek-Math-V2 achieved 83.3% in IMO 2025 and 73.8% in CMO 2024, while scoring 98.3% in the Putnam 2024 competition [4]
- In the Basic benchmark, Math-V2 scored nearly 99%, significantly higher than Gemini DeepThink's 89%, but in the Advanced subset, Math-V2 scored 61.9%, slightly lower than Gemini's 65.7% [5]

Research Implications
- The paper titled "DeepSeek Math-V2: Towards Self-Validating Mathematical Reasoning" emphasizes the importance of rigorous mathematical proof processes rather than just correct answers [8]
- DeepSeek advocates for self-validation in mathematical reasoning to enhance the development of more powerful AI systems [8]

Industry Reactions
- The release of Math-V2 has generated excitement in the industry, with comments highlighting its unexpected success over Google's model [9]
- The competitive landscape is evolving, with other major players like OpenAI and Google releasing new models, raising anticipation for DeepSeek's next moves [10]
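The IMO and Putnam percentages above line up with the raw results reported elsewhere in this feed (5 of 6 IMO problems solved, 118 of 120 Putnam points). A quick check, assuming the five solved IMO problems earned the full 7 points each and the sixth earned none (that split is an assumption, not something the summaries state):

```python
def percent(points: int, max_points: int) -> float:
    """Score as a percentage of the maximum, rounded to one decimal place."""
    return round(100 * points / max_points, 1)


IMO_POINTS_PER_PROBLEM = 7  # each IMO problem is marked out of 7

# Assumption: five problems at full marks, one at zero.
imo_2025 = percent(5 * IMO_POINTS_PER_PROBLEM, 6 * IMO_POINTS_PER_PROBLEM)

# 118 points out of the Putnam maximum of 120, as reported in the summaries.
putnam_2024 = percent(118, 120)

print(imo_2025, putnam_2024)  # 83.3 98.3
```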
DeepSeek's new release! The first model at Olympiad gold medal level is here
Di Yi Cai Jing · 2025-11-28 00:22
Core Insights
- DeepSeek has released a new model, DeepSeek-Math-V2, which is the first open-source model to achieve International Mathematical Olympiad (IMO) gold medal level performance [1]
- The model outperforms Google's Gemini DeepThink in certain benchmarks, showcasing its capabilities in mathematical reasoning [1][5]

Performance Metrics
- DeepSeek-Math-V2 achieved 83.3% on IMO 2025 problems and 73.8% on CMO 2024 problems [4]
- In the Putnam 2024 competition, it scored 98.3%, demonstrating exceptional performance [4]
- On the Basic benchmark, Math-V2 scored nearly 99%, while Gemini DeepThink scored 89% [5]
- In the Advanced subset, Math-V2 scored 61.9%, slightly below Gemini DeepThink's 65.7% [5]

Research and Development Focus
- The model emphasizes self-verification in mathematical reasoning, moving from a result-oriented approach to a process-oriented one [8]
- DeepSeek aims to enhance the rigor and completeness of mathematical proofs, which is crucial for solving open problems [8]
- The research indicates that self-verifying mathematical reasoning is a viable direction for developing more powerful AI systems [8]

Industry Reaction
- The release has generated significant interest, with comments highlighting DeepSeek's competitive edge over Google's model [9]
- The industry is keenly awaiting further developments from DeepSeek, especially regarding their flagship model updates [10]
DeepSeek makes a strong return, open-sourcing an IMO gold-medal-level math model
36 Ke · 2025-11-27 23:34
Core Insights
- DeepSeek has introduced a new model, DeepSeek-Math-V2, which aims to enhance self-verifiable mathematical reasoning capabilities in AI [1][2]
- The model reportedly outperforms Gemini DeepThink, achieving gold medal-level performance in mathematical competitions [3]

Model Development
- DeepSeek-Math-V2 follows the earlier DeepSeek-Math-7b, which used 7 billion parameters to approach the performance of GPT-4 and Gemini-Ultra [4]
- The new model addresses limitations in current AI mathematical reasoning by focusing on the rigor of the reasoning process rather than just the accuracy of final answers [5][6]

Self-Verification Mechanism
- The model incorporates a self-verification system that includes a proof verification component, a meta-verification layer, and a self-evaluating generator [7][11]
- The verification system is designed to assess the reasoning process in detail, providing feedback similar to that of human experts [8][10]

Training and Evaluation
- The training process involves an honest reward mechanism, where the model is incentivized to self-assess its performance and identify its own errors [11][15]
- The model has demonstrated impressive results in various mathematical competitions, achieving high scores in IMO 2025, CMO 2024, and Putnam 2024 [16][17]

Performance Metrics
- In the IMO-ProofBench benchmark, DeepSeek-Math-V2 achieved nearly 99% accuracy in basic problems and performed competitively in advanced problems [18]
- The mutual improvement cycle between the verifier and generator significantly reduces the occurrence of hallucinations in large models [20]

Future Implications
- DeepSeek emphasizes that self-verifiable mathematical reasoning represents a promising research direction that could lead to the development of more powerful mathematical AI systems [20]
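One way to read the "honest reward mechanism" above is as a reward that pays for proof quality and also for a self-assessment that agrees with the verifier. The sketch below illustrates that reading only. The weights `W_PROOF` and `W_SELF` and the function `honest_reward` are hypothetical and are not taken from DeepSeek's paper; both scores are assumed to lie in [0, 1].

```python
W_PROOF = 0.75  # hypothetical weight on proof quality as judged by the verifier
W_SELF = 0.25   # hypothetical weight on self-assessment accuracy


def honest_reward(verifier_score: float, self_score: float) -> float:
    """Combine proof quality with how closely the self-assessment matches the verifier.

    Both inputs are assumed to lie in [0, 1].
    """
    agreement = 1.0 - abs(self_score - verifier_score)
    return W_PROOF * verifier_score + W_SELF * agreement


# Same flawed proof (verifier score 0.5), two different self-assessments.
honest = honest_reward(verifier_score=0.5, self_score=0.5)
overconfident = honest_reward(verifier_score=0.5, self_score=1.0)

print(honest, overconfident)  # 0.625 0.5
```

Under this toy scheme a model that admits a flaw in its own proof earns more than one that claims the same proof is perfect, which is the incentive the summary describes.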