Workflow
DeepSeek
icon
Search documents
第1个获得数学奥赛金牌的开源模型!DeepSeek新模型获网友盛赞:公开技术文件,了不起!
Hua Er Jie Jian Wen· 2025-11-28 00:46
Core Insights - DeepSeek has launched its latest open-source mathematical reasoning model, DeepSeekMath-V2, which has achieved gold medal status in the highly competitive International Mathematical Olympiad (IMO) 2025, marking a significant breakthrough in open-source AI capabilities in complex reasoning [1][3]. Group 1: Model Performance - DeepSeekMath-V2 solved 5 out of 6 problems in the simulated IMO 2025, becoming the first open-source model to achieve gold medal status in such a prestigious competition [1]. - The model also demonstrated top-tier performance in other challenging mathematics competitions, including achieving gold medal status in the Chinese Mathematical Olympiad (CMO) and scoring 118 out of 120 in the Putnam Mathematics Competition 2024, surpassing the highest human score of 90 [3]. Group 2: Innovation in Training Framework - The model employs an innovative self-verification training framework, which includes a dedicated verifier that assesses the quality of the proof process rather than just the correctness of the final answer [2][11]. - To prevent overfitting, DeepSeek has implemented a dynamic evolution strategy that increases computational demands and automatically labels difficult proofs, ensuring that the verifier and generator evolve in sync [12]. Group 3: Open Source and Community Impact - DeepSeekMath-V2's weights are publicly available under the Apache 2.0 license, allowing researchers and developers to download and utilize the model freely, which is seen as a significant step towards the democratization of AI [2][4]. - The release has sparked discussions about the potential impact of open-source models on the commercial viability of closed-source products, particularly concerning major players like NVIDIA [2].
DeepSeek上新,“奥数金牌水平”
Di Yi Cai Jing· 2025-11-28 00:40
Core Insights - DeepSeek has released a new model, DeepSeek-Math-V2, which is the first open-source model to achieve International Mathematical Olympiad (IMO) gold medal level performance [3][5] - The model outperforms Google's Gemini DeepThink in certain benchmarks, showcasing its capabilities in mathematical reasoning [5][9] Performance Metrics - DeepSeek-Math-V2 achieved 83.3% in IMO 2025 and 73.8% in CMO 2024, while scoring 98.3% in the Putnam 2024 competition [4] - In the Basic benchmark, Math-V2 scored nearly 99%, significantly higher than Gemini DeepThink's 89%, but in the Advanced subset, Math-V2 scored 61.9%, slightly lower than Gemini's 65.7% [5] Research Implications - The paper titled "DeepSeek Math-V2: Towards Self-Validating Mathematical Reasoning" emphasizes the importance of rigorous mathematical proof processes rather than just correct answers [8] - DeepSeek advocates for self-validation in mathematical reasoning to enhance the development of more powerful AI systems [8] Industry Reactions - The release of Math-V2 has generated excitement in the industry, with comments highlighting its unexpected success over Google's model [9] - The competitive landscape is evolving, with other major players like OpenAI and Google releasing new models, raising anticipation for DeepSeek's next moves [10]
DeepSeek上新,“奥数金牌水平”
第一财经· 2025-11-28 00:35
Core Viewpoint - DeepSeek has released an open-source model, DeepSeek-Math-V2, which is the first model to achieve IMO gold medal level in mathematics and outperforms Google's Gemini DeepThink in certain benchmarks [3][5]. Group 1: Model Performance - DeepSeek-Math-V2 achieved nearly 99% on the Basic benchmark, significantly outperforming Gemini DeepThink, which scored 89% [5]. - In the more challenging Advanced subset, Math-V2 scored 61.9%, slightly below Gemini DeepThink's 65.7% [5]. - The model has demonstrated gold medal-level performance in IMO 2025 and CMO 2024, and nearly perfect scores in the Putnam 2024 exam (118/120) [8]. Group 2: Research and Development Insights - DeepSeek emphasizes the importance of verifying mathematical reasoning comprehensively and rigorously, moving from a result-oriented approach to a process-oriented one [8]. - The model is designed to teach AI to review proof processes like a mathematician, enhancing its ability to solve complex mathematical proofs without human intervention [8]. Group 3: Industry Reactions and Expectations - The release of Math-V2 has generated excitement in the industry, with reactions noting that DeepSeek has surpassed expectations by defeating Google's IMO Gold model by a 10% margin [9]. - There is anticipation regarding DeepSeek's next moves, especially concerning updates to its flagship models, as the industry awaits further developments [9].
DeepSeek上新!首个奥数金牌水平的模型来了
Di Yi Cai Jing· 2025-11-28 00:22
Core Insights - DeepSeek has released a new model, DeepSeek-Math-V2, which is the first open-source model to achieve International Mathematical Olympiad (IMO) gold medal level performance [1] - The model outperforms Google's Gemini DeepThink in certain benchmarks, showcasing its capabilities in mathematical reasoning [1][5] Performance Metrics - DeepSeek-Math-V2 achieved 83.3% on IMO 2025 problems and 73.8% on CMO 2024 problems [4] - In the Putnam 2024 competition, it scored 98.3%, demonstrating exceptional performance [4] - On the Basic benchmark, Math-V2 scored nearly 99%, while Gemini DeepThink scored 89% [5] - In the Advanced subset, Math-V2 scored 61.9%, slightly below Gemini DeepThink's 65.7% [5] Research and Development Focus - The model emphasizes self-verification in mathematical reasoning, moving from a result-oriented approach to a process-oriented one [8] - DeepSeek aims to enhance the rigor and completeness of mathematical proofs, which is crucial for solving open problems [8] - The research indicates that self-verifying mathematical reasoning is a viable direction for developing more powerful AI systems [8] Industry Reaction - The release has generated significant interest, with comments highlighting DeepSeek's competitive edge over Google's model [9] - The industry is keenly awaiting further developments from DeepSeek, especially regarding their flagship model updates [10]
DeepSeek强势回归,开源IMO金牌级数学模型
3 6 Ke· 2025-11-27 23:34
Core Insights - DeepSeek has introduced a new model, DeepSeek-Math-V2, which aims to enhance self-verifiable mathematical reasoning capabilities in AI [1][2] - The model reportedly outperforms Gemini DeepThink, achieving gold medal-level performance in mathematical competitions [3] Model Development - DeepSeek-Math-V2 is based on the previous version, DeepSeek-Math-7b, which utilized 7 billion parameters to match the performance of GPT-4 and Gemini-Ultra [4] - The new model addresses limitations in current AI mathematical reasoning by focusing on the rigor of the reasoning process rather than just the accuracy of final answers [5][6] Self-Verification Mechanism - The model incorporates a self-verification system that includes a proof verification component, a meta-verification layer, and a self-evaluating generator [7][11] - The verification system is designed to assess the reasoning process in detail, providing feedback similar to human experts [8][10] Training and Evaluation - The training process involves a unique honest reward mechanism, where the model is incentivized to self-assess its performance and identify its own errors [11][15] - The model has demonstrated impressive results in various mathematical competitions, achieving high scores in IMO 2025, CMO 2024, and Putnam 2024 [16][17] Performance Metrics - In the IMO-ProofBench benchmark, DeepSeek-Math-V2 achieved nearly 99% accuracy in basic problems and performed competitively in advanced problems [18] - The model's dual improvement cycle between the verifier and generator significantly reduces the occurrence of hallucinations in large models [20] Future Implications - DeepSeek emphasizes that self-verifiable mathematical reasoning represents a promising research direction that could lead to the development of more powerful mathematical AI systems [20]
AI员工几分钟响应 跨镇街建十大万亩级园区
Nan Fang Du Shi Bao· 2025-11-27 23:11
Core Viewpoint - The modernization of urban governance in Zhongshan is being driven by technology empowerment, institutional innovation, and ecological prioritization, aiming to create a livable, resilient, and smart city that enhances operational efficiency and reduces management costs [2]. Group 1: Technology Empowerment - Zhongshan has implemented an "AI employee" in its government services, achieving an average response time of 0.8 seconds and an accuracy rate of over 80% for inquiries related to public housing [3]. - The AI service integrates with 12 departmental business systems, maintaining over 800 high-frequency service items, thus facilitating a shift from fragmented responses to systematic optimization of government services [3][4]. - A domestic government big data center has been established, aggregating over 60 billion data entries from more than 300 government systems, which supports AI model training and enables intelligent approval processes, reducing processing times to under 2 minutes for 14 high-frequency business items [4]. Group 2: Institutional Reform - Zhongshan is actively pursuing integrated reforms across various sectors, including industrial, water, land, and urban reforms, to break down barriers and enhance governance efficiency [5]. - The city has decentralized 107 permissions to town streets and has implemented a pilot program for 43 major and 77 minor permissions in specific districts, addressing the challenges faced by local governance [5]. Group 3: Land and Urban Development - Comprehensive land reform initiatives have been launched, recovering 23,600 acres of farmland and promoting a new land utilization model that emphasizes ecological preservation and efficient agricultural practices [6]. - Zhongshan is developing ten large-scale modern industrial parks across town streets to foster regional collaboration and economic integration, enhancing the overall development landscape [6]. Group 4: Environmental Governance - The city is committed to rigorous water pollution control, having laid 6,638 kilometers of pipelines and constructed 17 wastewater treatment plants, effectively eliminating black and odorous water bodies in urban areas [8]. - Zhongshan is also enhancing its urban landscape through various beautification projects, including the improvement of highway aesthetics and the establishment of a comprehensive park system that ensures green spaces are accessible to residents [9].
北京发布太空数据中心建设规划方案;国家发改委将健全具身智能准入和退出机制;XREAL联合谷歌12月发布AI眼镜——《投资早参》
Mei Ri Jing Ji Xin Wen· 2025-11-27 23:01
Important Market News - Brent crude oil for January closed up $0.21, an increase of over 0.33%, at $63.34 per barrel. WTI crude oil saw a daily increase of 1.00%, closing at $59.23 per barrel [1] - Major European stock indices closed mixed, with Germany's DAX30 up 0.31% at 23,767.56 points, the UK FTSE 100 down 0.02% at 9,689.65 points, France's CAC40 up 0.04% at 8,099.47 points, and the Euro Stoxx 50 down 0.06% at 5,652.15 points [1] Industry Insights - The Beijing Municipal Science and Technology Commission and the Zhongguancun Science City Management Committee released a plan for the construction of a space data center. The plan proposes a centralized large data center system in the 700 to 800 km dawn-dusk orbit, capable of accommodating a million-class server cluster for space-based data relay transmission and computing services [2] - The "Star Eye" space perception constellation plan was officially launched, consisting of 156 satellites aimed at creating a space information analysis platform and space management service platform [2] - The satellite internet industry is becoming a new frontier in global technology competition, with the satellite communication market currently valued at approximately 40-50 billion yuan, expected to exceed 200-400 billion yuan by 2030, with an annual compound growth rate of 10%-28% [3] - The industry is at a critical turning point from "concept validation" to "scale application," with advancements in technology, cost reductions, and expanded application scenarios expected to create a new communication pattern of "integrated space and ground, interconnected everything" over the next decade [3] - The National Development and Reform Commission plans to promote the healthy and standardized development of the embodied intelligence industry through three main approaches: establishing industry standards, accelerating core technology breakthroughs, and promoting infrastructure construction [3] - The humanoid robot industry is expected to see significant growth by 2025, driven by leading companies enhancing component performance and reducing costs, with a focus on core supply chains and application scenarios [4] - XREAL officially launched its global headquarters in Shanghai and announced a partnership with Google to develop the "Project Aura" AR glasses, which will integrate Google Gemini AI as its core [5][6]
阿维塔“递表”港股IPO;DeepSeek推出新模型丨每经早参
Mei Ri Jing Ji Xin Wen· 2025-11-27 22:19
Group 1 - The third New Quality Productivity Automotive Conference will be held from November 28 to 30, 2025 [3] - Huawei's Mate80 and Mate80 Pro series will officially go on sale on November 28 [3] - The first batch of seven dual-innovation artificial intelligence ETFs will collectively launch on November 28 [3] Group 2 - The Hong Kong fire in Tai Po has resulted in 83 fatalities as of November 28 [6] - The Hong Kong government will provide emergency relief of 10,000 HKD per household affected by the fire [6] - A total of over 600 million HKD has been pledged in donations from various enterprises and organizations for disaster relief and recovery efforts [13][14] Group 3 - The Ministry of Commerce of China held a video conference with the German Federal Minister of Economics and Energy to discuss issues related to Nexperia [6] - The Chinese government is taking targeted measures to enhance credit repair, simplifying application materials and improving efficiency [9] Group 4 - Japan plans to issue approximately 11.7 trillion JPY (about 529.9 billion CNY) in government bonds to fund a new economic stimulus plan [11] - The former President of Peru, Pedro Castillo, has been sentenced to over 11 years in prison for conspiracy to commit rebellion [11] Group 5 - Anta Sports has responded to rumors regarding a potential bid for Puma, stating it does not comment on market speculation [20] - The leadership change at Wahaha Group may lead to strategic adjustments that could impact the competitive landscape [21] Group 6 - Joy City Property has officially delisted from the Hong Kong Stock Exchange after 12 years, following a privatization plan [23] - Avita Technology has submitted its IPO application to the Hong Kong Stock Exchange, marking a significant move for a state-owned enterprise in the new energy vehicle sector [27] Group 7 - The Chinese open-source AI model download share has surpassed that of the United States, indicating a significant advancement in AI technology [31]
“北溪”爆炸案一嫌疑人至德国受审;香港大埔火灾致83人遇难;外交部:中方绝不接受日方的自说自话;阿维塔“递表”港股IPO;DeepSeek推出新模型丨每经早参
Mei Ri Jing Ji Xin Wen· 2025-11-27 22:00
Group 1 - The Hong Kong fire in Tai Po has resulted in 83 fatalities, prompting the government to provide emergency relief funds of 10,000 HKD per household and establish a 300 million HKD aid fund [6][13][14] - Over 40 companies and organizations have pledged donations exceeding 600 million HKD for rescue and recovery efforts following the fire [13][14][15][16][17] Group 2 - The Chinese Ministry of Commerce held a video conference with Germany's Federal Minister of Economics to discuss issues related to semiconductor supply chains, emphasizing the need for constructive solutions to stabilize the global semiconductor market [5][8] - The National Development and Reform Commission announced measures to enhance credit repair, including simplifying application processes and improving efficiency [9] Group 3 - Anta Sports has been rumored to consider bidding for Puma, with potential collaboration with a private equity firm, reflecting ongoing industry merger and acquisition dynamics [21] - The resignation of Zong Fuli as chairman of Wahaha Group may lead to strategic adjustments within the company, impacting the competitive landscape of the industry [22] Group 4 - Joy City Property officially delisted from the Hong Kong Stock Exchange after 12 years, as part of a privatization plan valued at approximately 2.932 billion HKD [24] - Avita Technology has submitted an IPO application to the Hong Kong Stock Exchange, marking a significant move for a state-owned enterprise in the new energy vehicle sector [28] Group 5 - The release of the white paper on China's military control and disarmament reflects the country's commitment to global security governance and multilateral arms control processes [7] - The recent increase in open-source AI model downloads from China surpassing that of the US indicates a significant advancement in China's AI technology capabilities [32]
腾讯研究院AI速递 20251128
腾讯研究院· 2025-11-27 16:21
Group 1: Google TPU Development - Google TPU was developed in 2015 to address AI computing efficiency bottlenecks, with the seventh generation TPU (codename Ironwood) expected to challenge NVIDIA's dominance by 2025 [1] - The TPU v7 single chip achieves an FP8 computing power of 4.6 petaFLOPS, and a Pod integrating 9216 chips can exceed 42.5 exaFLOPS, utilizing a 2D/3D toroidal topology combined with optical switching networks, with an annual availability of 99.999% [1] - Google's vertical integration strategy allows it to avoid expensive CUDA taxes, resulting in inference costs that are 30%-40% lower than GPU systems, with Meta considering deploying TPU in data centers by 2027 and renting computing power through Google Cloud [1] Group 2: Anthropic's New Agent Architecture - Anthropic released a dual-agent architecture solution for long-range agents, addressing memory challenges across sessions by having an initialization agent build environments and a coding agent manage incremental progress [2] - The environment management includes a feature list (200+ functional points marked), incremental progress (Git commits and progress files), and end-to-end testing (using Puppeteer browser automation) [2] - This solution is based on the Claude Agent SDK, enabling agents to maintain consistent progress across sessions, successfully completing complex tasks over hours or even days [2] Group 3: DeepSeek-Math-V2 Model - DeepSeek introduced the DeepSeek-Math-V2 model based on DeepSeek-V3.2-Exp-Base, achieving IMO gold medal-level performance, surpassing Gemini DeepThink [3] - The model innovatively incorporates a self-verification mathematical reasoning framework, including proof verifiers (scoring 0/0.5/1), meta-verification (checking the reasonableness of comments), and an honesty reward mechanism (rewarding models that honestly indicate errors) [3] - It achieved nearly 99% high scores on the Basic subset of the IMO-ProofBench benchmark and scored 118/120 in the extended tests of Putnam 2024, breaking through traditional reinforcement learning limitations [3] Group 4: Suno and Warner Music Agreement - AI music platform Suno reached a global agreement with Warner Music Group for the first "legitimate licensed AI music" framework, marking a milestone in AI music legalization [4] - Suno plans to launch a new model based on high-quality licensed music training in 2026, promising to surpass the existing v5 model, with Warner artists having the option to authorize and earn revenue [4] - Future free users will be unable to download created audio, only able to play and share, while paid users will retain download functionality but with monthly limits; Suno also acquired Warner's concert service Songkick to expand its offline ecosystem [4] Group 5: Musk's Grok 5 Challenge - Musk announced that Grok 5 will challenge the strongest League of Legends team T1 in 2026, incorporating "pure visual perception" and "human-level reaction latency" [5] - Grok 5 is expected to have 60 trillion parameters, functioning as a multimodal LLM by "reading" game instructions and "watching" match videos to build a world model, relying on logical reasoning rather than brute force [5] - The visual-action model of Grok 5 will be directly applied to Tesla's Optimus humanoid robot, using gaming team battles as a training ground to validate embodied intelligence capabilities [5] Group 6: Alibaba's Z-Image Model - Alibaba open-sourced the 6 billion parameter image generation model Z-Image, which includes three main versions: Z-Image-Turbo (achieving mainstream competitor performance in 8 steps), Z-Image-Base (non-distilled base model), and Z-Image-Edit (image editing version) [7] - Z-Image-Turbo achieves sub-second inference speed on enterprise-level H800 GPUs and can easily run on consumer devices with 16GB memory, excelling in photo-realistic generation and bilingual text rendering [7] - The model employs a scalable single-stream DiT (S3-DiT) architecture, maximizing parameter utilization by concatenating text, visual semantic tokens, and image VAE tokens into a unified input stream [7] Group 7: Wukong AI Infrastructure Financing - Wukong AI Infrastructure completed nearly 500 million yuan in A+ round financing, led by Zhuhai Technology Group and Foton Capital, accumulating nearly 1.5 billion yuan in funding over 2.5 years [8] - Wukong AI Cloud achieved cross-brand chip mixed training with a maximum computing power utilization rate of 97.6%, managing over 25,000 P of computing power across 53 data centers in 26 cities nationwide [8] - The company launched the Wukong Tianquan model (3B cost, 7B memory requirement achieving 21B-level intelligence) and the Wukong Kaiyang inference acceleration engine (3x latency reduction, 40% energy savings), aiming to build an Agentic Infra [8] Group 8: Tsinghua University's AI Education Guidelines - Tsinghua University officially released the "Guidelines for AI Education Applications," proposing five core principles: "subject responsibility," "compliance and integrity," "data security," "prudent thinking," and "fairness and inclusiveness" [9] - The guidelines explicitly prohibit the direct submission of AI-generated content as academic results and forbid using AI to replace academic training or write papers, requiring teachers to be responsible for AI-generated teaching content [9] - Tsinghua has integrated AI teaching practices into over 390 courses and developed a "three-layer decoupling architecture" and a fully functional intelligent companion "Qing Xiao Da," completing the guidelines after two years of research across 25 global universities [9] Group 9: US Genesis Mission - The US initiated the "Genesis Mission" as an AI Manhattan Project, aiming to train foundational scientific models and create research intelligent agents to deeply embed AI in the entire research process [10] - The Deputy Secretary of Science at the Department of Energy emphasized that the value of AI lies in generating verifiable results rather than merely summarizing, requiring mobilization of national laboratories, enterprises, and top universities [11] - A concurrent editorial in "Nature" proposed a "neuro-symbolic AI" approach, combining statistical learning of large models with symbolic reasoning and planning modules, potentially key to achieving human-level intelligence [11]