Seek .(SKLTY)
Search documents
OpenAI危,DeepSeek放大招:追平谷歌最强,手撕GPT-5 High
3 6 Ke· 2025-12-02 00:56
Core Insights - DeepSeek has officially released the V3.2 version, which significantly outperforms GPT-5 High and is on par with Google's Gemini-3.0 Pro in various reasoning benchmarks [1][4][9] - The new model has achieved four international competition gold medal-level results, showcasing its advanced capabilities [2][5] - DeepSeek V3.2 incorporates a unique DSA (Sparse Attention) architecture, breaking the "impossible triangle" of speed, cost, and intelligence in AI [1][17][22] Model Performance - DeepSeek V3.2 has demonstrated superior performance in multiple benchmarks compared to other models, including GPT-5 and Gemini-3.0 [1][21] - The model's scores in key competitions include: - AIME 2025: 96.0 (DeepSeek-V3.2-Speciale) vs. 95.0 (Gemini-3.0) - HMMT Feb 2025: 99.2 (DeepSeek-V3.2-Speciale) vs. 97.5 (Gemini-3.0) [21] Model Features - DeepSeek V3.2 is the first model to integrate thinking directly into tool usage, allowing it to operate in both thinking and non-thinking modes [6][9] - The V3.2-Speciale version is designed specifically for reasoning tasks and is currently available only via API [2][4] Technological Advancements - The DSA architecture allows for a significant reduction in computational complexity, enabling the model to process large documents efficiently [16][20] - This technology has led to a remarkable increase in processing speed and a reduction in operational costs, making advanced AI capabilities more accessible [17][20] Training and Development - DeepSeek V3.2 underwent extensive training in a virtual environment, utilizing over 1,800 simulated operating systems and generating 85,000 complex instructions to enhance its problem-solving skills [13][14] - The model's evolution from the experimental version (V3.2-Exp) to the official release showcases improvements in agent capabilities and context management [8][11]
现货白银价格再创新高;DeepSeek发布两个正式版模型|盘前情报
2 1 Shi Ji Jing Ji Bao Dao· 2025-12-02 00:38
Market Overview - On December 1, the A-share market experienced a rebound, with the Shanghai Composite Index returning above 3900 points, and both the Shenzhen Composite Index and the ChiNext Index rising over 1% [2] - The Shanghai Composite Index closed at 3914.01, up 0.65%, while the Shenzhen Composite Index closed at 13146.72, up 1.25%, and the ChiNext Index closed at 3092.5, up 1.31% [3] - The total trading volume in the Shanghai and Shenzhen markets reached 1.87 trillion yuan, an increase of 288.1 billion yuan compared to the previous trading day [2] Sector Performance - The consumer electronics sector saw a collective surge, while the commercial aerospace concept continued to show strength [2] - The non-ferrous metals sector was active, and the photolithography concept experienced a rapid increase [2] - Conversely, the battery sector faced a pullback after an initial rise [2] International Market Trends - Major U.S. stock indices fell on December 1, with the Dow Jones Industrial Average down 427.09 points (0.90%), the S&P 500 down 36.46 points (0.53%), and the Nasdaq Composite down 89.76 points (0.38%) [4] - European stock indices also declined, with the FTSE 100 down 17.98 points (0.18%), the CAC 40 down 25.71 points (0.32%), and the DAX down 247.35 points (1.04%) [4] - International oil prices rose, with WTI crude oil up $0.77 to $59.32 per barrel (1.32%) and Brent crude oil up $0.79 to $63.17 per barrel (1.27%) [4] Commodity Prices - Spot silver rose 2.85% to $57.987 per ounce, continuing to set historical highs [5] - COMEX gold futures increased by 0.24% to $4265 per ounce, while COMEX silver futures rose 2.25% to $58.45 per ounce [5] Policy and Economic Developments - Jiangsu Changzhou announced a new housing assistance policy to support low-income groups in purchasing new homes, providing up to 20,000 yuan in subsidies for new homes and 18,000 yuan for existing homes [9] - The State Post Bureau reported that China's express delivery volume surpassed 1.8 billion packages as of November 30, marking a historical high [10] - The Ministry of Industry and Information Technology encouraged Chinese companies in solar, wind, lithium batteries, and electric vehicles to expand internationally and invest in green energy projects [11] Institutional Insights - Zhongyuan Securities highlighted the positive outlook for AI and domestic self-controlled sectors, driven by advancements in chip technology and AI applications [13] - Datong Securities noted the favorable conditions in the paper industry, including price increases and cost reductions, enhancing profitability [13] - Aijian Securities suggested that successful launches of reusable rockets could significantly reduce satellite launch costs, benefiting the domestic low-orbit satellite industry [13] Focused Announcements - Tsinghua Unigroup plans to acquire 51% stakes in Beitelai and Shanghai Tongtu for 3.21 billion yuan and 3.57 billion yuan, respectively [14] - Wolong New Energy is investing 8.04 billion yuan in a 200,000 kW/1.2 million kWh energy storage demonstration project [14] - Top Group is planning to issue H-shares and list on the Hong Kong Stock Exchange [14] Fund Flow Analysis - The telecommunications equipment sector saw a net inflow of 4.19%, with ZTE Corporation being the top stock [15] - The semiconductor sector also experienced a net inflow of 2.77%, led by兆易创新 [15] - In contrast, the photovoltaic equipment sector faced a net outflow of 6.59%, with阳光电源 being the most affected stock [15] Individual Stock Movements - ZTE Corporation saw a significant net inflow of 38.06 billion yuan, with a price increase of 10% [16] - 兆易创新 experienced a net inflow of 11.04 billion yuan, with a price increase of 4.84% [16] - 阳光电源 faced a net outflow of 14.01 billion yuan, with a price decrease of 1.83% [16]
DeepSeek最强开源Agent模型炸场;我国首艘火箭网系回收海上平台近日成功交付;字节跳动发布豆包手机助手技术预览版——《投资早参》
Mei Ri Jing Ji Xin Wen· 2025-12-02 00:38
(二)行业掘金 每经记者|杨建 每经编辑|彭水萍 (一)重要市场新闻 1、美股三大指数集体收跌,道指跌0.89%,纳指跌0.38%,标普500指数跌0.53%,热门科技股多数下 跌,博通跌超4%,谷歌、微软跌超1%,英伟达、苹果涨超1%;加密货币、太阳能板块跌幅居前, Sunrun跌超8%,Bit Digital跌超5%,Coinbase跌超4%。中概股多数上涨,纳斯达克中国金龙指数涨 0.87%,网易涨约5%,阿里巴巴涨超4%,微博涨逾3%,蔚来跌超5%,金山云跌超4%,贝壳跌逾3%。 2、加密货币价格再度大幅走低,比特币盘中一度下跌8%至83824美元,自10月初以来累计跌幅近 30%。国际金价走高,截至发稿时,现货黄金涨0.38%,报4239.15美元/盎司;COMEX黄金期货涨 0.41%,报4272.5美元/盎司。国际油价走高,美油主力合约涨1.57%,报59.47美元/桶;布伦特原油主力 合约涨1.39%,报63.26美元/桶。欧洲三大股指收盘全线下跌,德国DAX指数跌1.04%报23589.44点,法 国CAC40指数跌0.32%报8097点,英国富时100指数跌0.18%报9702.53点。 ...
A股盘前播报 | DeepSeek发布两款新模型 新版本强化Agent能力
智通财经网· 2025-12-02 00:38
Industry Developments - DeepSeek has released the V3.2 series models, enhancing agent capabilities with reasoning abilities on par with GPT-5, integrating thinking modes with tool invocation for everyday applications [1] - In the electric vehicle sector, Leap Motor achieved a delivery volume of 70,327 units in November, marking a year-on-year increase of over 75%, while NIO saw a 76.3% increase in deliveries [3] Market Trends - Silver prices have reached a new high of $58.8 per ounce, with a year-to-date increase exceeding 100%, driven by supply tightness and speculative pressures [2] - Gold prices have also risen, reaching a six-week high of $4,264 per ounce [2] Macroeconomic Insights - The Minister of Finance, Liu Fuan, emphasized the need to increase residents' income through various channels and to boost consumption as part of a proactive fiscal policy [4] Investment Insights - Analysts suggest that the market is currently experiencing frequent style shifts, with a focus on sectors like artificial intelligence and new energy for potential growth in the coming year [7][8] - Morgan Stanley predicts that Google's large-scale sales of TPU chips will significantly increase production forecasts, benefiting related hardware suppliers [9]
ChatGPT 三周年遭 DeepSeek 暴击,23 页技术报告藏着开源登顶的全部秘密
3 6 Ke· 2025-12-02 00:16
Core Insights - DeepSeek has launched two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which aim to enhance AI's reasoning capabilities and tool usage, rivaling models like GPT-5 and Gemini-3.0-Pro [1][5][11]. Model Features - DeepSeek-V3.2 focuses on cost-effectiveness and everyday use, achieving reasoning capabilities comparable to GPT-5, while DeepSeek-V3.2-Speciale targets high-performance tasks, matching Gemini-3.0-Pro [5][11]. - Both models utilize a new sparse attention mechanism (DSA) to improve processing speed and efficiency, particularly for long documents, by focusing only on relevant parts of the text [4][7]. Training Innovations - DeepSeek has invested over 10% of its pre-training budget into post-training resources, enhancing model stability and scalability through a robust reinforcement learning framework [8][10]. - The training process includes "expert distillation" to create specialized models in various domains, which are then used to generate training data for the final model [10][11]. Performance Metrics - In benchmark tests, DeepSeek-V3.2 has shown competitive performance with GPT-5 and Kimi-K2-Thinking across multiple metrics, while the Speciale version has outperformed Gemini-3.0-Pro in specific tasks [20][22][24]. - The models have achieved notable results in prestigious competitions, with Speciale ranking 2nd in ICPC and 10th in IOI, demonstrating high-level reasoning and problem-solving capabilities [25][26]. Self-Training Mechanism - DeepSeek has developed a self-training pipeline with over 18,000 tasks, allowing AI to autonomously generate, validate, and improve its own training data, enhancing its reasoning abilities [17][19]. - This approach shifts the paradigm from human-led training to AI-driven self-improvement, fostering a new level of model evolution [19][32]. Future Directions - Despite the advancements, DeepSeek acknowledges that V3.2 still has gaps compared to top proprietary models, particularly in knowledge coverage and token efficiency, indicating plans for future enhancements [30][32].
DeepSeek更新线上模型,大幅缩小与闭源模型差距
Xuan Gu Bao· 2025-12-01 23:20
Group 1 - DeepSeek launched the official version of DeepSeek V3.2, enhancing agent capabilities and integrating reasoning, achieving the highest level among current open-source models, significantly narrowing the gap with closed-source models [1] - The V3.2-Speciale model version won gold medals at several prestigious competitions, including IMO 2025, CMO 2025, ICPC World Finals 2025, and IOI 2025 [1] - The AI industry is expected to continue improving, with companies like DeepSeek and Doubao likely to achieve rapid iterations, driven by the ongoing technological wave [1][2] Group 2 - Major companies with full-stack AI capabilities have advantages in different application scenarios, and the demand for computing power in the AI industry remains strong, indicating a vast market space that is still expanding [2] - DreamNet Technology integrated DeepSeek's capabilities into its Tianhui Zhihui platform for generating rich media content and enhancing internal operations [3] - Hangzhou Steel's subsidiary successfully adapted and deployed DeepSeek-R1, achieving deployment of all distilled models with 70B parameters and below [3]
DeepSeek又上新!模型硬刚谷歌 承认开源与闭源差距拉大
Di Yi Cai Jing· 2025-12-01 23:13
Core Insights - DeepSeek has launched two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which are positioned to compete with leading proprietary models like GPT-5 and Gemini 3.0, showcasing significant advancements in reasoning capabilities [1][4]. Model Overview - DeepSeek-V3.2 aims to balance reasoning ability and output length, making it suitable for everyday applications such as Q&A and general intelligence tasks. It has achieved performance levels comparable to GPT-5 and is slightly below Google's Gemini 3 Pro in public reasoning tests [4]. - DeepSeek-V3.2-Speciale is designed to push the limits of reasoning capabilities, integrating enhanced long-thinking features and theorem-proving abilities from DeepSeek-Math-V2. It has surpassed Gemini 3 Pro in several reasoning benchmarks, including prestigious math competitions [4][5]. Benchmark Performance - In various benchmarks, DeepSeek models have shown competitive results: - AIME 2025: DeepSeek-V3.2 scored 93.1, while GPT-5 and Gemini-3.0 scored 94.6 and 95.0 respectively [5]. - Harvard MIT Math Competition: DeepSeek-V3.2-Speciale scored 92.5, outperforming Gemini 3 Pro's 97.5 [5]. - International Math Olympiad: DeepSeek-V3.2-Speciale scored 78.3, close to Gemini 3 Pro's 83.3 [5]. Limitations and Future Plans - Despite these achievements, DeepSeek acknowledges limitations compared to proprietary models, including narrower world knowledge and lower token efficiency. The team plans to enhance pre-training and optimize reasoning chains to improve model performance [6][7]. - DeepSeek has identified three key areas where open-source models lag behind proprietary ones: reliance on standard attention mechanisms, insufficient computational resources during post-training, and gaps in generalization and instruction-following capabilities [7]. Technological Innovations - DeepSeek has introduced a sparse attention mechanism (DSA) to reduce computational complexity without sacrificing long-context performance. This innovation has been integrated into the new models, contributing to significant performance improvements [7]. Availability - The official website, app, and API for DeepSeek-V3.2 have been updated, while the enhanced Speciale version is currently available only through a temporary API for community evaluation [8]. Community Reception - The release has been positively received in social media, with users noting that DeepSeek's models have effectively matched the capabilities of GPT-5 and Gemini 3 Pro, highlighting the importance of rigorous engineering design over sheer parameter size [9].
DeepSeek 重大发布
Zheng Quan Shi Bao· 2025-12-01 15:04
Core Insights - DeepSeek has released two official model versions: DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, with the former available on the official website, app, and API, while the latter is currently accessible only as a temporary API for community evaluation [1][3]. Model Performance - DeepSeek-V3.2 aims to balance reasoning capability and output length, making it suitable for daily use. In benchmark tests, it achieved performance comparable to GPT-5 and slightly below Gemini-3.0-Pro, with a significant reduction in output length compared to Kimi-K2-Thinking, leading to lower computational costs and reduced user wait times [3][4]. - DeepSeek-V3.2-Speciale is designed to push the limits of reasoning capabilities, serving as an enhanced version of DeepSeek-V3.2, and incorporates theorem-proving abilities from DeepSeek-Math-V2. It performed comparably to Gemini-3.0-Pro in mainstream reasoning benchmarks and won gold medals in several prestigious competitions, including IMO 2025 and ICPC World Finals 2025, achieving second and tenth place among human competitors, respectively [3][4]. Benchmark Comparisons - In various benchmark tests, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale demonstrated competitive performance: - AIME 2025: DeepSeek-V3.2 scored 93.1, while DeepSeek-V3.2-Speciale scored 96.0 [4]. - HMMT Feb 2025: DeepSeek-V3.2 scored 92.5, and DeepSeek-V3.2-Speciale scored 99.2 [4]. - IMOAnswerBench: DeepSeek-V3.2 scored 78.3, and DeepSeek-V3.2-Speciale scored 84.5 [4]. - CodeForces: DeepSeek-V3.2 scored 2386, while DeepSeek-V3.2-Speciale scored 2701 [4]. Cost Efficiency - The introduction of DeepSeek-V3.2-Exp, based on V3.1-Terminus with a new attention mechanism (DSA), has led to significant improvements in training and reasoning efficiency, resulting in a notable reduction in model costs. This cost reduction enhances the model's cost-effectiveness and potential for broader application [4].
DeepSeek 上新
Zhong Guo Zheng Quan Bao· 2025-12-01 15:04
Core Insights - DeepSeek has released two official model versions: DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, aimed at enhancing reasoning capabilities and output length for various applications [1][4] Model Performance - DeepSeek-V3.2 achieved performance comparable to GPT-5 in public reasoning benchmarks, slightly below Gemini-3.0-Pro, while significantly reducing output length compared to Kimi-K2-Thinking, thus lowering computational costs and user wait times [1][3] - The DeepSeek-V3.2-Speciale model demonstrated exceptional instruction-following, rigorous mathematical proof, and logical validation capabilities, achieving gold medal-level results in major competitions such as IMO 2025 and ICPC World Finals 2025 [2] Benchmark Comparisons - In various benchmark tests, DeepSeek-V3.2-Speciale outperformed the standard version in complex tasks, although it required significantly more tokens, indicating higher costs [3] - Specific benchmark scores include: - AIME 2025: DeepSeek-V3.2-Speciale scored 96.0, while DeepSeek-V3.2 scored 93.1 [3] - HMMT Feb 2025: DeepSeek-V3.2-Speciale scored 99.2, compared to DeepSeek-V3.2's 92.5 [3] - IMOAnswerBench: DeepSeek-V3.2-Speciale scored 84.5, while DeepSeek-V3.2 scored 78.3 [3] Model Features - DeepSeek-V3.2 is the first model to integrate reasoning with tool usage, supporting both reasoning and non-reasoning modes for tool calls, enhancing its versatility [4] - The model has improved generalization capabilities through a large-scale agent training data synthesis method, allowing it to perform well in real-world applications [4]
DeepSeek发布最强开源新品,瞄向全能Agent,给GPT-5与Gemini 3下战书
Tai Mei Ti A P P· 2025-12-01 15:03
Core Insights - DeepSeek has launched two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, marking a significant advancement in AI capabilities, particularly in reasoning and output efficiency [2][3] - The V3.2 model is positioned as the strongest open-source large model, outperforming competitors in various benchmarks while significantly reducing output length and computational costs [3][4] - The V3.2 model integrates a new sparse attention mechanism (DSA) to enhance performance in long-context scenarios, while also improving the model's ability to follow instructions and generalize in complex environments [8][9] Model Performance - In benchmark tests, DeepSeek-V3.2 achieved competitive scores against models like GPT-5, Claude 4.5, and Gemini 3 Pro, with notable strengths in specific areas [4][5] - The V3.2 model demonstrated superior performance in question-and-answer scenarios, providing detailed and accurate travel recommendations through advanced tool usage [5][6] - The V3.2 Speciale model focuses on maximizing reasoning capabilities, achieving results comparable to Gemini 3.0 Pro in mainstream reasoning benchmarks, although it requires a higher token cost and is not designed for everyday use [9][10] Development Focus - DeepSeek emphasizes practical usability and generalization in its models, aiming to overcome common pitfalls in AI interactions, such as making basic common-sense errors [6][8] - The company is committed to enhancing the reasoning abilities of its models, as evidenced by the integration of advanced mathematical reasoning capabilities from the recently released DeepSeek-Math-V2 [9][10] - The competitive landscape for large models is intensifying, with major players like GPT-5 and Gemini 3 pushing the boundaries of AI capabilities, suggesting a dynamic future for AI development [10]