DeepSeek-OCR: Large-Model Technology Stands at a New Crossroads
36Kr· 2025-10-22 23:15
Core Insights
- DeepSeek has introduced "DeepSeek-OCR," a model that utilizes "Context Optical Compression," significantly enhancing the efficiency of processing textual information from images [1][2][7]
- The model demonstrates that images can serve as efficient carriers of information, challenging the traditional reliance on text-based processing [2][6]

Group 1: Image Processing Efficiency
- DeepSeek-OCR processes documents by treating text as images, compressing entire pages into a few visual tokens, achieving a tenfold efficiency increase with a 97% accuracy rate [1][2]
- Traditional methods require thousands of tokens for a lengthy article, while DeepSeek-OCR needs only about 100 visual tokens, allowing it to handle long documents without resource constraints [2][3]

Group 2: System Architecture and Functionality
- The system consists of two modules: a powerful DeepEncoder that captures page information and a lightweight text generator that converts visual tokens into readable output [3]
- The encoder combines local analysis and global understanding, reducing the initial 4096 tokens to just 256, a roughly 90% reduction compared to competitors [3][4]
- In practical tests, a single A100 GPU can process over 200,000 pages daily, with potential scalability to 33 million pages across multiple servers [3][4]

Group 3: Information Density and Model Training
- The paradox of image data being more efficient lies in its information density: images can encapsulate more data compactly than text tokens, which require extensive dimensional expansion [4][5]
- While DeepSeek-OCR proves the feasibility of visual tokens, training purely visual models remains a challenge due to the ambiguity of predicting image segments [5][9]

Group 4: Potential Impact and Applications
- If widely adopted, this technology could transform the "token economy," significantly reducing processing costs for long documents and enhancing data extraction from complex formats [6][7]
- It could also improve chatbots' long-term memory by converting old conversations into low-resolution images, simulating human memory decay while extending context without increasing token consumption [6][11]

Group 5: Conclusion
- The exploration of DeepSeek-OCR not only achieves a tenfold efficiency improvement but also redefines the boundaries of document processing, challenging existing limitations and optimizing cost structures [7][8]
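The compression and throughput figures quoted above can be checked with back-of-the-envelope arithmetic. All numbers come from the article; the function name and the implied server count are illustrative only:

```python
# Back-of-the-envelope arithmetic for the figures quoted in the summary above.

def compression_ratio(text_tokens: int, visual_tokens: int) -> float:
    """How many text tokens each visual token stands in for."""
    return text_tokens / visual_tokens

# A roughly 1000-token article represented by about 100 visual tokens:
print(compression_ratio(1000, 100))  # 10.0, the "tenfold" gain

# Throughput scaling: at 200,000 pages/day per A100, the quoted 33 million
# pages/day implies on the order of 165 GPUs (hypothetical count).
print(33_000_000 // 200_000)  # 165
```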
The AI race is heating up again! DeepSeek open-sources a new model, OpenAI launches an AI browser! The STAR Market AI ETF pulls back with the market: time to buy the dip?
Xin Lang Ji Jin· 2025-10-22 03:32
On October 20, the Chinese AI company DeepSeek announced the open-sourcing of its latest model, DeepSeek-OCR, a vision-text compression paradigm that uses a small number of visual tokens to represent content that would otherwise require a large number of text tokens, thereby reducing the computational cost of large models.

Why has DeepSeek's new open-source model drawn attention at home and abroad? Industry observers note that traditional AI models "read" text directly, whereas DeepSeek-OCR first "looks" at an image of the text and then compresses the image information of a whole document page into very few visual tokens. DeepSeek-OCR's strength is that it can compress a 1,000-character article into 100 visual tokens; at tenfold compression, recognition accuracy reaches 96.5%.

Overseas, on Tuesday local time (October 21), OpenAI launched the AI browser Atlas to compete head-on with Google Chrome; users can invoke ChatGPT directly on any webpage to summarize content, ask questions, or carry out tasks. OpenAI CEO Sam Altman said that AI represents a once-in-a-decade opportunity to rethink what a browser should be.

Dongxing Securities notes that the AI industry is currently in a phase of three-way resonance among policy, technology, and demand; combined with the top-down policy support and potential funding brought by the "AI+" initiative, domestic chip and cloud-computing leaders are gradually validating their earnings, and major platforms' Cap ...
The AI race is heating up again! DeepSeek open-sources a new model, OpenAI launches an AI browser! The STAR Market AI ETF pulls back with the market: the moment to buy the dip has arrived
Xin Lang Ji Jin· 2025-10-22 03:32
Overseas, on Tuesday local time (October 21), OpenAI launched the AI browser Atlas to compete head-on with Google Chrome; users can invoke ChatGPT directly on any webpage to summarize content, ask questions, or carry out tasks. OpenAI CEO Sam Altman said that AI represents a once-in-a-decade opportunity to rethink what a browser should be.

On the policy front, the Ministry of Industry and Information Technology is soliciting public comments on the Computing Power Standards System Construction Guide (2025 Edition), which proposes that by 2027 more than 50 standards be drafted or revised covering general fundamentals, computing facilities, computing equipment, compute-network convergence, computing interconnection, computing platforms, computing applications, computing security, and green low-carbon operation, effectively advancing the construction of the computing power standards system.

On October 20, the Chinese AI company DeepSeek announced the open-sourcing of its latest model, DeepSeek-OCR, a vision-text compression paradigm that uses a small number of visual tokens to represent content that would otherwise require a large number of text tokens, thereby reducing the computational cost of large models.

Why has DeepSeek's new open-source model drawn attention at home and abroad? Industry observers note that traditional AI models "read" text directly, whereas DeepSeek-OCR first "looks" at an image of the text and then compresses the image information of a whole document page into very few visual tokens. DeepSeek-OCR's strength is that it can compress a 1,000-character article into 1 ...
The New Model DeepSeek Open-Sourced Yesterday Is a Bit Uncanny
36Kr· 2025-10-22 01:00
Core Insights
- DeepSeek has introduced a new model called DeepSeek-OCR, which can compress text information into images, achieving a significant reduction in token usage while maintaining high accuracy [5][31][39]

Group 1: Model Capabilities
- DeepSeek-OCR can store large amounts of text as images, allowing a more efficient representation of information than traditional text-based models [9][10]
- The model demonstrates a strong compression ratio: it can use only 100 visual tokens to outperform previous models that required 256 tokens, and it achieves results with fewer than 800 visual tokens compared with over 6000 used by other models [14][31]
- DeepSeek-OCR supports various resolutions and compression modes, adapting to different document complexities, with modes ranging from Tiny to Gundam that allow dynamic adjustment based on content [17][18]

Group 2: Data Utilization
- The model can capture previously unutilized data from documents, such as graphs and images, which traditional models could not interpret effectively [24][26]
- DeepSeek-OCR can generate over 200,000 pages of training data in a day on an A100 GPU, indicating its potential to enhance training datasets for future models [29]
- By utilizing image memory, the model significantly reduces computational load, allowing more efficient processing of longer conversations without a proportional increase in resource consumption [31]

Group 3: Open Source Collaboration
- The development of DeepSeek-OCR is a collaborative effort, integrating various open-source resources, including Huawei's Wukong dataset and Meta's SAM for image feature extraction [38][39]
- The model's architecture reflects a collective achievement of the open-source community, showcasing the potential of collaborative innovation in AI development [39]
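The Tiny-to-Gundam modes mentioned above can be sketched as a lookup plus a selection heuristic. The mode table below is an assumption pieced together from figures reported for DeepSeek-OCR, and the density heuristic is invented for illustration; Gundam mode, which tiles a page dynamically, is omitted:

```python
# Sketch of the resolution/compression modes (illustrative, not an official spec).

MODES = {                 # mode -> (input side in px, visual tokens)
    "Tiny":  (512, 64),
    "Small": (640, 100),
    "Base":  (1024, 256),
    "Large": (1280, 400),
}

def pick_mode(text_density: float) -> str:
    """Toy heuristic: denser, more complex pages get a larger mode."""
    if text_density < 0.2:
        return "Tiny"
    if text_density < 0.4:
        return "Small"
    if text_density < 0.7:
        return "Base"
    return "Large"

side, tokens = MODES[pick_mode(0.5)]
print(side, tokens)  # 1024 256
```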
10x Compression, 97% Decoding Accuracy! Why DeepSeek's New Open-Source Model Has Won Attention at Home and Abroad
Xin Lang Cai Jing· 2025-10-21 23:26
Core Insights
- DeepSeek has open-sourced a new model called DeepSeek-OCR, which utilizes visual patterns for context compression, aiming to reduce the computational costs associated with large models [1][3][6]

Model Architecture
- DeepSeek-OCR consists of two main components: DeepEncoder, a visual encoder designed for high compression and high-resolution document processing, and DeepSeek3B-MoE, a lightweight language decoder [3][4]
- The DeepEncoder integrates two established visual model architectures: SAM (Segment Anything Model) for local detail processing and CLIP (Contrastive Language-Image Pre-training) for capturing global knowledge [4][6]

Performance and Capabilities
- The model demonstrates strong "deep parsing" abilities, recognizing complex visual elements such as charts and chemical formulas, which expands its applications in fields like finance, research, and education [6][7]
- Experimental results indicate that when the number of text tokens is within ten times that of visual tokens (compression ratio <10x), the model achieves 97% OCR accuracy, and it maintains around 60% accuracy even at a 20x compression ratio [6][7][8]

Industry Reception
- The model has received widespread acclaim from tech media and industry experts, with notable figures like Andrej Karpathy praising its innovative approach of using pixels as input for large language models [3][4]
- Elon Musk commented on the long-term potential of AI models primarily using photon-based inputs, indicating a shift in how data may be processed in the future [4]

Practical Applications
- DeepSeek-OCR is positioned as a highly practical model capable of generating large-scale pre-training data, with a single A100-40G GPU able to produce over 200,000 pages of training data daily [7][8]
- The model's approach allows it to compress a 1000-word article into just 100 visual tokens, showcasing its efficiency in processing and recognizing text [8]
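The DeepEncoder's token flow can be sketched shape-only: per-patch tokens from the SAM-style local stage are compressed about 16x before the CLIP-style global stage, matching the 4096-to-256 reduction cited elsewhere in this digest. The patch size and compression factor are assumptions, not confirmed specs:

```python
# Shape-only sketch of the DeepEncoder token flow (assumed 16px patches,
# assumed 16x token compression between the local and global stages).

def visual_token_count(side: int, patch: int = 16, compress: int = 16) -> int:
    """Patch tokens for a square page, then compression before global attention."""
    patch_tokens = (side // patch) ** 2
    return patch_tokens // compress

# A 1024x1024 page: 4096 patch tokens in, 256 compact visual tokens out.
print(visual_token_count(1024))  # 256
```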
DeepSeek's Ultimate Ambition: Remaking the Basic Language of Large Language Models into Images
36Kr· 2025-10-21 12:52
Core Insights
- DeepSeek has open-sourced DeepSeek-OCR, an OCR model that achieves state-of-the-art results on benchmarks like OmniDocBench [1]
- The motivation for entering the OCR field is to address the computational bottleneck of long-context processing in large language models (LLMs) [4][6]
- The paper proposes that text information can be efficiently compressed through optical 2D mapping, allowing vision-language models (VLMs) to decompress the original information from images [4][6]

Group 1: Long Context Processing
- The pursuit of longer context in LLMs has led to a competitive arms race, with token windows expanding from thousands to millions [7]
- The core limitation arises from the attention mechanism in the Transformer architecture, where computational complexity and memory usage grow quadratically with sequence length [7]
- DeepSeek-AI's engineers pose a fundamental question: can the number of tokens itself be compressed, rather than just optimizing the attention computation? [7][10]

Group 2: Visual Tokens vs. Text Tokens
- Visual tokens are the basic units of information processed by visual models, while text tokens are used by LLMs [8]
- A 1024x1024 image can be divided into 4096 visual tokens, significantly reducing the number of tokens needed compared with a text representation [9]
- The insight that visual modalities can serve as efficient compression media for text information led to the creation of DeepSeek-OCR [9]

Group 3: DeepEncoder and Compression Techniques
- DeepSeek-OCR is essentially a proof of concept for an "optical compression-decompression" system [10]
- The DeepEncoder, a key innovation, is designed to handle high-resolution inputs while producing minimal visual tokens [11][12]
- The architecture consists of three stages: a local detail processor, a compression module, and a global attention layer [14][16]

Group 4: Performance Metrics
- Experimental results show a 10.5x compression rate, with 64 visual tokens decoding 600-700 text tokens at an OCR accuracy of 96.5% [17][18]
- At a 20x compression rate, the model maintains around 60% accuracy while decoding over 1200 text tokens [17][18]
- DeepSeek-OCR outperforms existing models like GOT-OCR2.0 and MinerU2.0 in both performance and token efficiency [19][20]

Group 5: Future Vision and Memory Simulation
- The team aims to simulate human memory's forgetting mechanism, which naturally prioritizes relevant information while compressing less important details [25][27]
- The multi-resolution design of DeepSeek-OCR provides a technical foundation for managing memory in a way that mimics human cognitive processes [29][30]
- The ultimate goal is a system that balances information retention and computational efficiency, potentially leading to a new paradigm in AI memory and input systems [32][35]
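The forgetting mechanism described above can be sketched as older context being re-rendered at progressively lower resolution, so its visual-token cost decays with age instead of growing linearly. The tiers, thresholds, and token counts below are invented for illustration (the per-resolution counts echo figures quoted elsewhere in this digest):

```python
# Illustrative sketch of resolution-based memory decay (invented tiers).

def tokens_for_age(age_in_turns: int) -> int:
    """Visual tokens allotted to a remembered page, by how old it is."""
    if age_in_turns < 5:
        return 256  # recent: full 1024x1024 rendering
    if age_in_turns < 20:
        return 100  # older: 640x640
    return 64       # oldest: 512x512 thumbnail

# 30 turns of history cost far less than 30 full-resolution pages:
print(sum(tokens_for_age(a) for a in range(30)))  # 3420 vs 30 * 256 = 7680
```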
Which AI Turned $10,000 into Big Profits? DeepSeek First, GPT 5 Last
Di Yi Cai Jing· 2025-10-21 12:33
In recent days, AI communities everywhere have been flooded by a live-streamed "investment showdown." Netizens are tracking the trading performance of six large AI models in real time, with discussion even more enthusiastic than their own stock research; this is an AI investment contest waged with real money.

The "Alpha Arena" benchmark, launched by the startup Nof1, is not paper trading. To measure AI investing ability, the organizers gave each model's account $10,000 in starting capital and let the models trade cryptocurrencies autonomously in live markets. Alpha Arena livestreams the whole process: prices move in real time, real-time returns are ranked, and each model's trading rationale is visible.

Ranked by current profitability, six AI models are competing: DeepSeek chat v3.1, Claude Sonnet 4.5, Grok 4, Qwen3 Max, Gemini 2.5 pro, and GPT 5, comprising three leading overseas models and two domestic ones. The trading contest began on October 18 US Eastern time and will run for two weeks, ending on November 3.

What makes real-market trading interesting is that markets always fluctuate and are unpredictable; even the most advanced AI cannot maintain steady returns. As the organizers put it, "markets are the ultimate test of intelligence."

Four days in, there has already been some volatility. Over the first three days, first-place DeepSeek's return at one point approached 40%, with profits exceeding $4,000, but 1 ...
In Depth | DeepSeek-OCR Ignites the "Language vs. Pixels" Debate, with Karpathy and Musk Backing "Everything Ends Up as Pixels"; the Vision Camp Is on the Eve of a Breakout
Sou Hu Cai Jing· 2025-10-21 12:25
Technical core: 10x compression plus multi-resolution, an engineering path from "reading" to "looking"

DeepSeek-OCR's design philosophy is very clear: achieve extremely high information-compression efficiency through a multi-resolution visual encoding mechanism.

The model offers several resolution options: the lowest, a 512x512 image, needs only 64 tokens, while 1024x1024 corresponds to 256 tokens. For complex layouts, it combines multiple resolutions: the whole page is globally encoded with several 1024x1024 blocks, and key regions are then processed separately at a high 640x640 resolution.

The underlying logic of this approach is to render text into images first, then use a visual encoder to compress them into fewer visual tokens. The traditional pipeline is "split into characters/words -> a long string of text tokens -> feed to the LLM"; DeepSeek's pipeline is "turn a page of text into several multi-scale image tiles -> visual encoding -> a small number of visual tokens." From an engineering trade-off perspective, this yields three direct benefits:

DeepSeek also provides a coarse-to-fine multi-resolution path in engineering terms: cover the whole page at a coarser resolution, then fill in key regions at higher resolution, preserving overall structure while concentrating detail where the information density matters.

Over the past year, the large-model world has looked like a "compute Olympics": whoever has more parameters, higher benchmarks, and faster throughput wins the next round of funding and attention. But DeepSeek-OC ...
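The coarse-to-fine layout described above implies a simple token budget: global coverage with 1024x1024 blocks plus high-detail 640x640 crops for key regions. The per-tile token counts follow the figures quoted above; the example page layout itself is made up for illustration:

```python
# Rough token budget for a coarse-to-fine page layout (illustrative).

GLOBAL_BLOCK_TOKENS = 256  # one 1024x1024 block, per the quoted figures
FOCUS_CROP_TOKENS = 100    # one 640x640 high-detail crop

def page_budget(global_blocks: int, focus_crops: int) -> int:
    """Total visual tokens for a page: coarse coverage plus detail crops."""
    return global_blocks * GLOBAL_BLOCK_TOKENS + focus_crops * FOCUS_CROP_TOKENS

# A page covered by 2 global blocks, with 3 regions re-encoded in detail:
print(page_budget(2, 3))  # 812 visual tokens
```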
Which AI Turned $10,000 into Big Profits? DeepSeek First, GPT 5 Last
Di Yi Cai Jing· 2025-10-21 11:24
The AI investment battle at home and abroad has begun, and the outcome is undecided.

In recent days, AI communities everywhere have been flooded by a live-streamed "investment showdown." Netizens are tracking the trading performance of six large AI models in real time, with discussion even more enthusiastic than their own stock research; this is an AI investment contest waged with real money.

The "Alpha Arena" benchmark, launched by the startup Nof1, is not paper trading. To measure AI investing ability, the organizers gave each model's account $10,000 in starting capital and let the models trade cryptocurrencies autonomously in live markets. Alpha Arena livestreams the whole process: prices move in real time, real-time returns are ranked, and each model's trading rationale is visible.

Ranked by current profitability, six AI models are competing: DeepSeek chat v3.1, Claude Sonnet 4.5, Grok 4, Qwen3 Max, Gemini 2.5 pro, and GPT 5, comprising three leading overseas models and two domestic ones. The trading contest began on October 18 US Eastern time and will run for two weeks, ending on November 3.

What makes real-market trading interesting is that markets always fluctuate and are unpredictable; even the most advanced AI cannot maintain steady returns. As the organizers put it, "markets are the ultimate test of intelligence."

Four days in, there has already been some volatility. Over the first three days, first-place DeepSeek's return at one point ...
DeepSeek-OCR Bursts onto the Scene, Opening a New "Field of Vision" for OCR with 3B Parameters! The ChinaAMC STAR Market AI ETF (589010) Is Active in Early Trading as the AI Theme Stays Hot
Mei Ri Jing Ji Xin Wen· 2025-10-21 07:36
As of 9:47, the STAR Market AI ETF (589010) was trending upward in choppy early trading at 1.389 yuan, up 0.94% from the previous close. The ETF opened at 1.392 yuan, quickly pulled back, found support around 1.38 yuan, and staged a short-term V-shaped rebound. Trading was active: turnover reached 190 million yuan within 20 minutes of the open, showing strong market participation. Among its holdings, 26 stocks rose, more than 80% of the total, led by Willfar Information, Intsig, and Bestechnic; Cambricon and UCloud were the weakest decliners. The ETF has seen net inflows for five consecutive days, reflecting sustained interest in the STAR Market AI theme.

On the news front, the DeepSeek-AI team released the paper "DeepSeek-OCR: Contexts Optical Compression," proposing a new method of compressing long text contexts via the visual modality. The Hugging Face page shows the model has 3B parameters. According to the introduction, the open-sourced DeepSeek-OCR consists of two parts: the core enc ...