DeepSeek
Search documents
港股异动丨第四范式拉升涨近8%,股价创2024年3月以来新高
Ge Long Hui· 2025-09-30 03:38
Core Viewpoint - The Hong Kong stock market's AI concept stocks are experiencing a collective surge, with Fourth Paradigm (6682.HK) rising nearly 8% to HKD 68.5, marking its second consecutive day of gains and reaching a new high since March 2024 [1] Industry Summary - DeepSeek announced the update of its official App, web version, and mini-program to DeepSeek-V3.2-Exp, significantly reducing service costs due to new model implementations, with API prices dropping by over 50% [1] Company Summary - Fourth Paradigm has launched the "Virtual VRAM" plug-in virtual memory expansion card, which converts physical memory into a dynamically scheduled memory buffer pool, allowing for elastic expansion of GPU computing resources [1] - The "Virtual VRAM" creates a high-speed data channel between memory and video memory, effectively increasing the virtual memory capacity of a single graphics card to a maximum of 256GB without major hardware changes [1]
OpenAI和英伟达,正在把GPU玩成“金融产品”
3 6 Ke· 2025-09-30 03:25
Core Insights - The potential investment of up to $100 billion by Nvidia in collaboration with OpenAI to build a 10 GW AI data center highlights the financialization of computing power [1] - In 2024, global generative AI financing reached $56 billion, accounting for over half of the total AI industry financing, with major companies like Microsoft and Google significantly increasing their capital expenditures [1] - The shift from traditional GPU purchasing to a rental model is emerging as a solution to the challenges faced by AI companies, allowing for more flexible financial management [2][4] Financialization of GPUs - Traditional GPU procurement involves significant upfront costs and depreciation, which has become unsustainable due to rapid technological advancements [2] - The rental model transforms GPUs into financial products that can be leased, financed, and traded, mitigating the risks associated with ownership [4][5] - Companies like CoreWeave and Lambda Labs are leading the way in GPU rental services, with CoreWeave securing $1.7 billion in funding and Lambda Labs offering hourly rental services [5] Capital Logic of Computing Power - The financialization of computing power may disrupt the AI industry more profoundly than innovations like ChatGPT, as it introduces new investment opportunities and risks [6][8] - Future developments may include the securitization of GPU rental contracts, allowing for trading in capital markets and creating a new asset class [7] - The concentration of capital, computing power, and energy resources in the U.S. is likened to an oligopoly, where larger companies can leverage financing to maintain a competitive edge [9][11] Challenges for China - China's hardware and financial systems lag behind the U.S., with export controls limiting access to advanced GPUs and a lack of a mature financial infrastructure for computing power [12] - Chinese companies are exploring algorithm optimization and efficiency improvements, but without a robust GPU rental market and credit rating system, they risk being marginalized [12] - The need for China to develop its own GPU leasing market and financial infrastructure is critical to avoid being sidelined in the global computing power landscape [12] Conclusion - The rumored collaboration between OpenAI and Nvidia signifies a shift in industry logic, where the financialization of GPUs could accelerate AI development while potentially exacerbating inequalities in access to computing resources [13][14]
DeepSeek发新模型;库克确认持有加密货币丨科技风向标
2 1 Shi Ji Jing Ji Bao Dao· 2025-09-30 03:07
21世纪经济报道新质生产力研究院综合报道 早上好,新的一天又开始了。在过去的24小时内,科技行业发生了哪些有意思的事情?来跟21tech一起 看看吧。 【巨头风向标】 DeepSeek-V3.2-Exp发布,API 价格大幅下调 DeepSeek宣布官方App、网页端、小程序均已同步更新为DeepSeek-V3.2-Exp,得益于新模型服务成本 的大幅降低,官方API价格也相应下调,新价格即刻生效。在新的价格政策下,开发者调用DeepSeek API的成本将降低50%以上。目前,华为云和寒武纪均表示,已完成对该模型的适配工作。有消息称, 智谱新模型GLM-4.6也将于近日发布,目前已可通过API接口调用。 苹果CEO库克确认持有比特币和以太坊等加密货币 苹果公司CEO库克近日透露,自己是一个加密数字货币投资者,并明确持有比特币和以太币。然而,库 克已给"美国第三大公司(苹果)接受用加密货币购买iPhone和Mac、或者将公司资产投入比特币"这样 的想法泼冷水。库克表示,自己对加密货币经过一番研究,得出的结论是,(个人)为了投资组合多样 化而持有加密货币是合理的,"我已经研究了好一阵子了,我认为这是有意思的"。 ...
DeepSeek突然拥抱国产GPU语言,TileLang对标CUDA替代Triton,华为昇腾Day0官宣支持适配
3 6 Ke· 2025-09-30 02:52
DeepSeek v3.2有一个新改动,在论文里完全没提,只在官方公告中出现一次,却引起墙裂关注。 开源TileLang版本算子,其受关注程度甚至超过新稀疏注意力机制DSA,从画线转发的数量就可以看出来。 海外社区也注意到DeepSeek使用了它而不是OpenAI开发的Triton语言。 有接触过的开发者感叹TileLang是一种非常优雅的语言,只需不到100行代码就能写出比Flash Attention 2原版快30%的注意力实现。 那么什么是TileLang,又为何引人瞩目? 首先,TileLang是一种专门用来开发GPU内核的领域专用语言,性能上可以对标英伟达CUDA,DeepSeek官方推荐使用此版本做实验,在方便调试和快速 迭代上有优势。 更重要的是,TileLang与国产算力生态适配,连华为昇腾都要在第一时间公告对TileLang的支持。 在几周前的华为全联接大会2025的开发者日上,TileLang团队成员董宇骐就介绍了TileLang实现FlashAttention算子开发,代码量从500+行减少至80行,并 保持了与官方版本持平的性能。 此外TileLang团队成员王磊和沐曦集成电路的高级总 ...
DeepSeek和智谱都将于近日发布新模型,或将迎来重大突破
Sou Hu Cai Jing· 2025-09-30 02:00
Group 1 - DeepSeek announced the upload of its new model DeepSeek-V3.2 to the HuggingFace community platform on September 29 [2] - Zhiyu's new model GLM-4.6 is expected to be released soon, with some users already able to access it via API [2] - Both DeepSeek and Zhiyu, leading companies in the large model sector in China, are poised for significant advancements [2] Group 2 - The DeepSeek-V3.1 model, released in August, features a hybrid inference architecture that supports both thinking and non-thinking modes [2] - DeepSeek-V3.1-Think offers improved thinking efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [2] - The new model has enhanced agent capabilities through post-training optimization, showing significant improvements in tool usage and agent tasks [2] Group 3 - Zhiyu's flagship model GLM-4.5, launched in July, integrates reasoning, coding, and agent capabilities within a single model to meet complex application needs [2] - In August, Zhiyu introduced the GLM-4.5V, a top-performing open-source visual reasoning model with 100 billion parameters (total parameters 106 billion, active parameters 12 billion) [2]
AI概念股多数走高 DeepSeek新模型成本下降超50% 机构看好AI应用商业化拐点临近
Zhi Tong Cai Jing· 2025-09-30 01:52
华泰证券曾表示,模型降价将吸引更多的开发者开发AI应用,或进一步提振算力需求,提升Super App 出现概率。中银国际认为,AI应用商业化拐点临近。在算力层,推理效率与性价比大幅提升,国产芯 片加速替代;在模型层,通用大模型的能力已逐步达到商用标准;在数据层,行业专属数据的积累与合 成数据技术成熟之下,企业加速实现数据闭环训练与模型微调。三者共同推动AI能力从"单点突破"走 向"体系协同",为AI应用大规模商业化落地创造条件。 AI概念股早盘多数走高,截至发稿,汇量科技(01860)涨4.47%,报19.88港元;迈富时(02556)涨4.33%, 报51.35港元;创新奇智(02121)涨3.65%,报7.95港元;第四范式(06682)涨3.15%,报65.5港元;美图公 司(01357)涨3.26%,报9.16港元。 消息面上,DeepSeek昨日宣布,官方App、网页端、小程序均已同步更新为DeepSeek-V3.2-Exp。 DeepSeek介绍,得益于新模型服务成本的大幅降低,官方API价格也相应下调,新价格即刻生效。在新 的价格政策下,开发者调用DeepSeek API的成本将降低50%以上。 ...
港股异动 | AI概念股多数走高 DeepSeek新模型成本下降超50% 机构看好AI应用商业化拐点临近
Zhi Tong Cai Jing· 2025-09-30 01:52
Group 1 - AI concept stocks saw a majority increase in early trading, with notable gains from companies such as 汇量科技 (4.47% increase), 迈富时 (4.33% increase), and 创新奇智 (3.65% increase) [1] - DeepSeek announced a significant update to its services, reducing the cost of its API by over 50% due to a new model that lowers service costs [1] - The National Development and Reform Commission (NDRC) plans to support various enterprises, including private companies, to deeply engage in AI initiatives [1] Group 2 - Huatai Securities indicated that the reduction in model prices will attract more developers to create AI applications, potentially boosting demand for computing power and increasing the likelihood of Super Apps [2] - Zhongyin International believes that the commercialization inflection point for AI applications is approaching, driven by improvements in reasoning efficiency and cost-effectiveness of domestic chips [2] - The combination of advancements in model capabilities, data accumulation, and synthetic data technology is facilitating a shift from "single-point breakthroughs" to "systematic collaboration" in AI capabilities, paving the way for large-scale commercialization [2]
国产AI重磅!DeepSeek-V3.2发布!寒武纪、昇腾均已适配!国产芯片深度协同有望受益
Xin Lang Ji Jin· 2025-09-30 01:30
Group 1 - DeepSeek officially released the DeepSeek-V3.2-Exp model, which incorporates a sparse Attention architecture to reduce computational resource consumption and enhance inference efficiency [1] - The new pricing policy allows developers to access the DeepSeek API at a cost reduction of over 50% [1] - Cambricon announced it has adapted to DeepSeek-V3.2-Exp and open-sourced the vLLM-MLU inference engine, indicating deep collaboration among leading companies in China's AI industry [1] Group 2 - Analysts believe that with continuous investment in computing infrastructure, domestic computing power is expected to maintain good growth momentum, potentially surpassing overseas computing power in the medium term [2] - The Sci-Tech Innovation Artificial Intelligence ETF (589520) focuses on the domestic AI industry chain, with Cambricon accounting for 16.62% of its weight as of September 29 [2] Group 3 - The top ten holdings of the Sci-Tech Innovation Artificial Intelligence ETF include Cambricon, Lattice Semiconductor, Chipone Technology, Kingsoft Office, Stone Technology, Amlogic, Hengxuan Technology, Yuntian Lifei, Fudan Microelectronics, and Espressif Systems [3] Group 4 - The Sci-Tech Innovation Artificial Intelligence ETF highlights three key points: 1. Policy support is igniting AI growth, with AI expected to lead the current market trend [4] 2. Domestic substitution and self-control are crucial in the context of technological friction, emphasizing the importance of AI as a core technology [4] 3. The ETF offers high elasticity and strong offensive potential, with over 70% of its top ten holdings concentrated in the semiconductor sector [4]
DeepSeek新模型开源,新架构亮了,国产AI芯片集体狂欢
3 6 Ke· 2025-09-30 01:15
Core Insights - DeepSeek has announced the open-source release of the DeepSeek-V3.2-Exp experimental model, which introduces the DeepSeek Sparse Attention mechanism, significantly improving long text training and inference efficiency without compromising output quality [1][9] - The new model reduces service costs by over 50%, with the price for outputting 1 million tokens dropping to 3 yuan, which is one-fourth of the previous model's cost [3][5] - Major cloud platforms and AI chip manufacturers have quickly adapted to the new model, indicating strong industry support and interest [5][10] Model Performance - DeepSeek-V3.2-Exp shows comparable performance to its predecessor, DeepSeek-V3.1-Terminus, across various benchmarks, although it uses significantly fewer tokens for task completion [5][6] - In specific benchmarks, DeepSeek-V3.2-Exp maintained scores such as 85.0 in MMLU-Pro and improved in BrowseComp with an accuracy of 40.1, while some scores like Humanity's Last Exam saw a slight decline [6][39] - The model's architecture allows for a reduction in complexity from quadratic to near-linear, enhancing training and inference costs [36][42] Technical Innovations - The model employs a "continue pre-training + post-training" approach, integrating a Lightning Indexer and a fine-grained token selection mechanism to optimize performance [36][38] - The DSA mechanism is still in its prototype phase, indicating potential for further development and refinement [36][44] - DeepSeek has also released related technical reports and code to facilitate research and experimentation [7][9] Industry Impact - The rapid adaptation of DeepSeek-V3.2-Exp by companies like Huawei and Cambrian demonstrates the model's significance in the AI landscape [10][15][17] - The model's launch has sparked discussions in developer communities, highlighting its potential to mark a significant moment in AI development [21][22] - User feedback indicates a mix of excitement and skepticism regarding the model's performance, with some noting improvements in speed but concerns over capability [19][31][32]
DeepSeek 开源 TileLang 与 CUDA 算子:AI 底层国产替代的关键尝试
小熊跑的快· 2025-09-30 01:11
Core Viewpoint - DeepSeek's release of TileLang and CUDA operator versions represents a significant step towards achieving "independence and control" in AI foundational technology, particularly in the GPU operator development field, addressing issues of technical autonomy, domestic hardware compatibility, ecological collaboration, and innovation efficiency [2][11]. Group 1: Breaking CUDA Monopoly - The dominance of CUDA, a closed-source platform led by NVIDIA, poses risks of technological dependency for domestic developers, limiting their ability to customize operators for new model research [2][3]. - Domestic GPUs, despite improving in computational power, face high migration costs due to the lack of compatible operator libraries and development tools with CUDA [3][5]. Group 2: Lowering Barriers for Domestic Hardware - DeepSeek's open-source solution, TileLang, allows developers to quickly validate operator logic without relying on CUDA, thus reducing dependency on NVIDIA [4][6]. - The dual-version approach provides a precision baseline for domestic platforms, facilitating the verification of operator implementations and lowering debugging costs [4][6]. Group 3: Activating Open Source Community Collaboration - The success of domestic alternatives relies on ecological collaboration, where DeepSeek's open-source initiative encourages community participation in developing new operators [7][8]. - Researchers can quickly develop and share new operator prototypes using TileLang, which can then be adapted by domestic hardware manufacturers [8]. Group 4: Accelerating Domestic Research Pathways - The reliance on CUDA and its tools can hinder innovation in cutting-edge fields like large models and multi-modal research, creating an "optimization black box" [9][10]. - DeepSeek's dual-version operators provide a pathway for domestic teams to innovate without the constraints of CUDA compatibility and licensing issues [10][11]. Group 5: From Single Point Replacement to Ecological Breakthrough - DeepSeek's actions signify a shift from passive following to active construction in the domestic AI foundational technology stack, addressing the challenges of high barriers, long cycles, and adaptation difficulties in GPU operator development [11]. - The approach of using open-source to break monopolies, abstracting complexities, and fostering collaboration may become a crucial paradigm for domestic alternatives in the AI foundational technology sector [11].