Workflow
DeepSeek
icon
Search documents
OpenAI和英伟达,正在把GPU玩成“金融产品”
3 6 Ke· 2025-09-30 03:25
Core Insights - The potential investment of up to $100 billion by Nvidia in collaboration with OpenAI to build a 10 GW AI data center highlights the financialization of computing power [1] - In 2024, global generative AI financing reached $56 billion, accounting for over half of the total AI industry financing, with major companies like Microsoft and Google significantly increasing their capital expenditures [1] - The shift from traditional GPU purchasing to a rental model is emerging as a solution to the challenges faced by AI companies, allowing for more flexible financial management [2][4] Financialization of GPUs - Traditional GPU procurement involves significant upfront costs and depreciation, which has become unsustainable due to rapid technological advancements [2] - The rental model transforms GPUs into financial products that can be leased, financed, and traded, mitigating the risks associated with ownership [4][5] - Companies like CoreWeave and Lambda Labs are leading the way in GPU rental services, with CoreWeave securing $1.7 billion in funding and Lambda Labs offering hourly rental services [5] Capital Logic of Computing Power - The financialization of computing power may disrupt the AI industry more profoundly than innovations like ChatGPT, as it introduces new investment opportunities and risks [6][8] - Future developments may include the securitization of GPU rental contracts, allowing for trading in capital markets and creating a new asset class [7] - The concentration of capital, computing power, and energy resources in the U.S. is likened to an oligopoly, where larger companies can leverage financing to maintain a competitive edge [9][11] Challenges for China - China's hardware and financial systems lag behind the U.S., with export controls limiting access to advanced GPUs and a lack of a mature financial infrastructure for computing power [12] - Chinese companies are exploring algorithm optimization and efficiency improvements, but without a robust GPU rental market and credit rating system, they risk being marginalized [12] - The need for China to develop its own GPU leasing market and financial infrastructure is critical to avoid being sidelined in the global computing power landscape [12] Conclusion - The rumored collaboration between OpenAI and Nvidia signifies a shift in industry logic, where the financialization of GPUs could accelerate AI development while potentially exacerbating inequalities in access to computing resources [13][14]
DeepSeek发新模型;库克确认持有加密货币丨科技风向标
Group 1: Technology Developments - DeepSeek has released version V3.2-Exp, significantly reducing API prices by over 50% due to lower service costs [2] - Huawei's CEO confirmed that the company has adapted to the new DeepSeek model and is supporting it with their Ascend architecture [13] - The latest AI model Qwen3-Omni from Alibaba has topped the global open-source model rankings, showcasing advanced multimodal capabilities [5] Group 2: Corporate Announcements - Apple CEO Tim Cook disclosed personal investments in Bitcoin and Ethereum but clarified that Apple will not accept cryptocurrency for product purchases [3] - Saisys has completed the payment for acquiring a 10% stake in Shenzhen Yiwang Intelligent Technology from Huawei, totaling RMB 11.5 billion [4] - Xiaomi's public relations manager stated that there are no plans to reduce orders for the Xiaomi 17 series, indicating an increase in overall product orders [9] Group 3: Financial Performance - Saisys announced a cash dividend of RMB 3.10 per 10 shares, totaling RMB 506 million, representing 17.22% of its net profit for the first half of the year [14] - Yinglian Co. expects a significant increase in net profit for the first three quarters of 2025, projecting a growth of 1531.13% to 1672.97% [15] Group 4: Infrastructure and Standards - CATL plans to invest in the construction of 100 battery swap stations in Hainan by 2030 to enhance green transportation [10] - A new national standard for cross-border personal information security management will be implemented in March 2026, aimed at regulating data processing activities [11]
DeepSeek突然拥抱国产GPU语言,TileLang对标CUDA替代Triton,华为昇腾Day0官宣支持适配
3 6 Ke· 2025-09-30 02:52
Core Insights - DeepSeek v3.2 introduces a significant change by adopting TileLang, a domain-specific language for GPU kernel development, which has garnered substantial attention in the tech community [1][4][6] - TileLang is noted for its performance, allowing developers to implement attention mechanisms faster than existing solutions, with claims of achieving a 30% speed increase over Flash Attention 2 [3][5] Group 1: TileLang Overview - TileLang is designed to simplify the development of high-performance GPU/CPU kernels, comparable to NVIDIA's CUDA, and is recommended by DeepSeek for experiments due to its debugging and rapid iteration advantages [4][13] - The language is built on a Python-like syntax and operates on top of the TVM compiler infrastructure, enabling developers to focus on productivity without sacrificing performance [13] - TileLang features three programming interfaces catering to different developer skill levels, from high-level abstractions for beginners to low-level controls for performance experts [15] Group 2: DeepSeek's Adoption of TileLang - DeepSeek's collaboration with TileLang was first highlighted at the Beijing Zhiyuan Conference in June, where a report indicated that TileLang's operator implementation could be faster [6][19] - The DeepSeek team has utilized TileLang for rapid prototype development, subsequently optimizing performance with lower-level methods [17][23] - Following the release of DeepSeek v3.2, TileLang's capabilities were validated, demonstrating its effectiveness in model training [23]
DeepSeek和智谱都将于近日发布新模型,或将迎来重大突破
Sou Hu Cai Jing· 2025-09-30 02:00
Group 1 - DeepSeek announced the upload of its new model DeepSeek-V3.2 to the HuggingFace community platform on September 29 [2] - Zhiyu's new model GLM-4.6 is expected to be released soon, with some users already able to access it via API [2] - Both DeepSeek and Zhiyu, leading companies in the large model sector in China, are poised for significant advancements [2] Group 2 - The DeepSeek-V3.1 model, released in August, features a hybrid inference architecture that supports both thinking and non-thinking modes [2] - DeepSeek-V3.1-Think offers improved thinking efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [2] - The new model has enhanced agent capabilities through post-training optimization, showing significant improvements in tool usage and agent tasks [2] Group 3 - Zhiyu's flagship model GLM-4.5, launched in July, integrates reasoning, coding, and agent capabilities within a single model to meet complex application needs [2] - In August, Zhiyu introduced the GLM-4.5V, a top-performing open-source visual reasoning model with 100 billion parameters (total parameters 106 billion, active parameters 12 billion) [2]
AI概念股多数走高 DeepSeek新模型成本下降超50% 机构看好AI应用商业化拐点临近
Zhi Tong Cai Jing· 2025-09-30 01:52
华泰证券曾表示,模型降价将吸引更多的开发者开发AI应用,或进一步提振算力需求,提升Super App 出现概率。中银国际认为,AI应用商业化拐点临近。在算力层,推理效率与性价比大幅提升,国产芯 片加速替代;在模型层,通用大模型的能力已逐步达到商用标准;在数据层,行业专属数据的积累与合 成数据技术成熟之下,企业加速实现数据闭环训练与模型微调。三者共同推动AI能力从"单点突破"走 向"体系协同",为AI应用大规模商业化落地创造条件。 AI概念股早盘多数走高,截至发稿,汇量科技(01860)涨4.47%,报19.88港元;迈富时(02556)涨4.33%, 报51.35港元;创新奇智(02121)涨3.65%,报7.95港元;第四范式(06682)涨3.15%,报65.5港元;美图公 司(01357)涨3.26%,报9.16港元。 消息面上,DeepSeek昨日宣布,官方App、网页端、小程序均已同步更新为DeepSeek-V3.2-Exp。 DeepSeek介绍,得益于新模型服务成本的大幅降低,官方API价格也相应下调,新价格即刻生效。在新 的价格政策下,开发者调用DeepSeek API的成本将降低50%以上。 ...
港股异动 | AI概念股多数走高 DeepSeek新模型成本下降超50% 机构看好AI应用商业化拐点临近
Zhi Tong Cai Jing· 2025-09-30 01:52
Group 1 - AI concept stocks saw a majority increase in early trading, with notable gains from companies such as 汇量科技 (4.47% increase), 迈富时 (4.33% increase), and 创新奇智 (3.65% increase) [1] - DeepSeek announced a significant update to its services, reducing the cost of its API by over 50% due to a new model that lowers service costs [1] - The National Development and Reform Commission (NDRC) plans to support various enterprises, including private companies, to deeply engage in AI initiatives [1] Group 2 - Huatai Securities indicated that the reduction in model prices will attract more developers to create AI applications, potentially boosting demand for computing power and increasing the likelihood of Super Apps [2] - Zhongyin International believes that the commercialization inflection point for AI applications is approaching, driven by improvements in reasoning efficiency and cost-effectiveness of domestic chips [2] - The combination of advancements in model capabilities, data accumulation, and synthetic data technology is facilitating a shift from "single-point breakthroughs" to "systematic collaboration" in AI capabilities, paving the way for large-scale commercialization [2]
国产AI重磅!DeepSeek-V3.2发布!寒武纪、昇腾均已适配!国产芯片深度协同有望受益
Xin Lang Ji Jin· 2025-09-30 01:30
Group 1 - DeepSeek officially released the DeepSeek-V3.2-Exp model, which incorporates a sparse Attention architecture to reduce computational resource consumption and enhance inference efficiency [1] - The new pricing policy allows developers to access the DeepSeek API at a cost reduction of over 50% [1] - Cambricon announced it has adapted to DeepSeek-V3.2-Exp and open-sourced the vLLM-MLU inference engine, indicating deep collaboration among leading companies in China's AI industry [1] Group 2 - Analysts believe that with continuous investment in computing infrastructure, domestic computing power is expected to maintain good growth momentum, potentially surpassing overseas computing power in the medium term [2] - The Sci-Tech Innovation Artificial Intelligence ETF (589520) focuses on the domestic AI industry chain, with Cambricon accounting for 16.62% of its weight as of September 29 [2] Group 3 - The top ten holdings of the Sci-Tech Innovation Artificial Intelligence ETF include Cambricon, Lattice Semiconductor, Chipone Technology, Kingsoft Office, Stone Technology, Amlogic, Hengxuan Technology, Yuntian Lifei, Fudan Microelectronics, and Espressif Systems [3] Group 4 - The Sci-Tech Innovation Artificial Intelligence ETF highlights three key points: 1. Policy support is igniting AI growth, with AI expected to lead the current market trend [4] 2. Domestic substitution and self-control are crucial in the context of technological friction, emphasizing the importance of AI as a core technology [4] 3. The ETF offers high elasticity and strong offensive potential, with over 70% of its top ten holdings concentrated in the semiconductor sector [4]
DeepSeek新模型开源,新架构亮了,国产AI芯片集体狂欢
3 6 Ke· 2025-09-30 01:15
Core Insights - DeepSeek has announced the open-source release of the DeepSeek-V3.2-Exp experimental model, which introduces the DeepSeek Sparse Attention mechanism, significantly improving long text training and inference efficiency without compromising output quality [1][9] - The new model reduces service costs by over 50%, with the price for outputting 1 million tokens dropping to 3 yuan, which is one-fourth of the previous model's cost [3][5] - Major cloud platforms and AI chip manufacturers have quickly adapted to the new model, indicating strong industry support and interest [5][10] Model Performance - DeepSeek-V3.2-Exp shows comparable performance to its predecessor, DeepSeek-V3.1-Terminus, across various benchmarks, although it uses significantly fewer tokens for task completion [5][6] - In specific benchmarks, DeepSeek-V3.2-Exp maintained scores such as 85.0 in MMLU-Pro and improved in BrowseComp with an accuracy of 40.1, while some scores like Humanity's Last Exam saw a slight decline [6][39] - The model's architecture allows for a reduction in complexity from quadratic to near-linear, enhancing training and inference costs [36][42] Technical Innovations - The model employs a "continue pre-training + post-training" approach, integrating a Lightning Indexer and a fine-grained token selection mechanism to optimize performance [36][38] - The DSA mechanism is still in its prototype phase, indicating potential for further development and refinement [36][44] - DeepSeek has also released related technical reports and code to facilitate research and experimentation [7][9] Industry Impact - The rapid adaptation of DeepSeek-V3.2-Exp by companies like Huawei and Cambrian demonstrates the model's significance in the AI landscape [10][15][17] - The model's launch has sparked discussions in developer communities, highlighting its potential to mark a significant moment in AI development [21][22] - User feedback indicates a mix of excitement and skepticism regarding the model's performance, with some noting improvements in speed but concerns over capability [19][31][32]
DeepSeek 开源 TileLang 与 CUDA 算子:AI 底层国产替代的关键尝试
小熊跑的快· 2025-09-30 01:11
Core Viewpoint - DeepSeek's release of TileLang and CUDA operator versions represents a significant step towards achieving "independence and control" in AI foundational technology, particularly in the GPU operator development field, addressing issues of technical autonomy, domestic hardware compatibility, ecological collaboration, and innovation efficiency [2][11]. Group 1: Breaking CUDA Monopoly - The dominance of CUDA, a closed-source platform led by NVIDIA, poses risks of technological dependency for domestic developers, limiting their ability to customize operators for new model research [2][3]. - Domestic GPUs, despite improving in computational power, face high migration costs due to the lack of compatible operator libraries and development tools with CUDA [3][5]. Group 2: Lowering Barriers for Domestic Hardware - DeepSeek's open-source solution, TileLang, allows developers to quickly validate operator logic without relying on CUDA, thus reducing dependency on NVIDIA [4][6]. - The dual-version approach provides a precision baseline for domestic platforms, facilitating the verification of operator implementations and lowering debugging costs [4][6]. Group 3: Activating Open Source Community Collaboration - The success of domestic alternatives relies on ecological collaboration, where DeepSeek's open-source initiative encourages community participation in developing new operators [7][8]. - Researchers can quickly develop and share new operator prototypes using TileLang, which can then be adapted by domestic hardware manufacturers [8]. Group 4: Accelerating Domestic Research Pathways - The reliance on CUDA and its tools can hinder innovation in cutting-edge fields like large models and multi-modal research, creating an "optimization black box" [9][10]. - DeepSeek's dual-version operators provide a pathway for domestic teams to innovate without the constraints of CUDA compatibility and licensing issues [10][11]. Group 5: From Single Point Replacement to Ecological Breakthrough - DeepSeek's actions signify a shift from passive following to active construction in the domestic AI foundational technology stack, addressing the challenges of high barriers, long cycles, and adaptation difficulties in GPU operator development [11]. - The approach of using open-source to break monopolies, abstracting complexities, and fostering collaboration may become a crucial paradigm for domestic alternatives in the AI foundational technology sector [11].
9月30日国际晨讯 | 现货黄金价格升破3840美元再创新高 美国关键经济数据或延迟发布
Sou Hu Cai Jing· 2025-09-30 01:09
Market Review - Spot gold prices have surpassed $3840 per ounce, reaching a new high [6] - On September 29, US major stock indices experienced slight gains, with the Dow Jones up 0.15% at 46316.07 points, S&P 500 up 0.26% at 6661.21 points, and Nasdaq up 0.48% at 22591.15 points [6] - European stock indices also saw minor increases, with Germany's DAX up 0.02% at 23745.06 points, France's CAC40 up 0.13% at 7880.87 points, and the UK's FTSE 100 up 0.16% at 9299.84 points [6] International Macro - US President Trump met with congressional leaders on September 29 to discuss avoiding a government shutdown, with significant disagreements noted by Senate Democratic leader Chuck Schumer [7] - The US federal government is set to run out of funding by midnight on September 30, risking a government shutdown if no agreement is reached on funding legislation [7] - The Bureau of Labor Statistics (BLS) has announced that it will halt data collection and not release planned reports, including the monthly non-farm payroll report, if funding is interrupted [7] Corporate News - On September 29, DeepSeek-V3.2-Exp model was officially released and open-sourced on the Hugging Face platform, introducing a sparse attention mechanism for improved efficiency in training and inference of long texts [8] - OpenAI plans to release the new Sora 2 video generator as a standalone application, which may generate videos containing copyrighted content unless rights holders opt out [8] Institutional Views - Goldman Sachs analysts predict that global stock markets are likely to continue rising until the end of the year, supported by strong US economic performance and favorable stock valuations [9] - The team led by Christian Mueller-Glissmann has upgraded the global stock market allocation rating to "overweight" for the next three months, suggesting that stocks typically perform well in the later stages of economic slowdown with strong policy support [9]