Workflow
昇腾NPU
icon
Search documents
多款国产芯片Day0支持智谱GLM-5
Guan Cha Zhe Wang· 2026-02-12 02:05
Core Insights - Zhipu AI has launched and open-sourced GLM-5, with multiple domestic chip manufacturers completing Day-0 adaptation for the model, ensuring compatibility and stable operation from the first day of release [1][2] Group 1: GLM-5 Launch and Adaptation - Zhipu AI's GLM-5 has been adapted by major domestic chip platforms including Huawei Ascend, Moore Threads, Cambricon, Kunlun, Muxi, Suiruan, and Haiguang, achieving high throughput and low latency on domestic computing clusters [2] - Haiguang's DCU team collaborated closely with Zhipu AI to optimize the underlying operators and hardware acceleration, enabling GLM-5 to run stably on Haiguang DCU with high throughput and low latency [1] - Moore Threads completed full-process adaptation and verification on its flagship AI training and inference GPU MTT S5000, leveraging its MUSA architecture to enhance model inference performance while reducing memory usage [1] Group 2: Model Performance and Features - GLM-5 has expanded its parameter scale from 355 billion (activated 32 billion) to 744 billion (activated 40 billion) and increased pre-training data from 23 terabytes to 28.5 terabytes, significantly enhancing its general intelligence capabilities [2] - The model incorporates a new "Slime" asynchronous reinforcement learning framework, supporting larger model scales and more complex tasks, and utilizes DeepSeek Sparse Attention to maintain long text performance while reducing deployment costs and improving token efficiency [2] - A month prior, Zhipu AI released the GLM-Image model, which employs a hybrid architecture of autoregressive and diffusion decoders, marking a significant exploration in cognitive generative technology [2]
多款国产芯片宣布Day0支持智谱GLM-5
Guan Cha Zhe Wang· 2026-02-12 01:59
Core Insights - Zhipu AI has launched and open-sourced GLM-5, with multiple domestic chips completing Day0 adaptation for optimized performance [1][2] - GLM-5 has achieved state-of-the-art (SOTA) performance in coding and agent capabilities, with significant improvements in parameter scale and pre-training data [2] Group 1: GLM-5 Launch and Adaptation - Zhipu AI's GLM-5 has been adapted for high throughput and low latency on domestic chip platforms including Huawei Ascend, Moore Threads, and others [2] - The collaboration with Haiguang Information has optimized the underlying operators and hardware acceleration for stable operation on Haiguang DCU [1] - Moore Threads has completed full-process adaptation and verification on its MTT S5000 GPU, achieving high-performance inference while reducing memory usage [1] Group 2: Technical Enhancements and Performance Metrics - GLM-5's parameter scale has expanded from 355 billion (with 32 billion activated) to 744 billion (with 40 billion activated), and pre-training data has increased from 23 terabytes to 28.5 terabytes [2] - The introduction of the "Slime" asynchronous reinforcement learning framework supports larger model scales and complex tasks, enhancing learning from long-term interactions [2] - The integration of DeepSeek Sparse Attention mechanism significantly reduces deployment costs while maintaining long text performance [2] Group 3: Support from Huawei - Huawei's Ascend NPU and MindSpore AI framework provide comprehensive support from data to training, contributing to the development of SOTA models based on self-innovated computing power [3]
中国企业一年投3.9万亿夯实硬实力 华为“压强式”研发七大领域实现突破
Chang Jiang Shang Bao· 2026-02-08 23:55
Core Insights - The year 2025 is pivotal for China's technology sector, showcasing significant advancements and a robust innovation ecosystem [2] - Major companies like Huawei, Alibaba, and Industrial Fulian are leading breakthroughs in AI, cloud computing, and traditional industry integration [3][5][6] Group 1: Technological Advancements - Huawei achieved breakthroughs in seven key areas including computing power, HarmonyOS, and digital energy, with a smartphone market share of 17% in 2025 [3][4] - The AI chip industry saw significant progress with companies like Cambricon achieving domestic breakthroughs and expected profitability [4] - China's innovation index ranks among the top ten globally, with over 5 million effective invention patents and leading PCT international patent applications for six consecutive years [2] Group 2: Industry Integration and Growth - Industrial Fulian transformed into a global AI computing infrastructure supplier, holding over 35% market share in AI server manufacturing and projecting a net profit increase of 51% to 54% in 2025 [5] - Alibaba is investing 380 billion yuan over three years to enhance its AI and cloud infrastructure, aiming to create a comprehensive AI ecosystem [6] - The pharmaceutical sector is witnessing a shift with companies like Heng Rui Pharmaceutical increasing the proportion of innovative drug revenue to over 60% [6] Group 3: Investment and R&D - Chinese enterprises are projected to invest 3.93 trillion yuan in R&D in 2025, surpassing the OECD average for the first time [2][6] - Huawei's R&D investment reached 96.95 billion yuan in the first half of 2025, contributing to a total exceeding 1 trillion yuan over six and a half years [6]
在白山黑水之间,书写振兴东北的数智新篇
Huan Qiu Wang Zi Xun· 2025-12-18 07:55
Core Viewpoint - Heilongjiang is leveraging its unique resources and strategic partnerships, particularly with Huawei, to drive digital transformation and innovation in artificial intelligence and computing power integration, positioning itself as a key player in China's digital economy [1][3]. Group 1: Digital Transformation as a Core Economic Engine - Heilongjiang has established AI innovation and computing power integration as the core engine for economic development during the 14th Five-Year Plan, driven by its inherent "seeking change" ethos [1]. - The province's confidence is rooted in its rich resource endowment, including a concentration of computing power demand and abundant electricity resources, making it an ideal hub for a new type of intelligent computing center [1]. - Heilongjiang boasts a strong educational foundation with institutions like Harbin Institute of Technology and Harbin Engineering University, providing a continuous supply of talent for AI and computing industries [1]. Group 2: Challenges and Shortcomings - Compared to coastal eastern regions, Heilongjiang's digital economy is relatively underdeveloped, with a weak industrial digitalization foundation and an existing "digital divide" [2]. - Traditional enterprises, especially SMEs, face significant challenges in transformation due to high technical barriers, substantial capital investment, and a lack of talent [2]. - There is a pressing need for continuous improvement in digital infrastructure, including computing power scheduling efficiency and data element circulation [2]. Group 3: Partnership with Huawei - Huawei has committed to deeply engage with Heilongjiang, integrating its technological expertise in computing, communication, energy, cloud, and AI to support the province's digital economy and high-quality urban development [3]. - The collaboration aims to provide a comprehensive suite of innovative technologies and extensive industrial ecosystem resources, tailored to Heilongjiang's unique advantages and strategic needs [3]. - Huawei's proposed solutions include stable green electricity for computing power through collaborative technologies and the establishment of resource pools for various industry applications [3]. Group 4: Industry Ecosystem Development - Huawei is facilitating Heilongjiang's digital transformation by establishing collaborative platforms, such as joint innovation labs with China Railway Harbin Bureau Group, focusing on standardizing solutions for the railway sector [4]. - The establishment of the "AI Empowering Digital Development Industry Alliance" signifies a new cooperative paradigm aimed at bridging the gap between technology research and industrial application [5]. - Huawei's initiatives are permeating various sectors, enhancing infrastructure intelligence and building a sustainable talent pipeline for long-term development [5]. Group 5: Practical Applications of Digital Transformation - The "Longzheng Zhishu" government model exemplifies successful digital governance, integrating over 400 billion pieces of data and significantly improving efficiency in local government operations [6]. - The logistics company Dada Zhiyun has transformed its operations into a comprehensive supply chain service provider by leveraging Huawei's technology to create an integrated logistics platform [6]. - The establishment of a modern industrial college in collaboration with Huawei is revolutionizing IT talent training, providing practical experience and skills aligned with industry needs [7]. Group 6: Conclusion - The partnership between Huawei and Heilongjiang represents a deep integration of technological empowerment and regional advantages, driving the revitalization of the Northeast and contributing to high-quality development in the digital economy [7].
宏观策略周报:2025世界人工智能大会描绘AI新未来,7月制造业PMI为49.3%-20250801
Yuan Da Xin Xi· 2025-08-01 10:47
Group 1 - The report highlights the significance of the 2025 World Artificial Intelligence Conference (WAIC) held in Shanghai, showcasing advancements in AI technology and global collaboration opportunities, which are expected to drive industrial intelligence upgrades [2][11][12] - In June, the profits of industrial enterprises above designated size decreased by 1.8% year-on-year, with a notable recovery in the equipment manufacturing sector, indicating the positive impact of the "two new" policies [14][17] - The manufacturing PMI for July was reported at 49.3%, reflecting a slight decline of 0.4 percentage points from the previous month, indicating a cooling in manufacturing activity, although the production index remains above the critical point, suggesting overall expansion in production activities [19][22] Group 2 - The investment strategy emphasizes the development of new productive forces as a key policy direction, suggesting a focus on sectors such as artificial intelligence, innovative pharmaceuticals, robotics, and deep-sea technology for potential excess returns [3][33] - There is a strong recommendation to boost domestic consumption, with expectations for consumer spending to increase, particularly in new consumption, home appliances, and automotive sectors [3][33] - The report suggests that gold may see sustained demand as a safe-haven asset amid rising geopolitical tensions and global economic uncertainties, indicating a long-term investment opportunity in gold [3][33]
突发,午后跳水!超4200只个股下跌,周期股跌麻了!一则重磅消息,这个板块逆市拉升...
雪球· 2025-07-31 08:25
Market Overview - The market experienced a significant decline, with the Shanghai Composite Index dropping by 1.18%, the Shenzhen Component Index falling by 1.73%, and the ChiNext Index decreasing by 1.66% [1] - The trading volume in the Shanghai and Shenzhen markets reached approximately 19.36 trillion yuan, an increase of about 91.7 billion yuan compared to the previous trading day, with over 4,200 stocks declining [2] Sector Performance - Cyclical stocks, including coal, steel, oil, and non-ferrous metals, led the market decline, with the steel sector falling over 3% and both non-ferrous metals and coal sectors dropping more than 2% [5] - Notable individual stock declines included Angang Steel and Baosteel, which fell over 7%, while Yunnan Zinc and Northern Rare Earth dropped more than 5% [5][6] Futures Market - Multiple previously popular futures contracts saw significant declines, with glass and coking coal main contracts dropping by 8%, polysilicon falling over 7%, and industrial silicon and lithium carbonate decreasing by over 6% and nearly 5%, respectively [7] - The Dalian Commodity Exchange has adjusted trading limits for certain contracts to maintain market stability, including a reduction in daily opening positions for industrial silicon, polysilicon, and lithium carbonate [8] Domestic Semiconductor Sector - Following a significant meeting with Nvidia regarding security risks associated with its H20 computing chip, domestic semiconductor stocks surged, with companies like Dongxin Co. and Cambrian Technologies seeing substantial gains [10] - Domestic GPU companies are accelerating their development, with firms like Moer Technology and Muxi Integrated Circuit announcing IPO plans to raise funds for GPU research and market expansion [12] Infant and Child Industry - The infant and child sector continued to show strength, with companies like Sunshine Dairy and Anzheng Fashion achieving three consecutive trading limits, while several others saw notable increases [14] - The Chinese government has allocated approximately 90 billion yuan for child-rearing subsidies, and Beijing has introduced measures to enhance support for childbirth, including establishing a subsidy system and improving maternity insurance [16]
直线飙涨!刚刚,重磅突发
Core Insights - The National Internet Information Office of China has summoned NVIDIA regarding security risks associated with its H20 chip sold in China, requiring explanations and proof of safety measures [1][2] - Following this news, domestic alternatives in the semiconductor sector saw significant stock price increases, with companies like Cambrian Technology rising over 7% [1][5] - NVIDIA's H20 chip, designed specifically for the Chinese market, was previously banned by the U.S. government due to national security concerns, leading to substantial financial losses for NVIDIA [4] Company Developments - Cambrian Technology's stock surged, reflecting investor confidence in domestic alternatives following NVIDIA's security issues [1][5] - Domestic GPU companies, including Moer Technology and Muxi Integrated Circuit, are accelerating their development and have filed for IPOs to raise funds for GPU research and market expansion [7] - Huawei announced the launch of its CloudMatrix384AI supernode, significantly enhancing its AI computing capabilities, which may impact the competitive landscape in the GPU market [8] Market Trends - The domestic GPU market is witnessing a rapid development pace, with several companies preparing for IPOs to alleviate research and development pressures [7] - The rise of domestic alternatives is expected to increase market share for local GPU manufacturers, driven by technological advancements and funding from IPOs [7] - The overall sentiment in the semiconductor sector is shifting towards domestic solutions, as evidenced by the stock performance of companies like Cambrian Technology and others [1][5]
直线飙涨!刚刚,重磅突发!
券商中国· 2025-07-31 05:59
Core Viewpoint - The article discusses the recent developments regarding Nvidia's H20 chip and its implications for the Chinese market, highlighting the security concerns raised by the Chinese government and the subsequent rise of domestic alternatives in the GPU sector [1][2][6]. Group 1: Nvidia's H20 Chip and Security Concerns - The Chinese government has summoned Nvidia to explain the security risks associated with the H20 chip, citing laws related to cybersecurity and data protection [1][2]. - The H20 chip, designed specifically for the Chinese market, has faced scrutiny due to potential backdoor vulnerabilities, leading to increased calls for advanced chips to include tracking and remote shutdown features [2][5]. Group 2: Market Reactions and Domestic Alternatives - Following the news of Nvidia's security issues, domestic companies like Cambrian Technology saw significant stock price increases, with Cambrian rising over 7% [1][6]. - Several domestic GPU companies, including Moore Threads and Muxi Integrated Circuit, are accelerating their development and have recently filed for IPOs to raise funds for GPU research and market expansion [6]. - Moore Threads aims to raise 8 billion yuan for its GPU development, indicating strong market potential and a commitment to enhancing domestic capabilities in AI and graphics processing [6]. Group 3: Technological Advancements - Huawei has announced the CloudMatrix384AI super node, significantly enhancing its computing power from 6.4 pFLOPS to 300 pFLOPS, marking a 50-fold increase [7]. - The new architecture supports advanced AI model inference, improving throughput and reducing latency, showcasing the rapid advancements in domestic technology [7].
华为首个开源大模型来了!Pro MoE 720亿参数,4000颗昇腾训练
Hua Er Jie Jian Wen· 2025-06-30 07:27
Core Insights - Huawei has announced the open-sourcing of its Pangu models, including the 70 billion parameter dense model and the 720 billion parameter mixture of experts (MoE) model, marking a significant step in the domestic large model open-source competition [1][3][20] Model Performance - The Pangu Pro MoE model achieves a single-card inference throughput of 1148 tokens/s on the Ascend 800I A2, which can be further enhanced to 1528 tokens/s using speculative acceleration technology, outperforming similar-sized dense models [3][11] - The Pangu Pro MoE model is built on the MoGE architecture, with a total parameter count of 720 billion and an active parameter count of 160 billion, optimized specifically for Ascend hardware [4][11] Training and Evaluation - Huawei utilized 4000 Ascend NPUs for pre-training on a high-quality corpus of 13 trillion tokens, divided into general, inference, and annealing phases to progressively enhance model capabilities [11] - The Pangu Pro MoE model has demonstrated superior performance in various benchmarks, including achieving a score of 91.2 in the DROP benchmark, closely matching the best current models [12][14] Competitive Landscape - The open-sourcing of Pangu models coincides with a wave of domestic AI model releases, with leading companies like MiniMax and Alibaba also upgrading their open-source models, leading to a price reduction of 60%-80% for large models [3][20] - The Pangu Pro MoE model ranks fifth in the SuperCLUE Chinese large model benchmark, surpassing several existing models and indicating its competitive position in the market [17][18] Technological Integration - Huawei's ecosystem, integrating chips (Ascend NPU), frameworks (MindSpore), and models (Pangu), represents a significant technological achievement, providing a viable high-performance alternative to Nvidia's dominance in the industry [20]
训练大模型,终于可以“既要又要还要”了
虎嗅APP· 2025-05-29 10:34
Core Insights - The article discusses the advancements in the MoE (Mixture of Experts) model architecture, particularly focusing on Huawei's Pangu Ultra MoE, which aims to balance model performance and efficiency while addressing challenges in training large-scale models [1][6][33] Group 1: MoE Model Innovations - Huawei's Pangu Ultra MoE model features a parameter scale of 718 billion, designed to optimize the performance and efficiency of large-scale MoE architectures [6][9] - The model incorporates advanced architectures such as MLA (Multi-head Latent Attention) and MTP (Multi-token Prediction), enhancing its training and inference capabilities [6][7] - The Depth-Scaled Sandwich-Norm (DSSN) and TinyInit methods are introduced to improve training stability, reducing gradient spikes by 51% and enabling long-term stable training with over 10 trillion tokens [11][12][14] Group 2: Load Balancing and Efficiency - The EP (Expert Parallelism) group load balancing method is designed to ensure efficient token distribution among experts, enhancing training efficiency without compromising model specialization [19][20] - The Pangu Ultra MoE model employs an EP-Group load balancing loss that allows for flexible routing choices, promoting expert specialization while maintaining computational efficiency [20][21] Group 3: Training Techniques and Performance - The model's pre-training phase utilizes dropless training, achieving a long sequence capability of 128k, which enhances its learning efficiency on target data [8][14] - The introduction of MTP allows for speculative inference, significantly improving the acceptance length by 38% compared to single-token predictions [24][27] - The reinforcement learning system designed for post-training focuses on iterative hard example mining and multi-capability collaboration, ensuring comprehensive performance across various tasks [28][31] Group 4: Future Implications - The advancements presented in Pangu Ultra MoE provide a viable path for deploying sparse large models at scale, pushing the performance limits and engineering applicability of MoE architectures [33]