昇腾NPU
Search documents
多款国产芯片Day0支持智谱GLM-5
Guan Cha Zhe Wang· 2026-02-12 02:05
Core Insights - Zhipu AI has launched and open-sourced GLM-5, with multiple domestic chip manufacturers completing Day-0 adaptation for the model, ensuring compatibility and stable operation from the first day of release [1][2] Group 1: GLM-5 Launch and Adaptation - Zhipu AI's GLM-5 has been adapted by major domestic chip platforms including Huawei Ascend, Moore Threads, Cambricon, Kunlun, Muxi, Suiruan, and Haiguang, achieving high throughput and low latency on domestic computing clusters [2] - Haiguang's DCU team collaborated closely with Zhipu AI to optimize the underlying operators and hardware acceleration, enabling GLM-5 to run stably on Haiguang DCU with high throughput and low latency [1] - Moore Threads completed full-process adaptation and verification on its flagship AI training and inference GPU MTT S5000, leveraging its MUSA architecture to enhance model inference performance while reducing memory usage [1] Group 2: Model Performance and Features - GLM-5 has expanded its parameter scale from 355 billion (activated 32 billion) to 744 billion (activated 40 billion) and increased pre-training data from 23 terabytes to 28.5 terabytes, significantly enhancing its general intelligence capabilities [2] - The model incorporates a new "Slime" asynchronous reinforcement learning framework, supporting larger model scales and more complex tasks, and utilizes DeepSeek Sparse Attention to maintain long text performance while reducing deployment costs and improving token efficiency [2] - A month prior, Zhipu AI released the GLM-Image model, which employs a hybrid architecture of autoregressive and diffusion decoders, marking a significant exploration in cognitive generative technology [2]
多款国产芯片宣布Day0支持智谱GLM-5
Guan Cha Zhe Wang· 2026-02-12 01:59
不仅如此,摩尔线程还首次公布了MTT S5000的算力参数。该卡基于第四代MUSA架构"平湖"打造。其 单卡AI算力最高可达1000 TFLOPS,配备80GB显存,显存带宽达到1.6TB/s,卡间互联带宽为 784GB/s,完整支持从FP8到FP64的全精度计算。 随后有消息传出,目前智谱GLM-5已完成与华为昇腾、摩尔线程、寒武纪、昆仑芯、沐曦、燧原、海 光等主流国产芯片平台的深度推理适配与算子级优化,能够在国产算力集群上实现高吞吐、低延迟的稳 定运行。 2月11日,智谱AI上线并开源GLM-5,随后多款国产芯片完成对GLM-5的Day0适配。 海光信息方面表示,在本次Day0适配合作中,海光DCU团队深度协同智谱AI,凭借DTK等自研软件栈 优势,重点优化底层算子与硬件加速,使GLM-5在海光DCU上实现高吞吐、低延迟的稳定运行,充分 发挥"国产算力+国产大模型"价值。 摩尔线程表示,基于SGLang推理框架,该公司在旗舰级AI训推一体全功能GPU MTT S5000上,Day-0 完成了全流程适配与验证。凭借MUSA架构算子覆盖与生态兼容能力,摩尔线程打通了模型推理全链 路,并深度释放MTT S500 ...
中国企业一年投3.9万亿夯实硬实力 华为“压强式”研发七大领域实现突破
Chang Jiang Shang Bao· 2026-02-08 23:55
这一年,中国科技成果丰硕,创新生态日趋完善。华为持续"压强式"研发,在算力、鸿蒙生态、通信网 络、终端市场等七大领域实现硬核突破;寒武纪10年攻关实现AI芯片国产化突破;商业航天竞速,6家 企业集体冲击IPO…… 这一年,科技创新与产业融合加速。阿里巴巴3800亿元投向未来,通义千问串起阿里AI生态;全球代 工巨头工业富联(601138)蜕变为万亿算力巨头;恒瑞医药(600276)从仿制转向创新,I类新药频频 借船出海…… 科技突破、产业竞争力的提升,来自对创新的投入。根据国家统计局披露的数据,2025年,中国企业研 发经费达3.93万亿元,稳居世界第二,研发投入强度2.8%,首次超越OECD国家平均水平。 AI时代奔涌而来,创新浪潮澎湃。 2025年,是中国科技在风浪中磨砺韧性的一年,也是于变局中开创新局的关键一年。 传统产业融合焕发新动能 新兴产业不断突围前行,传统产业融合新兴科技,释放出充满活力的新质生产力。 在AI浪潮之下,"代工之王"工业富联实现关键突围,深度绑定全球科技巨头苹果、英伟达,实现了 向"全球AI算力基建供应商"的蜕变。根据公开信息,工业富联AI服务器全球代工市占率超35%,800G 高 ...
在白山黑水之间,书写振兴东北的数智新篇
Huan Qiu Wang Zi Xun· 2025-12-18 07:55
Core Viewpoint - Heilongjiang is leveraging its unique resources and strategic partnerships, particularly with Huawei, to drive digital transformation and innovation in artificial intelligence and computing power integration, positioning itself as a key player in China's digital economy [1][3]. Group 1: Digital Transformation as a Core Economic Engine - Heilongjiang has established AI innovation and computing power integration as the core engine for economic development during the 14th Five-Year Plan, driven by its inherent "seeking change" ethos [1]. - The province's confidence is rooted in its rich resource endowment, including a concentration of computing power demand and abundant electricity resources, making it an ideal hub for a new type of intelligent computing center [1]. - Heilongjiang boasts a strong educational foundation with institutions like Harbin Institute of Technology and Harbin Engineering University, providing a continuous supply of talent for AI and computing industries [1]. Group 2: Challenges and Shortcomings - Compared to coastal eastern regions, Heilongjiang's digital economy is relatively underdeveloped, with a weak industrial digitalization foundation and an existing "digital divide" [2]. - Traditional enterprises, especially SMEs, face significant challenges in transformation due to high technical barriers, substantial capital investment, and a lack of talent [2]. - There is a pressing need for continuous improvement in digital infrastructure, including computing power scheduling efficiency and data element circulation [2]. Group 3: Partnership with Huawei - Huawei has committed to deeply engage with Heilongjiang, integrating its technological expertise in computing, communication, energy, cloud, and AI to support the province's digital economy and high-quality urban development [3]. - The collaboration aims to provide a comprehensive suite of innovative technologies and extensive industrial ecosystem resources, tailored to Heilongjiang's unique advantages and strategic needs [3]. - Huawei's proposed solutions include stable green electricity for computing power through collaborative technologies and the establishment of resource pools for various industry applications [3]. Group 4: Industry Ecosystem Development - Huawei is facilitating Heilongjiang's digital transformation by establishing collaborative platforms, such as joint innovation labs with China Railway Harbin Bureau Group, focusing on standardizing solutions for the railway sector [4]. - The establishment of the "AI Empowering Digital Development Industry Alliance" signifies a new cooperative paradigm aimed at bridging the gap between technology research and industrial application [5]. - Huawei's initiatives are permeating various sectors, enhancing infrastructure intelligence and building a sustainable talent pipeline for long-term development [5]. Group 5: Practical Applications of Digital Transformation - The "Longzheng Zhishu" government model exemplifies successful digital governance, integrating over 400 billion pieces of data and significantly improving efficiency in local government operations [6]. - The logistics company Dada Zhiyun has transformed its operations into a comprehensive supply chain service provider by leveraging Huawei's technology to create an integrated logistics platform [6]. - The establishment of a modern industrial college in collaboration with Huawei is revolutionizing IT talent training, providing practical experience and skills aligned with industry needs [7]. Group 6: Conclusion - The partnership between Huawei and Heilongjiang represents a deep integration of technological empowerment and regional advantages, driving the revitalization of the Northeast and contributing to high-quality development in the digital economy [7].
宏观策略周报:2025世界人工智能大会描绘AI新未来,7月制造业PMI为49.3%-20250801
Yuan Da Xin Xi· 2025-08-01 10:47
Group 1 - The report highlights the significance of the 2025 World Artificial Intelligence Conference (WAIC) held in Shanghai, showcasing advancements in AI technology and global collaboration opportunities, which are expected to drive industrial intelligence upgrades [2][11][12] - In June, the profits of industrial enterprises above designated size decreased by 1.8% year-on-year, with a notable recovery in the equipment manufacturing sector, indicating the positive impact of the "two new" policies [14][17] - The manufacturing PMI for July was reported at 49.3%, reflecting a slight decline of 0.4 percentage points from the previous month, indicating a cooling in manufacturing activity, although the production index remains above the critical point, suggesting overall expansion in production activities [19][22] Group 2 - The investment strategy emphasizes the development of new productive forces as a key policy direction, suggesting a focus on sectors such as artificial intelligence, innovative pharmaceuticals, robotics, and deep-sea technology for potential excess returns [3][33] - There is a strong recommendation to boost domestic consumption, with expectations for consumer spending to increase, particularly in new consumption, home appliances, and automotive sectors [3][33] - The report suggests that gold may see sustained demand as a safe-haven asset amid rising geopolitical tensions and global economic uncertainties, indicating a long-term investment opportunity in gold [3][33]
突发,午后跳水!超4200只个股下跌,周期股跌麻了!一则重磅消息,这个板块逆市拉升...
雪球· 2025-07-31 08:25
Market Overview - The market experienced a significant decline, with the Shanghai Composite Index dropping by 1.18%, the Shenzhen Component Index falling by 1.73%, and the ChiNext Index decreasing by 1.66% [1] - The trading volume in the Shanghai and Shenzhen markets reached approximately 19.36 trillion yuan, an increase of about 91.7 billion yuan compared to the previous trading day, with over 4,200 stocks declining [2] Sector Performance - Cyclical stocks, including coal, steel, oil, and non-ferrous metals, led the market decline, with the steel sector falling over 3% and both non-ferrous metals and coal sectors dropping more than 2% [5] - Notable individual stock declines included Angang Steel and Baosteel, which fell over 7%, while Yunnan Zinc and Northern Rare Earth dropped more than 5% [5][6] Futures Market - Multiple previously popular futures contracts saw significant declines, with glass and coking coal main contracts dropping by 8%, polysilicon falling over 7%, and industrial silicon and lithium carbonate decreasing by over 6% and nearly 5%, respectively [7] - The Dalian Commodity Exchange has adjusted trading limits for certain contracts to maintain market stability, including a reduction in daily opening positions for industrial silicon, polysilicon, and lithium carbonate [8] Domestic Semiconductor Sector - Following a significant meeting with Nvidia regarding security risks associated with its H20 computing chip, domestic semiconductor stocks surged, with companies like Dongxin Co. and Cambrian Technologies seeing substantial gains [10] - Domestic GPU companies are accelerating their development, with firms like Moer Technology and Muxi Integrated Circuit announcing IPO plans to raise funds for GPU research and market expansion [12] Infant and Child Industry - The infant and child sector continued to show strength, with companies like Sunshine Dairy and Anzheng Fashion achieving three consecutive trading limits, while several others saw notable increases [14] - The Chinese government has allocated approximately 90 billion yuan for child-rearing subsidies, and Beijing has introduced measures to enhance support for childbirth, including establishing a subsidy system and improving maternity insurance [16]
直线飙涨!刚刚,重磅突发
Zheng Quan Shi Bao Wang· 2025-07-31 06:23
Core Insights - The National Internet Information Office of China has summoned NVIDIA regarding security risks associated with its H20 chip sold in China, requiring explanations and proof of safety measures [1][2] - Following this news, domestic alternatives in the semiconductor sector saw significant stock price increases, with companies like Cambrian Technology rising over 7% [1][5] - NVIDIA's H20 chip, designed specifically for the Chinese market, was previously banned by the U.S. government due to national security concerns, leading to substantial financial losses for NVIDIA [4] Company Developments - Cambrian Technology's stock surged, reflecting investor confidence in domestic alternatives following NVIDIA's security issues [1][5] - Domestic GPU companies, including Moer Technology and Muxi Integrated Circuit, are accelerating their development and have filed for IPOs to raise funds for GPU research and market expansion [7] - Huawei announced the launch of its CloudMatrix384AI supernode, significantly enhancing its AI computing capabilities, which may impact the competitive landscape in the GPU market [8] Market Trends - The domestic GPU market is witnessing a rapid development pace, with several companies preparing for IPOs to alleviate research and development pressures [7] - The rise of domestic alternatives is expected to increase market share for local GPU manufacturers, driven by technological advancements and funding from IPOs [7] - The overall sentiment in the semiconductor sector is shifting towards domestic solutions, as evidenced by the stock performance of companies like Cambrian Technology and others [1][5]
直线飙涨!刚刚,重磅突发!
券商中国· 2025-07-31 05:59
Core Viewpoint - The article discusses the recent developments regarding Nvidia's H20 chip and its implications for the Chinese market, highlighting the security concerns raised by the Chinese government and the subsequent rise of domestic alternatives in the GPU sector [1][2][6]. Group 1: Nvidia's H20 Chip and Security Concerns - The Chinese government has summoned Nvidia to explain the security risks associated with the H20 chip, citing laws related to cybersecurity and data protection [1][2]. - The H20 chip, designed specifically for the Chinese market, has faced scrutiny due to potential backdoor vulnerabilities, leading to increased calls for advanced chips to include tracking and remote shutdown features [2][5]. Group 2: Market Reactions and Domestic Alternatives - Following the news of Nvidia's security issues, domestic companies like Cambrian Technology saw significant stock price increases, with Cambrian rising over 7% [1][6]. - Several domestic GPU companies, including Moore Threads and Muxi Integrated Circuit, are accelerating their development and have recently filed for IPOs to raise funds for GPU research and market expansion [6]. - Moore Threads aims to raise 8 billion yuan for its GPU development, indicating strong market potential and a commitment to enhancing domestic capabilities in AI and graphics processing [6]. Group 3: Technological Advancements - Huawei has announced the CloudMatrix384AI super node, significantly enhancing its computing power from 6.4 pFLOPS to 300 pFLOPS, marking a 50-fold increase [7]. - The new architecture supports advanced AI model inference, improving throughput and reducing latency, showcasing the rapid advancements in domestic technology [7].
华为首个开源大模型来了!Pro MoE 720亿参数,4000颗昇腾训练
Hua Er Jie Jian Wen· 2025-06-30 07:27
Core Insights - Huawei has announced the open-sourcing of its Pangu models, including the 70 billion parameter dense model and the 720 billion parameter mixture of experts (MoE) model, marking a significant step in the domestic large model open-source competition [1][3][20] Model Performance - The Pangu Pro MoE model achieves a single-card inference throughput of 1148 tokens/s on the Ascend 800I A2, which can be further enhanced to 1528 tokens/s using speculative acceleration technology, outperforming similar-sized dense models [3][11] - The Pangu Pro MoE model is built on the MoGE architecture, with a total parameter count of 720 billion and an active parameter count of 160 billion, optimized specifically for Ascend hardware [4][11] Training and Evaluation - Huawei utilized 4000 Ascend NPUs for pre-training on a high-quality corpus of 13 trillion tokens, divided into general, inference, and annealing phases to progressively enhance model capabilities [11] - The Pangu Pro MoE model has demonstrated superior performance in various benchmarks, including achieving a score of 91.2 in the DROP benchmark, closely matching the best current models [12][14] Competitive Landscape - The open-sourcing of Pangu models coincides with a wave of domestic AI model releases, with leading companies like MiniMax and Alibaba also upgrading their open-source models, leading to a price reduction of 60%-80% for large models [3][20] - The Pangu Pro MoE model ranks fifth in the SuperCLUE Chinese large model benchmark, surpassing several existing models and indicating its competitive position in the market [17][18] Technological Integration - Huawei's ecosystem, integrating chips (Ascend NPU), frameworks (MindSpore), and models (Pangu), represents a significant technological achievement, providing a viable high-performance alternative to Nvidia's dominance in the industry [20]
训练大模型,终于可以“既要又要还要”了
虎嗅APP· 2025-05-29 10:34
Core Insights - The article discusses the advancements in the MoE (Mixture of Experts) model architecture, particularly focusing on Huawei's Pangu Ultra MoE, which aims to balance model performance and efficiency while addressing challenges in training large-scale models [1][6][33] Group 1: MoE Model Innovations - Huawei's Pangu Ultra MoE model features a parameter scale of 718 billion, designed to optimize the performance and efficiency of large-scale MoE architectures [6][9] - The model incorporates advanced architectures such as MLA (Multi-head Latent Attention) and MTP (Multi-token Prediction), enhancing its training and inference capabilities [6][7] - The Depth-Scaled Sandwich-Norm (DSSN) and TinyInit methods are introduced to improve training stability, reducing gradient spikes by 51% and enabling long-term stable training with over 10 trillion tokens [11][12][14] Group 2: Load Balancing and Efficiency - The EP (Expert Parallelism) group load balancing method is designed to ensure efficient token distribution among experts, enhancing training efficiency without compromising model specialization [19][20] - The Pangu Ultra MoE model employs an EP-Group load balancing loss that allows for flexible routing choices, promoting expert specialization while maintaining computational efficiency [20][21] Group 3: Training Techniques and Performance - The model's pre-training phase utilizes dropless training, achieving a long sequence capability of 128k, which enhances its learning efficiency on target data [8][14] - The introduction of MTP allows for speculative inference, significantly improving the acceptance length by 38% compared to single-token predictions [24][27] - The reinforcement learning system designed for post-training focuses on iterative hard example mining and multi-capability collaboration, ensuring comprehensive performance across various tasks [28][31] Group 4: Future Implications - The advancements presented in Pangu Ultra MoE provide a viable path for deploying sparse large models at scale, pushing the performance limits and engineering applicability of MoE architectures [33]