Workflow
华为“算力核弹”超越英伟达的秘密

Core Viewpoint - The emergence of Huawei's Ascend CLoudMatrix 384 supernode, which surpasses NVIDIA's flagship NVL72 system by 70% in computing power, signifies a shift in the AI computing landscape from single-point breakthroughs to system-level innovations, driven by the need to overcome traditional computing limitations under U.S. sanctions [1][6][29]. Group 1: AI Computing Landscape - The AI computing race is transitioning from hardware-centric approaches to architecture redefinition, with Huawei's innovations highlighting a unique path for China's system-level advancements [1][6]. - Huang Renxun, CEO of NVIDIA, has expressed increasing anxiety regarding China's rapid advancements in AI technology, emphasizing the impossibility of halting China's progress in this field [2][5][9]. Group 2: Huawei's Technological Advancements - Huawei's Ascend CLoudMatrix 384 supernode utilizes domestic Ascend chips and achieves a total computing power of 300 PFlops, significantly exceeding NVIDIA's NVL72 system [1][6][14]. - The architecture of the Ascend CLoudMatrix 384 supernode is based on a "fully equal architecture," which enhances communication efficiency and overcomes traditional bottlenecks such as the "memory wall" and "communication wall" [1][18][20]. Group 3: Competitive Dynamics - The U.S. government's sanctions have prompted NVIDIA to incur a $5.5 billion inventory loss, while simultaneously highlighting the importance of the Chinese market for NVIDIA's future [5][6]. - Huang Renxun acknowledges that China's advancements in AI technology could lead to a significant reduction in NVIDIA's market share in China, which has dropped from 95% to 50% in recent years [9][22]. Group 4: System-Level Innovations - The Ascend CLoudMatrix 384 supernode's design allows for the integration of thousands of cards, enabling it to support larger models and enhance training efficiency [1][6][14]. - The use of optical communication technology in the Ascend CLoudMatrix 384 supernode allows for high bandwidth and low latency, which is crucial for large-scale AI model training [20][21]. Group 5: Future Implications - The successful deployment of the Ascend CLoudMatrix 384 supernode and its ability to train large models like the Pangu Ultra MoE model demonstrates the potential for domestic AI infrastructure to achieve self-sufficiency [26][29]. - The emergence of Huawei's technology provides a viable alternative to NVIDIA's offerings, potentially reshaping the competitive landscape in the AI industry [22][29].