Core Viewpoint - The article discusses Huawei's innovative "super node" architecture, which aims to redefine large-scale effective computing power in AI by addressing the limitations of traditional server architectures and enhancing interconnectivity through the self-developed UnifiedBus protocol [3][4][12]. Group 1: Super Node Architecture - The super node architecture represents a deep restructuring of computing system architecture, moving from a "stacked" model to a "fused" model that allows multiple machines to function as a single device [4][9]. - This architecture aims to eliminate the communication bottlenecks inherent in traditional server setups, where data exchange between servers can lead to significant delays and inefficiencies [5][11]. - Huawei's super node can reduce communication latency to the nanosecond level, significantly improving cluster utilization and lowering communication costs, with the goal of achieving linear scalability of effective computing power [11][12]. Group 2: Product Offerings - Huawei introduced the Atlas 950 SuperPoD and Atlas 960 SuperPoD, which support 8192 and 15488 Ascend cards respectively, showcasing superior performance in key metrics such as card scale, total computing power, memory capacity, and interconnect bandwidth [17][20]. - The Atlas 850, an enterprise-grade air-cooled AI super node server, lowers the barrier for enterprises to adopt super node architecture without requiring complex liquid cooling modifications [21]. - The TaiShan 950 SuperPoD extends the super node architecture to general computing, offering ultra-low latency and memory pooling capabilities beneficial for databases and big data applications [25]. Group 3: Ecosystem Strategy - Huawei emphasizes an ecosystem strategy of "hardware openness and software open-source," encouraging industry partners to engage in secondary development and enrich product offerings based on the UnifiedBus protocol [26][28]. - The company aims to build a unified, scalable computing foundation that provides a consistent, high-performance computing experience across various environments, from cloud to enterprise [28].
华为超节点:用「一台机器」的逻辑,驱动AI万卡集群
机器之心·2025-09-19 13:23