Workflow
超节点架构创新,开源开放共筑全场景算力底座
中国能源报·2025-09-18 09:10

Core Viewpoint - Huawei has introduced an innovative super node architecture aimed at building a robust all-scenario computing foundation, emphasizing open-source and hardware openness to foster industry collaboration and innovation [1][10]. Group 1: Super Node Architecture - The super node architecture, based on the Lingqu interconnection protocol, allows multiple physical machines to be deeply interconnected, enabling them to function as a single logical unit for learning, reasoning, and efficient computing [1][2]. - This architecture addresses the challenges of traditional server stacking, which can lead to lower computing efficiency and frequent interruptions during training as cluster sizes increase [1][2]. Group 2: Product Launches - Huawei has launched several new products based on the super node architecture, including the Atlas 950 SuperPoD, Atlas 850 and Atlas 860 AI servers, Atlas 350 AI accelerator cards, and the TaiShan 950 SuperPoD [4][5]. - The Atlas 950 SuperPoD is designed for large-scale AI computing tasks, featuring innovations such as zero-cable electrical interconnection and enhanced liquid cooling reliability [4]. - The Atlas 850 is the first enterprise-grade air-cooled AI super node server, capable of forming a super node cluster with up to 128 units and 1024 cards [4]. Group 3: Performance Enhancements - The Atlas 350 accelerator card, utilizing the Ascend 950PR chip, offers a 2x increase in vector computing power and a 2.5x performance boost in recommendation inference scenarios [5]. - The TaiShan 950 SuperPoD features ultra-low latency of 370 nanoseconds and a bandwidth of 2.8T, significantly enhancing performance in database and virtual machine migration scenarios [5]. Group 4: Open Collaboration - Huawei is committed to open collaboration by sharing super node technology with the industry, allowing partners to develop products based on the Lingqu protocol and super node reference architecture [6]. - The company has opened its super node hardware, including NPU modules and AI accelerator cards, to facilitate incremental development by customers and partners [6]. Group 5: Software Innovation - Huawei is also focusing on software openness, with plans to open-source the Lingqu operating system components and support various open-source communities, accelerating developer innovation [9]. - The Ascend CANN and Mind series components will be open-sourced, prioritizing support for popular frameworks like PyTorch, enhancing flexibility for developers [9].