曙光scaleX640超节点
Search documents
国产万卡超集群亮相:中国人工智能,迈入新阶段
半导体芯闻· 2025-12-25 10:20
Core Viewpoint - The first HAIC2025 conference highlighted the challenges and innovations in AI computing architecture, emphasizing the need for collaboration and system-level solutions to meet the demands of large model training [2][4][13]. Group 1: AI Computing Challenges - The rapid evolution of large model technology has created unprecedented demands on computing equipment, particularly in memory capacity, bandwidth, energy efficiency, and system stability [2]. - The slowdown of Moore's Law has made it increasingly difficult for single-node solutions to meet the computational needs of AI, necessitating a shift towards system-level engineering [4][11]. - Companies are focusing on creating tightly integrated systems that can handle the complexities of AI applications, including the need for high-speed data transfer and energy efficiency [8][12]. Group 2: Innovations and Strategies - The introduction of the "Dual-Core Strategy" by Haiguang aims to enhance AI product diversity and deepen ecosystem connections within China, focusing on customized and application-specific solutions [5][6]. - The launch of the scaleX640 super node, which features advanced cooling and power supply technologies, represents a significant advancement in AI computing infrastructure, achieving a power usage effectiveness (PUE) of 1.04 [11][12]. - The scaleX super cluster, capable of deploying over 10,240 AI accelerator cards, showcases a total computing power exceeding 5 EFlops, marking a milestone in domestic AI cluster systems [11][12]. Group 3: Future Directions - The collaboration between Haiguang and Zhongke Shuguang aims to build a robust AI ecosystem by sharing technology and creating open standards for AI software stacks, which could lead to a "Chinese version of CUDA" [13][14]. - The focus on developing high-performance, reliable networks and systems is crucial for supporting the growing demands of AI applications and ensuring long-term operational reliability [13][14]. - The ongoing efforts in the AI sector reflect a commitment to overcoming international competition and establishing a prominent position for China's AI capabilities on the global stage [14].
超节点互连技术落地 国产万卡超集群首次真机亮相
2 1 Shi Ji Jing Ji Bao Dao· 2025-12-19 13:32
Core Insights - The article discusses the emergence of high-performance computing clusters, specifically the scaleX ultra-cluster developed by Sugon, which integrates 16 scaleX640 supernodes to achieve over 5 EFlops of computing power, marking a significant advancement in domestic AI computing infrastructure [4][5]. Group 1: Ultra-Cluster Development - The scaleX ultra-cluster is the world's first single-cabinet 640-card supernode, utilizing advanced technologies such as high-density blade servers and immersion cooling, resulting in a 20-fold increase in computing density and a PUE value as low as 1.04 [1][4]. - The scaleX ultra-cluster represents a shift from traditional scattered server deployments to a more integrated and efficient computing unit, showcasing the progress of domestic computing infrastructure from conceptual designs to tangible products [1][5]. Group 2: Demand for Computing Power - As mainstream AI models transition from hundreds of billions to trillions of parameters, the demand for computing power has surged, necessitating the development of EFLOPS-level and ten-thousand-card high-performance clusters as standard configurations for large models [2][3]. - The supernode architecture is becoming a preferred choice for new ten-thousand-card clusters due to its density and performance advantages, allowing for significant optimization in computing capabilities [3]. Group 3: Networking and Scalability - The scaleX ultra-cluster employs the scaleFabric high-speed network, which utilizes the first domestic 400G-class InfiniBand RDMA network cards, achieving 400 Gb/s bandwidth and under 1 microsecond communication latency, enhancing scalability to over 100,000 cards [7]. - The architecture allows for both Scale-up (vertical expansion) and Scale-out (horizontal expansion), addressing traditional communication bottlenecks and enabling the construction of large-scale intelligent computing clusters [6]. Group 4: Challenges and Considerations - The deployment of supernodes introduces systemic challenges, including heat dissipation from numerous chips, stability issues from mixed optical and copper interconnects, and reliability concerns from long-term operation of multiple components [8]. - As the scale of intelligent computing clusters expands, key challenges include ensuring scalability, reliability, and energy efficiency, necessitating breakthroughs in power supply technology and advanced software management for sustainable operation [8].
近一个月这些上市公司被“踏破门槛”!算力芯片“双龙头”获机构组团调研,机构来访接待量居前的个股名单一览
Xin Lang Cai Jing· 2025-12-14 01:37
Group 1 - The article highlights that 11 listed companies, including Jie Rui Co., Zhongke Shuguang, and Haiguang Information, have received over 90 institutional visits in the past month, indicating strong investor interest [1] - Haiguang Information and Zhongke Shuguang both had 341 institutional visits, drawing attention due to the termination of their merger plan, yet they plan to maintain independent operations while collaborating on core areas [1][2] - Zhongke Shuguang focuses on CPU and DCU chips, establishing a leading position in domestic core chips, while Haiguang Information emphasizes integrated computing infrastructure and high-end chip design [2] Group 2 - Weichuang Electric, Fule New Materials, and Changan Automobile have also attracted significant institutional interest, with visit counts of 141, 135, and 125 respectively, all linked to their developments in robotics [1][2] - Weichuang Electric is advancing in the robotics field with a comprehensive layout, recently launching various new products including micro motors and intelligent components for mobile robots [3] - Fule New Materials has redefined TPU architecture for robotics, focusing on integrating computing capabilities into electronic skin, enhancing interaction and safety [4] Group 3 - Changan Automobile is strategically developing its robotics business with a "1+N+X" approach, focusing on humanoid robots and various applications across different scenarios [4] - The company aims to integrate the robotics industry supply chain, covering components, software services, and infrastructure, to enhance its smart mobility and automotive robotics offerings [4]
海光信息与中科曙光分道扬帆 双双回应终止重组原因
Zheng Quan Shi Bao· 2025-12-10 18:49
Core Viewpoint - The merger between Haiguang Information and Zhongke Shuguang has been terminated, allowing both companies to accelerate their independent development in their respective fields of computing power [2][4]. Group 1: Stock Price Changes - Both companies experienced significant fluctuations in their stock prices since the announcement of the merger plan, with Haiguang Information's stock rising by up to 90% and Zhongke Shuguang's stock nearly doubling [4]. - The decision to terminate the merger was influenced by the substantial changes in the secondary market stock prices, driven by various factors including domestic and international environments, overall A-share market trends, and AI industry dynamics [3][4]. Group 2: Merger Termination Explanation - The termination was announced during an investor briefing, where executives from both companies denied any inadequacy in information disclosure, stating that the decision was made based on the evolving market conditions and the complexity of the merger [6][7]. - The companies emphasized that they had conducted thorough evaluations of the merger proposal, but the market environment had changed significantly since the initial planning stages [6][7]. Group 3: Future Collaboration and Strategy - Despite the termination, both companies plan to maintain independent operations while enhancing strategic collaboration, focusing on their core areas: Haiguang Information on chip design and Zhongke Shuguang on computing infrastructure [7][8]. - The companies aim to create a dual-core structure in the domestic computing power industry, promoting healthy competition and collaboration among chip manufacturers and system integrators [8]. Group 4: Market Position and Product Development - Haiguang Information is positioned as a key player in the domestic x86 architecture chip market, with plans to expand its commercial channels and increase chip shipments, particularly in AI applications [9][10]. - Zhongke Shuguang is developing AI computing solutions that support various mainstream AI acceleration cards, emphasizing compatibility and customer-specific needs [10].
中科曙光回应英伟达合作等问题:产品采用开放架构,支持多类AI加速卡
Bei Jing Shang Bao· 2025-12-10 12:40
Core Viewpoint - Zhongke Shuguang has disclosed its investor relations activity record, addressing market concerns regarding the Supernode 640 product orders, compatibility with H200, and collaboration with NVIDIA [1] Group 1: Product Features and Market Position - The Shuguang scaleX640 product is based on advanced hardware architecture, cooling, and power supply technologies, achieving a leading level of integration globally [1] - The comprehensive performance of the Supernode is leading domestically, with its advantages expected to become more prominent as the scale of large model parameters increases and high-throughput inference cluster deployments are implemented [1] - The scaleX640 Supernode adopts an AI computing open architecture, supporting mainstream domestic and international AI accelerator cards, allowing users to choose acceleration chips as needed for a "soft and hard collaboration, ecological compatibility" AI computing solution [1]
国产大规模智算超集群真机或将亮相HAIC2025
Huan Qiu Wang· 2025-12-05 12:11
Core Insights - The leading domestic large-scale intelligent computing supercluster system is set to debut at the HAIC2025 from December 17 to 19, 2025, in Kunshan, which will provide comprehensive support for breaking through the domestic AI computing power bottleneck and further improve the AI computing power industry chain layout [1][2] Group 1: Technology and Design - The supercluster system is designed with an open collaboration core concept and will feature domestically developed high-speed interconnect network technology, which is expected to break industry records for computing power density and achieve both "scale expansion" and "performance enhancement" [1] - The system is based on the Shuguang scaleX640 super node technology, potentially reaching a computing power scale of tens of thousands of cards, placing it at the forefront of domestic intelligent computing devices [2] Group 2: Performance and Compatibility - The scaleX640 super node is the world's first single cabinet-level 640-card super node, with a "one-to-two" high-density architecture that can form a thousand-card computing unit, doubling performance and increasing density by 20 times compared to similar products [2] - In scenarios involving MoE trillion-parameter large models, performance is expected to improve by 30% to 40%, and the system is compatible with multiple brands of domestic accelerator cards, showcasing significant adaptability and cost-performance advantages [2] Group 3: Industry Impact - The potential release of the tens of thousands of card supercluster represents a milestone for domestic intelligent computing, transitioning from "single-point breakthroughs" to "clustered implementation" [2]
超节点持续演进,看好国产算力 | 投研报告
Zhong Guo Neng Yuan Wang· 2025-11-12 02:53
Core Viewpoint - The computer industry index has underperformed compared to major stock indices, indicating a challenging market environment for the sector [1][2]. Market Review - During the week of November 3 to November 7, the Shanghai Composite Index rose by 1.08%, the ChiNext Index increased by 0.65%, and the CSI 300 Index gained 0.82%. In contrast, the computer (Shenwan) index fell by 2.54%, lagging behind the Shanghai Composite by 3.62 percentage points, the ChiNext by 3.19 percentage points, and the CSI 300 by 3.36 percentage points, ranking 30th among all industries [1][2]. Weekly Insights - NVIDIA is leading the trend of supernodes, a technology architecture for building large-scale computing clusters, which integrates thousands of GPUs into a single logical unit. The latest NVLink technology has reached its fifth generation, with each GPU having 18 NVLink connections, achieving a total bandwidth of 1800GB/s, significantly surpassing PCIe Gen6 [3]. - NVIDIA's upcoming NVL72, set to be released in March 2024, will integrate 36 Grace CPUs and 72 Blackwell GPUs into a liquid-cooled cabinet, delivering a total of 720 PFLOPs for AI training and 1440 PFLOPs for inference [3]. Domestic Major Players Accelerating Supernode Layout - **Inspur**: On November 6, during the World Internet Conference, Inspur launched the world's first single-cabinet 640-card supernode, achieving a 20-fold increase in computing density [4]. - **Huawei**: In April, Huawei introduced the CloudMatrix384 supernode, capable of creating a super-large cluster with over 160,000 cards. As of September, over 300 units have been sold, primarily to government and enterprise clients [4]. - **Alibaba**: At the 2025 Cloud Computing Conference, Alibaba Cloud unveiled the Panjiu 128 supernode AI server, which enhances inference performance by 50% compared to traditional architectures [5]. - **Baidu**: Announced the launch of the Kunlun supernode at the 2025 Baidu Cloud Intelligence Conference, making supercomputing capabilities available [5]. - **ZTE**: Developed a supernode server with 64 GPUs, featuring an innovative design that reduces latency to the nanosecond level [5]. - **Inspur Information**: Released the "Yuan Nao SD200" supernode AI server aimed at trillion-parameter models [5]. Investment Recommendations - Focus on companies involved in computing power such as Cambricon, Haiguang Information, Inspur, and others [6]. - Consider AIDC-related firms like Kehua Data and Yunse Intelligent [6]. - Explore AI application companies including Kingsoft Office, iFlytek, and others [6].
研报掘金丨华创证券:维持中科曙光“强推”评级,目标价126元
Ge Long Hui· 2025-11-10 08:49
Core Insights - The company has officially launched the world's first single cabinet-level 640-card super node, scaleX640, which was showcased at the Wuzhen Internet of Things Expo [1] - The scaleX640 super node features a "one-to-two" high-density architecture, achieving ultra-high-speed bus interconnection within a single cabinet, and can form a thousand-card computing unit through dual-node combinations [1] - Compared to similar products in the industry, the scaleX640 has doubled its overall computing performance and increased cabinet computing density by 20 times [1] - In applications such as MoE trillion-parameter model training and inference, the system-level performance improves by 30%-40% compared to traditional solutions [1] - The company maintains a positive outlook on its technological leadership and product competitiveness in the intelligent computing infrastructure sector, with net profit forecasts for 2025-2027 at 2.652 billion, 2.984 billion, and 3.349 billion yuan, representing year-on-year growth of 38.8%, 12.5%, and 12.2% respectively [1] - Valuation-wise, the company is compared to server manufacturers, with a target price of approximately 126 yuan based on a 62x PE valuation for 2026, maintaining a "strong buy" rating [1]
国产超节点推陈出新,性能+生态壁垒双双攻克!
傅里叶的猫· 2025-11-09 23:53
Core Viewpoint - The year 2025 is anticipated to be a breakthrough year for domestic supernodes, with major companies like Inspur, ZTE, Huawei, Alibaba, and Sugon making significant advancements in computing cluster construction, enhancing computing power integration, density, and ecosystem compatibility [2]. Group 1: Product Developments - Huawei's Ascend 384 has set a new industry standard as the largest high-speed bus interconnected supernode, featuring 32 cards per cabinet across 12 cabinets, showcasing Huawei's comprehensive capabilities in communication and computing [2]. - Alibaba's Panjiu AL128 supernode has achieved a record of supporting 128 accelerator cards in a single cabinet, with a computing power integration level four times that of the Ascend 384, demonstrating rapid advancements in software and hardware optimization [2]. - The Sugon scaleX640 supernode is the world's first single-cabinet 640-card supernode, achieving 20 times the computing power integration of the Ascend 384, designed on an open AI computing architecture to ensure compatibility with mainstream intelligent computing ecosystems [2]. Group 2: Performance Comparison - Domestic supernodes have undergone three significant leaps, overcoming barriers in performance and ecosystem, with the scaleX640 showing core advantages over NVL72 in comprehensive performance metrics [3]. - The scaleX640 has implemented advanced immersion phase change liquid cooling technology, achieving a minimum PUE of 1.04 and providing 1.72MW of cooling capacity for high-caliber computing units, validated through over 30 days of reliability testing [3]. - Despite a gap in single-card computing power compared to NV, the engineering characteristics of computing clusters present systemic opportunities for domestic manufacturers to catch up, with ongoing innovations in integration, compatibility, and reliability [3].