Investment Rating
- The report does not explicitly state an investment rating for the AI industry or Huawei's AI initiatives.

Core Insights
- Huawei is exploring a full-stack AI competitive strategy through the integration of software and hardware, transitioning from merely catching up with state-of-the-art (SOTA) models to customizing model architectures to better leverage its self-developed Ascend hardware [6][20].
- The evolution of the Pangu model series reflects a shift from dense models to sparse architectures, addressing systemic issues in large-scale distributed systems and enhancing efficiency [6][22].
- The introduction of the CloudMatrix infrastructure supports the optimization of AI inference, enabling high throughput and low latency through a unified bus network and various operator-level optimizations [6][20].

Summary by Sections

1. Evolution of Pangu Models
- The Pangu model series began with PanGu-α, a 200 billion parameter autoregressive Chinese language model, which established a technical route based on Ascend hardware [6][8].
- PanGu-Σ, launched in 2023, marked an exploration into trillion-parameter models, introducing a sparse architecture to reduce computational costs [8][10].
- Pangu 3.0 introduced a "5+N+X" architecture, focusing on industry-specific applications and enabling rapid deployment of AI capabilities across various sectors [15][16].

2. Maximizing Ascend Hardware Efficiency
- Pangu Pro MoE and Pangu Ultra MoE are designed to maximize the efficiency of Ascend hardware, with Pangu Pro MoE addressing load imbalance through a grouped expert mixture architecture [25][26].
- Pangu Ultra MoE employs a system-level optimization strategy, utilizing simulation-driven design to enhance performance on Ascend hardware [46][47].

3. CloudMatrix Infrastructure
- CloudMatrix serves as the physical foundation for AI inference, addressing new challenges posed by large language models and enabling high-performance computing through a distributed memory pool [6][20].
- The infrastructure supports various software innovations, allowing for efficient communication and optimization of AI models [6][20].

4. Full-Stack Collaboration Strategy
- Huawei's strategy emphasizes open-source models to build an ecosystem around Ascend hardware, integrating architecture, systems, and operators for comprehensive collaboration [6][20].
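
Section 2 above attributes Pangu Pro MoE's handling of load imbalance to a grouped expert architecture. The following is a minimal sketch of that idea, assuming experts are split into equal-sized contiguous groups and each token activates the same number of experts in every group. The function names, group layout, and scores are hypothetical illustrations and are not taken from the report.

```python
from collections import Counter


def grouped_top_k(scores, num_groups, k_per_group):
    """Pick the k_per_group highest-scoring experts inside each group.

    scores: one routing score per expert for a single token.
    Assumption for this sketch: experts are split into num_groups
    contiguous groups of equal size.
    """
    if len(scores) % num_groups != 0:
        raise ValueError("number of experts must divide evenly into groups")
    group_size = len(scores) // num_groups
    chosen = []
    for g in range(num_groups):
        start = g * group_size
        members = range(start, start + group_size)
        # Rank this group's experts by score, highest first.
        ranked = sorted(members, key=lambda e: scores[e], reverse=True)
        chosen.extend(ranked[:k_per_group])
    return chosen


def group_load(token_scores, num_groups, k_per_group):
    """Count how many expert activations land in each group over a batch."""
    load = Counter()
    for scores in token_scores:
        group_size = len(scores) // num_groups
        for expert in grouped_top_k(scores, num_groups, k_per_group):
            load[expert // group_size] += 1
    return [load[g] for g in range(num_groups)]


if __name__ == "__main__":
    # Hypothetical routing scores for one token over 8 experts (4 groups of 2).
    example = [0.9, 0.8, 0.7, 0.1, 0.2, 0.05, 0.3, 0.0]
    print(grouped_top_k(example, num_groups=4, k_per_group=1))
    print(group_load([example] * 3, num_groups=4, k_per_group=1))
```

With these example scores, a plain global top-4 would pick experts 0, 1, 2, and 6 and leave the third group idle, whereas the grouped rule picks one expert from every group. If each group is placed on its own device (an assumption here), every device then receives the same number of activations per token.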
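Huawei's Pangu Large Models and the Ascend AI Computing Platform Jointly Build an Integrated Software-Hardware AI Technology System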
GUOTAI HAITONG SECURITIES·2025-08-06 13:52