算力资源利用率

Search documents
华为芯片,究竟有多牛?(上)
2 1 Shi Ji Jing Ji Bao Dao· 2025-07-06 03:12
Core Viewpoint - Huawei's Ascend 384 Super Node has demonstrated performance that surpasses NVIDIA's products in certain aspects, indicating a significant advancement in domestic AI chip capabilities [2][3]. Group 1: Product Overview - Ascend is an AI chip developed by Huawei, specifically designed for AI tasks as an NPU, distinguishing it from traditional GPUs and CPUs [4]. - The main product, Ascend 910, has transitioned from being a backup option to a primary solution for training large models due to restrictions on high-end chips from NVIDIA and AMD [4][6]. Group 2: Performance Metrics - In recent developments, Huawei has successfully trained large models using Ascend chips, achieving a dense model with 135 billion parameters and a MoE model with 718 billion parameters [6]. - The key performance indicator, MFU (Modeling Function Utilization), reached over 50% for the dense model and 41% for the MoE model, indicating efficient utilization of computational resources [9]. Group 3: Competitive Analysis - In a direct comparison with NVIDIA's H100 and H800 during the deployment of large models, Ascend demonstrated comparable performance, achieving the best utilization rate in the competition [10]. - Although a single Ascend chip's performance is only one-third of NVIDIA's Blackwell, the 384 Super Node configuration, which utilizes five times the number of chips, results in an overall computational power that exceeds NVIDIA's GB200 [10].