Huawei Releases New AI Technology

Core Viewpoint
- Huawei has officially launched Flex:ai, an AI container technology aimed at the low utilization of computing resources in the AI industry, an issue critical to the industry's development [1][2].

Group 1: Technology Development
- Flex:ai is pooling and scheduling software built on the Kubernetes platform. It is designed to optimize the management and scheduling of GPU and NPU resources and to significantly improve computing resource utilization [2].
- The technology combines research from three universities and Huawei, with breakthroughs in three core areas: resource partitioning, cross-node resource aggregation, and multi-level intelligent scheduling [2][3].

Group 2: Key Innovations
- Resource partitioning divides a single GPU or NPU into multiple virtual computing units, improving average utilization by 30% in scenarios where small AI models are trained [2].
- Cross-node resource aggregation builds a "shared computing pool" from idle computing resources across nodes, so that general-purpose servers can forward AI workloads to remote GPU/NPU resources [3].
- The Hi Scheduler intelligently matches AI workloads with computing resources, keeping resource allocation optimal even under fluctuating loads [3].

Group 3: Open Source Initiative
- Flex:ai will be fully open-sourced, giving developers in academia and industry access to all of its core capabilities and fostering global innovation and standardization in heterogeneous computing virtualization [4].
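
The partitioning and pooling ideas under Group 2 can be illustrated with a toy model. The sketch below is not Flex:ai code and does not reflect its API or the Hi Scheduler's actual algorithm, which the source does not describe. The names `Accelerator`, `Workload`, and `schedule`, and the best-fit placement rule, are all assumptions made for illustration. Each card is treated as a divisible unit, every card on every node belongs to one shared pool, and a job is placed on a local card when one fits and on a remote card otherwise.

```python
from dataclasses import dataclass


@dataclass
class Accelerator:
    """One GPU or NPU in the shared pool (hypothetical model)."""
    node: str
    name: str
    free: float = 1.0  # fraction of the card not yet allocated


@dataclass
class Workload:
    """A job that requests a fraction of one card (hypothetical model)."""
    name: str
    node: str      # node where the job was submitted
    demand: float  # requested fraction of a card, in (0, 1]


def schedule(workloads, pool):
    """Place each workload on a slice of some card in the pool.

    Assumed policy: handle the largest demands first, prefer a card on
    the submitting node, then pick the tightest fit. Returns a dict that
    maps workload name to (node, card), or to None if nothing fits.
    """
    placement = {}
    for w in sorted(workloads, key=lambda w: -w.demand):
        candidates = [a for a in pool if a.free >= w.demand]
        if not candidates:
            placement[w.name] = None
            continue
        # False sorts before True, so local cards win over remote ones.
        best = min(candidates, key=lambda a: (a.node != w.node, a.free - w.demand))
        best.free -= w.demand
        placement[w.name] = (best.node, best.name)
    return placement


pool = [Accelerator("node-a", "gpu0"), Accelerator("node-b", "npu0")]
jobs = [
    Workload("train-small", "node-a", 0.5),
    Workload("cpu-job", "node-c", 0.5),  # node-c has no accelerator
    Workload("infer-1", "node-a", 0.25),
    Workload("infer-2", "node-a", 0.25),
]
for job, target in schedule(jobs, pool).items():
    print(job, "->", target)
```

In this run, `cpu-job` is submitted from a node with no accelerator and still lands on `gpu0` of `node-a`, which mirrors the idea of a general server forwarding work to a remote card. Two jobs share `gpu0`, which mirrors partitioning one card into several virtual units. A production scheduler would also need to account for memory, interconnect cost, and load changes over time, none of which this sketch models.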