华为发布AI新技术

Core Insights - Huawei officially launched the AI container technology Flex:ai, aimed at addressing the low utilization of computing resources in the AI industry, which is facing a surge in demand for computational power [1] Group 1: Technology Innovations - The Flex:ai technology includes XPU pooling and scheduling software built on the Kubernetes platform, enabling precise management and intelligent scheduling of GPU and NPU resources to significantly enhance computing resource utilization [1] - The XPU pooling framework allows a single GPU or NPU card to be divided into multiple virtual computing units, improving average utilization by 30% in scenarios where one card runs one task [2] - The cross-node virtualization technology aggregates idle XPU resources from various nodes into a shared computing pool, allowing general servers to forward AI workloads to remote GPU/NPU cards, thus integrating general and intelligent computing resources [2] Group 2: Intelligent Scheduling - The Hi Scheduler, developed in collaboration with Xi'an Jiaotong University, enables optimal scheduling of heterogeneous computing resources by automatically sensing cluster loads and resource states, ensuring stable operation of AI workloads even under fluctuating loads [3] - The comprehensive open-sourcing of Flex:ai will provide access to core technological capabilities for developers across academia and industry, promoting the establishment of standardized solutions for efficient computing resource utilization in the global AI industry [3]